Ver registro no DEDALUS
Exportar registro bibliográfico

Metrics


Metrics:

Similarity-based support for text reuse in technical writing (2015)

  • Authors:
  • USP affiliated authors: MINGHIM, ROSANE - ICMC ; OLIVEIRA, MARIA CRISTINA FERREIRA DE - ICMC
  • USP Schools: ICMC; ICMC
  • DOI: 10.1145/2682571.2797068
  • Subjects: COMPUTAÇÃO GRÁFICA; PROCESSAMENTO DE IMAGENS
  • Language: Inglês
  • Imprenta:
  • Source:
  • Conference titles: ACM Symposium on Document Engineering - DocEng
  • Acesso online ao documento

    Online accessDOI or search this record in
    Informações sobre o DOI: 10.1145/2682571.2797068 (Fonte: oaDOI API)
    • Este periódico é de assinatura
    • Este artigo NÃO é de acesso aberto
    • Cor do Acesso Aberto: closed
    Versões disponíveis em Acesso Aberto do: 10.1145/2682571.2797068 (Fonte: Unpaywall API)

    Título do periódico: Proceedings of the 2015 ACM Symposium on Document Engineering - DocEng '15

    ISSN:

    • Melhor URL em Acesso Aberto:
      • Página do artigo
      • Link para o PDF
      • Evidência: oa repository (via OAI-PMH title and first author match)
      • Licença:
      • Versão: submittedVersion
      • Tipo de hospedagem: repository


    • Outras alternativas de URLs em Acesso Aberto:
        • Página do artigo
        • Link para o PDF
        • Evidência: oa repository (via OAI-PMH title and first author match)
        • Licença:
        • Versão: submittedVersion
        • Tipo de hospedagem: repository



    Exemplares físicos disponíveis nas Bibliotecas da USP
    BibliotecaCód. de barrasNúm. de chamada
    ICMC2722595-10PROD-2722595
    How to cite
    A citação é gerada automaticamente e pode não estar totalmente de acordo com as normas

    • ABNT

      SOTO, Axel J; MOHAMMAD, Abidalrahman; ALBERT, Andrew; et al. Similarity-based support for text reuse in technical writing. Anais.. New York: ACM, 2015.Disponível em: DOI: 10.1145/2682571.2797068.
    • APA

      Soto, A. J., Mohammad, A., Albert, A., Islam, A., Milios, E., Doyle, M., et al. (2015). Similarity-based support for text reuse in technical writing. In Proceedings. New York: ACM. doi:10.1145/2682571.2797068
    • NLM

      Soto AJ, Mohammad A, Albert A, Islam A, Milios E, Doyle M, Minghim R, Oliveira MCF de. Similarity-based support for text reuse in technical writing [Internet]. Proceedings. 2015 ;Available from: http://dx.doi.org/10.1145/2682571.2797068
    • Vancouver

      Soto AJ, Mohammad A, Albert A, Islam A, Milios E, Doyle M, Minghim R, Oliveira MCF de. Similarity-based support for text reuse in technical writing [Internet]. Proceedings. 2015 ;Available from: http://dx.doi.org/10.1145/2682571.2797068

    Referências citadas na obra
    Darwin information typing architecture (DITA) version 1.2 - OASIS standard http://docs.oasis-open.org/dita/v1.2/spec/DITA1.2-spec.html.
    J. Baptista. Pragmatic DITA on a budget. In Proceedings of the 26th Annual ACM International Conference on Design of Communication, pages 193--198. ACM, 2008.
    D. Bär, T. Zesch, and I. Gurevych. Text reuse detection using a composition of text similarity measures. In Proceedings of the International Conference on Computational Linguistics, volume 1, pages 167--184, 2012.
    A. Z. Broder, M. Charikar, A. M. Frieze, and M. Mitzenmacher. Min-wise independent permutations. In Proceedings of the 30th Annual ACM Symposium on Theory of Computing, pages 327--336. ACM, 1998.
    M. Büchler, A. Geßner, T. Eckart, and G. Heyer. Unsupervised detection and visualisation of textual reuse on ancient greek texts. Journal of the Chicago Colloquium on Digital Humanities and Computer Science, 1(2), 2010.
    F. Y. L. Chin and C. K. Poon. A fast algorithm for computing longest common subsequences of small alphabet size. Journal of Information Processing, 13(4):463--469, 1991.
    P. Clough and M. Stevenson. Developing a corpus of plagiarised short answers. Language Resources and Evaluation, 45(1):5--24, 2011.
    S. Duszynski, J. Knodel, and M. Becker. Analyzing the source code of multiple software variants for reuse potential. In 18th Working Conference on Reverse Engineering (WCRE) 2011, pages 303--307, Oct 2011.
    A. Gionis, P. Indyk, and R. Motwani. Similarity search in high dimensions via hashing. In Proceedings of the 25th International Conference on Very Large Data Bases, pages 518--529, 1999.
    B. Gipp and N. Meuschke. Citation pattern matching algorithms for citation-based plagiarism detection: greedy citation tiling, citation chunking and longest common citation sequence. In Proceedings of the 11th ACM Symposium on Document Engineering, pages 249--258. ACM, 2011.
    N. Harrison. The Darwin information typing architecture (DITA): Applications for globalization. In Proceedings of International Professional Communication Conference, pages 115--121. IEEE, 2005.
    M. Henzinger. Finding near-duplicate web pages: A large-scale evaluation of algorithms. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 284--291. ACM, 2006.
    J. W. Hunt and M. MacIlroy. An algorithm for differential file comparison. Bell Laboratories, 1976.
    R. W. Irving and C. Fraser. Two algorithms for the longest common subsequence of three (or more) strings. In Proceedings of the Third Annual Symposium on Combinatorial Pattern Matching, CPM '92, pages 214--229. Springer-Verlag, 1992.
    A. Islam, E. Milios, and V. Kešelj. Text similarity using google tri-grams. In Advances in Artificial Intelligence, pages 312--317. Springer, 2012.
    S. Jänicke, A. Geßner, M. Büchler, and G. Scheuermann. Visualizations for text re-use. In Proceedings of the 5th International Conference on Information Visualization Theory and Applications, pages 59--70, 2014.
    J. Leskovec, A. Rajaraman, and J. D. Ullman. Mining of massive datasets. Cambridge University Press, 2014.
    C. D. Manning, P. Raghavan, and H. Schütze. Introduction to information retrieval, volume 1. Cambridge University Press Cambridge, 2008.
    R. Mihalcea, C. Corley, and C. Strapparava. Corpus-based and knowledge-based measures of text semantic similarity. In Proceedings of the 21st National Conference on Artificial Intelligence, volume 6, pages 775--780, 2006.
    C. Paris, K. Vander Linden, M. Fischer, A. Hartley, L. Pemberton, R. Power, and D. Scott. A support tool for writing multilingual instructions. In International Joint Conference on Artificial Intelligence, volume 14, pages 1398--1404, 1995.
    M. Potthast, M. Hagen, M. Völske, and B. Stein. Crowdsourcing interaction logs to understand text reuse from the web. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, pages 1212--1221, 2013.
    A. Rockley, S. Manning, and C. Cooper. DITA 101: Fundamentals of DITA for Authors and Managers. Soc. Technical Communication, FAIRFAX, VA, USA, 2010.
    M. Sanchez-Perez, G. Sidorov, and A. Gelbukh. The winning approach to text alignment for text reuse detection at PAN 2014. Notebook for PAN at CLEF, pages 1004--1011, 2014.
    M. Slaney and M. Casey. Locality-sensitive hashing for finding nearest neighbors. Signal Processing Magazine, IEEE, 25(2):128--131, 2008.
    D. Smith, R. Cordell, and E. Dillon. Infectious texts: Modeling text reuse in nineteenth-century newspapers. In 2013 IEEE International Conference on Big Data, pages 86--94, Oct 2013.
    N. P. Vo, S. Magnolini, and O. Popescu. Paraphrase identification and semantic similarity in twitter with simple features. In The 3rd International Workshop on Natural Language Processing for Social Media, page 10, 2015.