Hybrid system for plagiarism detection


Autoria(s): Bru, Javier R.; Martínez-Barco, Patricio; Muñoz, Rafael
Contribuinte(s)

Universidad de Alicante. Departamento de Lenguajes y Sistemas Informáticos

Procesamiento del Lenguaje y Sistemas de Información (GPLSI)

Data(s)

17/05/2012

17/05/2012

2011

Resumo

The Internet boom in recent years has increased the interest in the field of plagiarism detection. A lot of documents are published on the Net everyday and anyone can access and plagiarize them. Of course, checking all cases of plagiarism manually is an unfeasible task. Therefore, it is necessary to create new systems that are able to automatically detect cases of plagiarism produced. In this paper, we introduce a new hybrid system for plagiarism detection which combines the advantages of the two main plagiarism detection techniques. This system consists of two analysis phases: the first phase uses an intrinsic detection technique which dismisses much of the text, and the second phase employs an external detection technique to identify the plagiarized text sections. With this combination we achieve a detection system which obtains accurate results and is also faster thanks to the prefiltering of the text.

Identificador

BRU, Javier R.; MARTÍNEZ-BARCO, Patricio; MUÑOZ, Rafael. "Hybrid system for plagiarism detection". En: Proceedings of Recent Advances in Natural Language Processing : Hissar, Bulgaria, 12-14 September 2011, pp. 527-532

1313-8502

http://hdl.handle.net/10045/22489

Idioma(s)

eng

Publicador

INCOMA

Direitos

info:eu-repo/semantics/openAccess

Palavras-Chave #Plagiarism detection #Intrinsic detection #External detection #Lenguajes y Sistemas Informáticos
Tipo

info:eu-repo/semantics/conferenceObject