Automatic Detection of Plagiarism: Cut and paste, paraphrases and translation

Friday, September 16, 2011 - 11:30 - 12:30
Alberto Barrón Cedeño


Nowadays, text can be easily found, manipulated, combined, translated, and re-used. As a result, plagiarism, the unacknowledged re-use of text, occurs on a scale previously unseen.

This talk offers an overview of models for automatic plagiarism detection, including standard frameworks for their evaluation. Special attention is paid to models focused on translated plagiarism. The problem of detecting cut-and-paste plagiarism seems to be solved by state-of-the-art models. Nevertheless, paraphrased and, in particular, translated plagiarism are still far from being considered solved. Some avenues for future research are also proposed.


Alberto Barrón Cedeño is a PhD student in the Natural Language Engineering Lab (Universidad Politécnica de Valencia) under the supervision of Paolo Rosso. His main research interests are information extraction, plagiarism detection, and text similarity analysis.


