Polibits  2011 

Detecting Derivatives using Specific and Invariant Descriptors

Keywords: textual derivatives, detection of derivations, near-duplicates, revisions, linguistic descriptors, French corpus.


Abstract:

This paper explores the detection of derivation links between texts (variously called plagiarism, near-duplication, revision, etc.) at the document level. We evaluate textual elements that implement the ideas of specificity and invariance, both separately and in combination, as a way to characterize derivatives. To run this evaluation, we built a French press corpus based on Wikinews revisions. We obtain performance similar to that of the state-of-the-art method (n-gram overlap) while reducing the signature size and thus the processing costs. To ensure the verifiability and reproducibility of our results, we make our code and our corpus available to the community.
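For readers unfamiliar with the n-gram overlap baseline named in the abstract, the following is a minimal sketch of the general idea, not the authors' implementation. It assumes whitespace tokenization, word trigrams, and Jaccard similarity as the overlap measure; the abstract specifies none of these, and the function names `word_ngrams` and `ngram_overlap` are hypothetical.

```python
def word_ngrams(text, n=3):
    """Return the set of word n-grams of a text.

    Assumption: lowercasing and whitespace tokenization stand in for
    whatever preprocessing the paper actually uses.
    """
    tokens = text.lower().split()
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def ngram_overlap(source, candidate, n=3):
    """Jaccard overlap between the n-gram sets of two documents.

    Assumption: Jaccard is one common overlap measure; the paper may
    use a different one (e.g. containment).
    """
    a = word_ngrams(source, n)
    b = word_ngrams(candidate, n)
    if not a or not b:
        # Texts shorter than n words have no n-grams to compare.
        return 0.0
    return len(a & b) / len(a | b)


if __name__ == "__main__":
    original = "the cat sat on the mat"
    revision = "the cat sat on the rug"
    print(ngram_overlap(original, revision))
```

Here the set of all n-grams plays the role of the document signature, so its size grows with document length. This is the cost that the abstract reports reducing by using more selective descriptors.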
