全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Near Duplicate Document Detection Survey

Keywords: Duplicate document , near duplicate pages , near duplicate detection , Detection approaches

Full-Text   Cite this paper   Add to My Lib

Abstract:

Search engines are the major breakthrough on the web for retrieving the information. But List of retrieved documents contains a high percentage of duplicated and near document result. So there is the need to improve the performance of search results. Some of current search engine use data filtering algorithm which can eliminate duplicate and near duplicate documents to save the users’ time and effort. The identification of similar or near-duplicate pairs in a large collection is a significant problem with wide-spread applications. In this paper survey present an up-to-date review of the existing literature in duplicate and near duplicate detection in Web

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133