OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

现代图书情报技术 2010

A Survey of Approximately Duplicate Data Cleaning Method
相似重复记录清理方法研究综述

Ye Huanzhuo,Wu Di,
叶焕倬,吴迪

Keywords: 相似重复记录,数据清洗,检测算法,清除算法

Full-Text Cite this paper Add to My Lib

Abstract:

This paper introduces the steps, frameworks and metrics of approximately duplicate data cleaning. Then, the detect algorithms and the elimination algorithms are surveyed essentially,according to type and their improvement methods, and the algorithms usage scope and their advantages and disadvantages are given. Many data cleaning tools are presented, such as Merge/Purge. Finaly, it discusses the future research topics in data cleaning and points out that the concept of knowledge and semantic used in the framework of data cleaning will be an important trend.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

A Survey of Approximately Duplicate Data Cleaning Method相似重复记录清理方法研究综述

A Survey of Approximately Duplicate Data Cleaning Method
相似重复记录清理方法研究综述