全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

A Survey of Approximately Duplicate Data Cleaning Method
相似重复记录清理方法研究综述

Keywords: 相似重复记录,数据清洗,检测算法,清除算法

Full-Text   Cite this paper   Add to My Lib

Abstract:

This paper introduces the steps, frameworks and metrics of approximately duplicate data cleaning. Then, the detect algorithms and the elimination algorithms are surveyed essentially,according to type and their improvement methods, and the algorithms usage scope and their advantages and disadvantages are given. Many data cleaning tools are presented, such as Merge/Purge. Finaly, it discusses the future research topics in data cleaning and points out that the concept of knowledge and semantic used in the framework of data cleaning will be an important trend.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133