%0 Journal Article
%T A Consistency Cleaning Method Based on Content-Related Conditional Functional Dependencies
%A 杜岳峰
%A 申德荣
%A 张亮
%A 于戈
%J 东北大学学报(自然科学版)
%D 2016
%R 10.12068/j.issn.1005-3026.2016.12.003
%X Abstract: Building on conditional functional dependencies (CFDs), this paper proposes content-related conditional functional dependencies (CCFDs) and a consistency cleaning method based on them. By analyzing the relationships among CFDs, related CFDs are merged into CCFDs. CCFDs can detect data inconsistencies under multiple condition values and also provide reference values for consistency repair. A repair-cost model is also proposed: guided by the actual tuples that each CCFD covers, it repairs the data at minimal cost while guaranteeing consistency. Experiments on two real-life datasets show that the proposed method accurately detects and repairs data inconsistencies.
%K data cleaning
%K conditional functional dependency
%K content relativity
%K data consistency
%K repairing-cost model
%U http://xuebao.neu.edu.cn/natural/CN/abstract/abstract10051.shtml