全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

半监督聚类的匿名数据发布

DOI: doi:10.3969/j.issn.1006-7043.2011.11.017

Keywords: 数据发布, 隐私保护, 匿名数据, 半监督, 聚类

Full-Text   Cite this paper   Add to My Lib

Abstract:

为增强个体与隐私信息的保护力度, 提高数据效用和降低时间代价, 提出半监督聚类的(α, k)匿名模型, 并设计算法予以实现, 分析了算法时间复杂度. 针对数据集包含数值属性和分类属性的特点, 把数值属性和分类属性映射到相同的度量空间进行运算, 以相异矩阵表示数据集元组之间的距离, 使相同或者相近的元组有效地聚集到同一个簇内. 把高敏感度属性设置较高的保护度, 低敏感度设置较低的保护度, 实现了敏感属性的个性化保护. 实验结果表明, 半监督(α, k)匿名模型可安全且高效地实现隐私保护, 保证了发布数据的质量.

References

[1]  FUNG B C M, WANG K, CHEN R, et al. Privacy-preserving data publishing: a survey of recent developments[J]. ACM Comput Surv, 2010, 42(4):1-53. ?
[2]  CHEN B, KIFER D, LEFEVRE K, et al. Privacy-preserving data Publishing[J]. Found Trends databases, 2009, 2 (1): 1-167.?
[3]  SWEENEY L. ?k-?anonymity: a model for protecting ?privacy[J].? International Journal of Uncertainty Fuzziness and Knowledge Based Systems, 2002, 10(5): 557-570. ?
[4]  AGGARWAL G, PANIGRAHY R. Achieving anonymity via clustering[J]. ACM Trans Algorithms, 2010, 6 (3): 1-19.?
[5]  LIN J, WEN T, HSIEH J, et al. Density-based microaggregation for statistical disclosure control[J]. Expert Systems with Applications,2010, 37(4): 3256-3263. ?
[6]  MACHANAVAJJHALA A, KIFER D, GEHRKE J, et al. ?l?-diversity: privacy beyond ?k-?anonymity[J]. ACM Transactions on Knowledge Discovery from Data, 2007, 1(1): 1-52. ?
[7]  WONG R, LI J, FU A, et al. (α, ?k?)-anonymous data publishing[J]. Journal of Intelligent Information Systems, 2009, 33(2): 209-234. ?
[8]  CAMPAN A, TRUTA T M, COOPER N. P-sensitive ?k-?anonymity with generalization constraints[J]. Transactions on Data Privacy, 2010, 3(2): 65-89.?
[9]  MACHANAVAJJHALA A, GEHRKE J, KIFER D, et al. ?l?-diversity: privacy beyond ?k-?anonymity[C]//22nd International Conference on Data Engineering. Atlanta, GA, US, 2006: 24.?
[10]  王智慧, 许俭, 汪卫, 等. 一种基于聚类的数据匿名方法[J]. 软件学报, 2010, 21(4): 680-693. ?WANG Zhihui, XU Jian, WANG Wei, et al. Clustering- based approach for data anonymization[J]. Journal of Software, 2010, 21(04): 680-693.?
[11]  WONG R, LI J, FU A, et al. (α, ?k?)-anonymity: an enhanced ?k-?anonymity model for privacy preserving data publishing[C]//Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining.[s.l.], 2006: 754-759.?
[12]  韩建民, 于娟, 虞慧群, 等. 面向敏感值的个性化隐私保护[J]. 电子学报,2010,38(7): 1723-1728. ?HAN Jianmin, YU Juan, YU Huiqun, et al. Individuation privacy preservation oriented to sensitive values[J]. Acta Electronica Sinica, 2010, 38(7): 1723-1728.?
[13]  BAYARDO R J, AGRAWAL R. Data privacy through optimal ?k-?anonymization[C]//Proceedings of the International Conference on Data Engineering. Tokyo, Japan, 2005: 217-228.?
[14]  HUANG Z. Extensions to the ?k-?means algorithm for clustering large data sets with categorical values[J]. Data Mining and Knowledge Discovery, 1998, 2(3): 283-304.?

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133