A Fuzzy Clustering Method for Longitudinal Data Based on the Eros Distance

Keywords: longitudinal data, extended norm distance, FErosCM clustering, information entropy


Abstract:

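Longitudinal data sets are typically multidimensional, contain missing values, and consist of sequences that are sampled at unequal intervals and are not all of the same length. To handle these characteristics, this paper studies a similarity measure for longitudinal data based on the Eros distance, improves the fuzzy C-means clustering algorithm, and proposes a fuzzy clustering method built on the Eros distance measure. A longitudinal data set is first preprocessed (missing-value imputation, variable standardization, and so on), redundant attributes are reduced using rough set theory, and the data are then classified automatically with the FErosCM clustering method. Comparative experiments confirm that the method can be used for automatic clustering of longitudinal data sets, with information entropy serving as the measure of clustering quality. The experimental results show that FErosCM is effective and feasible for classifying longitudinal data in terms of both clustering efficiency and accuracy.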
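The abstract does not give the FErosCM formulas, so the following is only a minimal sketch of the three building blocks it names, under stated assumptions: the Eros distance as it is commonly defined for multivariate time series (weighted absolute cosines between the covariance eigenvectors of two series), the standard fuzzy C-means membership update, and a size-weighted cluster entropy. All function names, the mean-of-normalized-eigenvalues weighting, and the exact entropy formula are illustrative choices, not taken from the paper.

```python
import numpy as np

def principal_axes(series):
    """Eigen-decompose the covariance of one series shaped (n_timepoints, n_variables).

    Every series yields an (n_variables x n_variables) covariance matrix,
    regardless of its length, so unequal-length sequences stay comparable.
    """
    cov = np.cov(series, rowvar=False)
    eigvals, eigvecs = np.linalg.eigh(cov)   # eigh returns ascending eigenvalues
    order = np.argsort(eigvals)[::-1]        # reorder to descending
    return np.clip(eigvals[order], 0.0, None), eigvecs[:, order]

def eros_weights(eigvals_per_series):
    """Assumed weighting: mean of each series' normalized eigenvalues, summing to 1."""
    normalized = np.array([ev / ev.sum() for ev in eigvals_per_series])
    w = normalized.mean(axis=0)
    return w / w.sum()

def eros_distance(vecs_a, vecs_b, w):
    """sqrt(2 - 2 * Eros), where Eros = sum_i w_i * |<a_i, b_i>|."""
    cosines = np.abs(np.sum(vecs_a * vecs_b, axis=0))  # column-wise dot products
    similarity = min(float(w @ cosines), 1.0)          # guard against rounding above 1
    return float(np.sqrt(2.0 - 2.0 * similarity))

def fuzzy_memberships(dist, m=2.0, eps=1e-12):
    """Standard fuzzy C-means memberships from a (n_samples, n_clusters) distance matrix."""
    d = np.maximum(dist, eps)                # avoid division by zero
    inv = d ** (-2.0 / (m - 1.0))
    return inv / inv.sum(axis=1, keepdims=True)

def cluster_entropy(labels_true, labels_pred):
    """Size-weighted entropy of the true classes inside each cluster (0 means pure)."""
    labels_true = np.asarray(labels_true)
    labels_pred = np.asarray(labels_pred)
    total = 0.0
    for c in np.unique(labels_pred):
        members = labels_true[labels_pred == c]
        _, counts = np.unique(members, return_counts=True)
        p = counts / counts.sum()
        total += (len(members) / len(labels_true)) * -(p * np.log2(p)).sum()
    return float(total)

# Usage with synthetic series of unequal length (3 variables each).
rng = np.random.default_rng(0)
series = [rng.normal(size=(n, 3)) for n in (8, 12, 10)]
decomps = [principal_axes(s) for s in series]
w = eros_weights([ev for ev, _ in decomps])
d01 = eros_distance(decomps[0][1], decomps[1][1], w)
```

Because the distance depends only on each series' covariance structure, it needs no alignment or resampling of the time axis. How FErosCM represents and updates cluster prototypes is not stated in the abstract and is left out here.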

