|
- 2017
基于百科词条的本体概念聚类方法研究
|
Abstract:
该文面向本体关系集合的自动构建,提出一种基于百科词条的本体概念聚类方法,用于发现领域概念之间的语义关系。在给定领域本体概念集合的条件下,该方法首先获取相关的百科词条并建立每一概念的向量模型,然后根据距离判别法进行概念聚类,得到概念间的相近关系。采用该方法对3个领域中的领域概念集合进行聚类,实验结果表明,该文方法比传统聚类算法有更好的聚类结果,有助于概念间关系的自动获取和领域本体自动构建。
[1] | HAN Jia-wei, KAMBER M, PEI Jian. Data mining: Concepts and techniques[M]. 3rd ed. Beijing: China Machine Press, 2012: 443-444. |
[2] | CHEN Shi-xi, WANG Hai-xun, ZHOU Shui-geng. Concept clustering of evolving data[C]//IEEE 25th International Conference on Data Engineering. Shanghai, China: IEEE Computer Society, 2009: 1327-1330. |
[3] | THO Q T, HUI S C, FONG A C M, et al. Automatic fuzzy ontology generation for semantic web[J]. IEEE Transactions on Knowledge and Data Engineering, 2006, 18(6): 842-856. |
[4] | KUMAR C A, SRINIVAS S. Concept lattice reduction using fuzzy k-means clustering[J]. Expert Systems with Applications, 2010, 37(3): 2696-2704. |
[5] | WONG W, LIU W, BENNAMOUN M. Tree-traversing ant algorithm for term clustering based on featureless similarities[J]. Data Mining and Knowledge Discovery, 2007, 15(3): 349-381. |
[6] | 全国科学技术名词审定委员会. 全国科学技术名词审定委员会简介[EB/OL]. [2016-12-24]. http://www.cnctst.cn/. China National Committee for Terms in Sciences and Technologies. An introduction of China national committee for terms in sciences and technologies [EB/OL]. [2016-12- 24]. http://www.cnctst.cn/. |
[7] | 张明卫, 刘莹, 张斌, 等. 一种基于概念的数据聚类模型[J]. 软件学报, 2009, 20(9): 2387-2396. ZHANG Ming-wei, LIU Ying, ZHANG Bin, et al. Concept-based data clustering model[J]. Journal of Software, 2009, 20(9): 2387-2396. |
[8] | GRUBER T R. A translation approach to portable ontology specifications[J]. Knowledge Acquisition, 1993, 5(2): 199-220. |
[9] | 徐德智, JUNAID. Cluster-Merge本体构造算法[J]. 计算技术与自动化, 2010, 59(3): 49-52. XU De-zhi, JUNAID. An ontology learning based on documents clustering[J]. Computing Technology and Automation, 2010, 59(3): 49-52. |
[10] | LEUNG K W T, LEE D L. Deriving concept-based user profiles from search engine logs[J]. IEEE Transactions on Knowledge and Data Engineering, 2010, 22(7): 969-982. |
[11] | 何琳, 侯汉清. 基于统计自然语言处理技术的领域本体半自动构建研究[J]. 情报学报, 2009, 28(2): 201-207. HE Lin, HOU Han-qing. Research on semi-automatic construction of domain ontology based on statistical NLP technique[J]. Journal of the China Society for Scientific and Technical Information, 2009, 28(2): 201-207. |
[12] | 胡云飞. 本体学习中关系获取的研究[D]. 西安: 西安建筑科技大学, 2012. HU Yun-fei. Research on relations acquisition of ontology learning[D]. Xi'an: Xi'an University of Architecture and Technology, 2012. |
[13] | LEE C S, KAO Y F, KUO Y H, et al. Automated ontology construction for unstructured text documents[J]. Data & Knowledge Engineering, 2007, 60(3): 547-566. |
[14] | 于娟. 基于文本的领域本体学习方法及其应用研究[D]. 大连: 大连理工大学, 2010. YU Juan. Learning domain ontologies from Chinese text corpora[D]. Dalian: Dalian University of Technology, 2010. |
[15] | 刘金岭. 基于《现代汉语语义分类词典》的文本聚类方法[J]. 情报杂志, 2010, 29(11): 170-173. LIU Jin-ling. Text clustering method based on thesaurus of modern Chinese[J]. Journal of Intelligence, 2010, 29(11): 170-173. |