Acta Electronica Sinica (电子学报), 2008

Search Result Clustering with Initial Selection Optimized by Ontology Extraction

Keywords: search result clustering, ontology, label


Abstract:

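As the volume of data on the Internet keeps growing, it is becoming increasingly difficult for search engines to return accurate results. To improve the structured clustering of search engine results and make information faster to locate, this paper adopts a label-based clustering algorithm and uses dependency parsing and lexicon resources from natural language processing to mine semantic structure in depth, proposing a K-means clustering method based on optimized initial selection. The paper analyzes the characteristics of the K-means clustering algorithm in detail and improves it with category-label techniques. Experiments show that the algorithm not only outperforms general clustering algorithms in clustering quality, but also greatly helps describe the results and substantially improves efficiency.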
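The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea of seeding K-means with category labels instead of random starting points. It assumes the candidate labels have already been extracted (the paper uses dependency parsing and lexicon resources for that step, which is not reproduced here). The function names `vectorize`, `cosine`, `mean_vector`, and `label_seeded_kmeans`, the bag-of-words representation, and the cosine similarity measure are all illustrative assumptions, not the authors' implementation.

```python
from collections import Counter
import math


def vectorize(text):
    """Bag-of-words term counts for a whitespace-tokenized snippet (assumed representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-weight vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        total.update(v)
    n = len(vectors)
    return Counter({t: c / n for t, c in total.items()})


def label_seeded_kmeans(snippets, labels, max_iter=20):
    """K-means where each initial centroid comes from one label (hypothetical seeding rule).

    A label's initial centroid is the mean of all snippets containing that
    label; if no snippet contains it, the label itself is used as the centroid.
    Returns the cluster index (position in `labels`) for every snippet.
    """
    vectors = [vectorize(s) for s in snippets]

    # Label-driven initialization replaces the usual random choice of centers.
    centroids = []
    for label in labels:
        members = [v for v in vectors if label in v]
        centroids.append(mean_vector(members) if members else vectorize(label))

    assignment = None
    for _ in range(max_iter):
        # Assignment step: each snippet goes to its most similar centroid.
        new_assignment = [
            max(range(len(centroids)), key=lambda k: cosine(v, centroids[k]))
            for v in vectors
        ]
        if new_assignment == assignment:
            break  # converged: no snippet changed cluster
        assignment = new_assignment

        # Update step: recompute each centroid from its current members.
        for k in range(len(centroids)):
            members = [v for v, a in zip(vectors, assignment) if a == k]
            if members:
                centroids[k] = mean_vector(members)
    return assignment


if __name__ == "__main__":
    snippets = [
        "jaguar car dealer price",
        "jaguar cat habitat jungle",
        "car price review engine",
        "cat jungle predator",
    ]
    print(label_seeded_kmeans(snippets, ["car", "cat"]))
```

Because each cluster starts from a label, the label can also serve as a readable description of the resulting cluster, which is in line with the abstract's claim that the approach helps describe the results.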

