OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

计算机科学 2007

Web Pages Information Retrieval Based on Keywords Cluster and Node Instance
基于关键词聚类和节点距离的网页信息抽取

DENG Jian-Shuang,ZHENG Qi-Lun,PENG Hong,LIN Xu-Dong,
邓健爽,郑启伦,彭宏,林旭东

Keywords: Cluster,Information retrieval,Machine learning,Instance of node
聚类,信息抽取,机器学习,节点距离

Full-Text Cite this paper Add to My Lib

Abstract:

Many Web information retrieval methods are related to special Web sites, for example, the method based on extracting rules and the one based on training page samples. These methods can do well in a Web site but fail in the others without adding new rules or inputting new training pages manually. Furthermore, if the template of the Web site is changed, it has to redesign the extracting rules or re-inputting the training pages. It is hard to be maintained and used to extract information from large number of different Web sites. In the paper, there is a new method which can extract the useful information from the different sites automatically based on the keywords of a certain topic and the distance of the nodes. Experimental evaluation on a large of Web pages from different Web sites indicates that this method correctly and automatically extracts the information ignoring which Web sites the pages come from. This method has been applied to the system of intelligent searching and mining of electronic business successfully.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Web Pages Information Retrieval Based on Keywords Cluster and Node Instance基于关键词聚类和节点距离的网页信息抽取

Web Pages Information Retrieval Based on Keywords Cluster and Node Instance
基于关键词聚类和节点距离的网页信息抽取