|
现代图书情报技术 2010
Research on Focused Merchandise Information Crawling Based on Semantic Crawler
|
Abstract:
This article proposes a method to gather merchandise information based on focused crawler,which integrates the Web topic link analysis and topic content semantic analysis.Through the statistical learning to Ontology during the crawling,the reference of domain-specific Ontology is optimized continuously.The experiment results show that comparing with other conventional crawling algorithms,this method is more effective,as it can prevent the occurrence of topic drift and bring a higher topic harvest rate.