|
计算机科学 2012
Semantic-based Chinese Web Page Retrieval
|
Abstract:
Semantic-based chinese Web page retrieval is a promsing application.The existing semantic retrieval mechanisms are categorized into three types,which are based on ontology,natural language understanding,and text classification and clustering respectively.The three technologies were reviewed and examined in detail.Semantic-based chinese Web page retrieval system should focus on popular fields to draw great attention from Web users.Moreover,Web pages should be indexed with words rather than chinese characters.Advanced chinese information processing technologies should be integrated into semantic retrieval systems.Some directions for future research were finally presented,including semantic relevance ranking,ontology definition and instance automatic extraction,semantic-based indexing,and large-scale semantic training collections construction.