%0 Journal Article %T A Web Bibliographies Retrieval Structure Based on the Longest Sequential Frequent Phrases
基于最长顺序频繁词组的Web文献检索结构 %A WANG Da-Ling %A YU Ge %A BAO Yu-Bin %A
王大玲 %A 于戈 %A 鲍玉斌 %J 软件学报 %D 2006 %I %X Most Web bibliographies cannot meet the retrieval requirements of the researchers with different academic levels. The reason resulting in the problem is analyzed, and the idea of constructing an auxiliary Web bibliography retrieval structure for the users to obtain more proper bibliographies is proposed. Based on the idea, an algorithm of mining the longest sequential frequent phrases for extracting features of the bibliographies is designed, and an extended feature hierarchical tree describing the relationship among the features, among the bibliographies, and among the features, the bibliographies and its construction is presented. The experiments show that the new method outperforms the current popular TFIDF method in extraction features. The theoretical analysis explains that the extended feature hierarchical tree has constringent structure, reveals the relationship between phrases and bibliographies, and provides better assistant retrievals. %K longest sequential frequent phrases %K extended feature hierarchical tree %K feature extraction %K text mining %K information retrieval
最长顺序频繁词组 %K 扩展的特征层次树 %K 特征抽取 %K 文本挖掘 %K 信息检索 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=EF9E20F8CD7AA7DB&yid=37904DC365DD7266&vid=BCA2697F357F2001&iid=F3090AE9B60B7ED1&sid=572ABCACB4426B6D&eid=0FC8B9772E3A7521&journal_id=1000-9825&journal_name=软件学报&referenced_num=0&reference_num=12