%0 Journal Article %T Auto-extraction methods of Web pagelet
网页Pagelet的自动抽取方法 %A ZHU Ming %A LI Wei %A
朱明 %A 李伟 %J 计算机应用 %D 2005 %I %X Besides the needed data, there are lots of navigation information and advertisements in the Web pages. A DOM tree comparison algorithm was proposed. It compared several pages within a class, and recognized the main contents in pages. Experiment results show that it is feasible and effective. %K Web mining %K information retrieval %K DOM similarity %K DOM node clustering
Web挖掘 %K 信息获取 %K DOM相似度 %K DOM节点聚类 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=7D3BA904BA675F71&yid=2DD7160C83D0ACED&vid=C5154311167311FE&iid=708DD6B15D2464E8&sid=0C855F950D3C0D58&eid=6D324E2981A4CD88&journal_id=1001-9081&journal_name=计算机应用&referenced_num=0&reference_num=7