%0 Journal Article
%T Extracting Information by Mining Structures of Web Pages
%A LI Yuan (李媛)
%A GENG Hua (耿桦)
%A ZHANG Meng (张甍)
%A PAN Jin-Gui (潘金贵)
%J Computer Science (计算机科学)
%D 2006
%X To simplify the task of obtaining information from the vast number of sources available on the WWW, we have developed two methods for fine-grained information extraction. This paper first describes the principles of the two methods, which work by mining the structures of Web pages, and then compares their advantages and disadvantages. Finally, we test the performance of both methods and analyze the experimental results.
%K RSS
%K information extraction
%K Web page structure mining
%K repeated patterns
%K temporal features
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=35C2845AB354B854&yid=37904DC365DD7266&vid=27746BCEEE58E9DC&iid=38B194292C032A66&sid=AC1578C6BB9EBDEF&eid=23104246A5FCFCEF&journal_id=1002-137X&journal_name=计算机科学&referenced_num=2&reference_num=9