%0 Journal Article
%T An Entropy-Based Approach for News Article Extraction from Web Page
基于熵的新闻网页抽取方法的研究
%A Zhu Hongcan Long Zhaoyang
%A
朱红灿
%A 龙朝阳
%J 现代图书情报技术
%D 2007
%I
%X In this paper,an approach for news article extraction from Web page is proposed and this approach applies information theory to DOM tree. Experiment on several news Web sites shows that it is practical.
%K DOM
熵
%K 信息抽取
%K 信息块
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=B5EDD921F3D863E289B22F36E70174A7007B5F5E43D63598017D41BB67247657&cid=E46382710BF131B2&jid=24AADBCD0D5373C73F37F78D10E2F717&aid=A0C499109C249592&yid=A732AF04DDA03BB3&vid=0B39A22176CE99FB&iid=E158A972A605785F&sid=B6DA1AC076E37400&eid=987EDA49D8A7A635&journal_id=1003-3513&journal_name=现代图书情报技术&referenced_num=0&reference_num=4