|
现代图书情报技术 2007
An Entropy-Based Approach for News Article Extraction from Web Page
|
Abstract:
In this paper,an approach for news article extraction from Web page is proposed and this approach applies information theory to DOM tree. Experiment on several news Web sites shows that it is practical.