%0 Journal Article
%T Web Literature Collection System
WEB文献资料采集系统
%A MA Chuang-Xin
%A 马创新
%J 计算机系统应用
%D 2012
%X To take advantage of the rich literature resources on the Web, this paper designs WLES, a specialized web literature collection system. WLES integrates web crawling and web page cleaning technology. A machine learning method is introduced for web page cleaning: a cleaning model is learned from training data and then used to clean web pages. Experiments show that WLES performs excellently in both web crawling and web page cleaning and meets users' literature collection needs.
%K literature collection
%K machine learning
%K web page cleaning
%K cleaning model
%K 文献资料采集
%K 机器学习
%K 网页清洗
%K 清洗模型
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D4F6864C950C88FFCE5B6C948A639E39&aid=57C6AD9B0ADA05379D6B69BEFA413B9F&yid=99E9153A83D4CB11&vid=659D3B06EBF534A7&iid=DF92D298D3FF1E6E&sid=9CF7A0430CBB2DFD&eid=C06386AA4807371E&journal_id=1003-3254&journal_name=计算机系统应用&referenced_num=0&reference_num=6