%0 Journal Article
%T Template-based method for automatic extraction of Web information
%A ZHENG Chang-song (郑长松)
%A FU Yang (傅彦)
%A SHE Li (佘莉)
%J Application Research of Computers (计算机应用研究)
%D 2009
%X To address the trade-off between accuracy and efficiency in traditional Web information extraction, this paper proposes a method for automatically extracting Web information that combines templates with automatic machine diagnosis. First, a set of heuristic rules for automatic diagnosis detects the separator characters between different attributes in the HTML text, and these separators are written into the template; then Web pages of the same kind are analyzed based on the template, and finally s...
%K information extraction
%K template
%K automatic recognition
%K separator tag
%K structuring
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=A9D9BE08CDC44144BE8B5685705D3AED&aid=17A473BA111D75F09AFEF4CBFBEA9FE0&yid=DE12191FBD62783C&vid=96C778EE049EE47D&iid=0B39A22176CE99FB&sid=18F040DBCB74FFF9&eid=6A9657F54F754BF6&journal_id=1001-3695&journal_name=计算机应用研究&referenced_num=2&reference_num=8