Application Research of Computers (计算机应用研究), 2009

Template-based information automatic extraction of Web (基于模板的Web信息自动提取方法)

ZHENG Chang-song, FU Yang, SHE Li (郑长松, 傅彦, 佘莉)

Keywords: information extraction, template, automatic recognition, separator tag, structuring

Abstract:

To address the trade-off between accuracy and efficiency in traditional Web information extraction, this paper proposes a method for extracting Web information automatically by combining templates with automatic machine recognition. First, a set of heuristic rules automatically detects the separators between different attributes in the HTML text, and these separators are recorded in a template. Web pages of the same kind are then parsed according to that template, and finally s...
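
The pipeline the abstract outlines (detect separators heuristically, store them in a template, then parse similar pages with that template) might be sketched as follows. This is only an illustration under stated assumptions: the paper's actual heuristic rules are not given here, so the rule used below (pick the tag that most often sits between two runs of text) is a stand-in, and the function names `detect_separator`, `build_template`, and `extract_fields`, as well as the sample pages, are hypothetical.

```python
import re
from collections import Counter

TAG_SPLIT = re.compile(r"(<[^>]+>)")  # splits HTML into alternating text and tag tokens
TAG_NAME = re.compile(r"<\s*/?\s*([a-zA-Z][a-zA-Z0-9]*)")
ANY_TAG = re.compile(r"<[^>]+>")


def detect_separator(html: str) -> str:
    """Assumed heuristic: return the tag name seen most often between two text runs."""
    counts = Counter()
    gap_tags = []      # tag names seen since the last non-empty text run
    seen_text = False
    for i, token in enumerate(TAG_SPLIT.split(html)):
        if i % 2 == 1:  # odd positions hold the captured tags
            match = TAG_NAME.match(token)
            if match:
                gap_tags.append(match.group(1).lower())
        elif token.strip():
            if seen_text:
                counts.update(set(gap_tags))  # count each tag once per gap
            seen_text = True
            gap_tags = []
    if not counts:
        raise ValueError("no separator tag found between text runs")
    return counts.most_common(1)[0][0]


def build_template(sample_html: str) -> dict:
    """Record the detected separator in a (hypothetical) template structure."""
    return {"separator": detect_separator(sample_html)}


def extract_fields(html: str, template: dict) -> list:
    """Split a page of the same kind on the template's separator tag."""
    sep = re.compile(
        r"<\s*/?\s*" + re.escape(template["separator"]) + r"\b[^>]*>",
        re.IGNORECASE,
    )
    fields = []
    for part in sep.split(html):
        text = ANY_TAG.sub("", part).strip()  # drop any remaining markup
        if text:
            fields.append(text)
    return fields


# Hypothetical sample page used to learn the template, and a second page of the same kind.
sample_page = "<div>Title: Foo<br>Author: Bar<br>Year: 2009</div>"
other_page = "<div>Title: Baz<br/>Author: Qux<br/>Year: 2008</div>"

template = build_template(sample_page)
print(template)
print(extract_fields(other_page, template))
```

Learning the template once and reusing it is what would let such an approach trade a small up-front analysis cost for fast extraction on every later page with the same layout.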
