全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Template-based information automatic extraction of Web
基于模板的Web信息自动提取方法*

Keywords: information extraction,template,automatic cognition,separator tag,structurization
信息提取
,模板化,自动识别,分隔标记,结构化

Full-Text   Cite this paper   Add to My Lib

Abstract:

In order to deal with the contradiction between accuracy and efficiency in the traditional Web information extraction,proposed one method to automatically extract Web information,which was based on the combination of template and machine automatic diagnosis.First,used a set of heuristic rules of automatic diagnosis to detect separating characters between different attributes in HTML text,and deployed those characters to the template,then based on the template analyzed Web page of the same kind,and finally s...

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133