%0 Journal Article
%T Text information extraction based on wrapper model
%T 基于包装器模型的文本信息抽取
%A WANG Jing-pu
%A LIN Ya-ping
%A ZHOU Shun-xian
%A YUE Wen
%A 王敬普
%A 林亚平
%A 周顺先
%A 岳文
%J Journal of Computer Applications (计算机应用)
%D 2006
%X A new wrapper induction algorithm for text information extraction is proposed, following an analysis of two classes of existing algorithms: those based on landmarks and those based on text patterns. The new algorithm combines the advantages of both: it locates information using the landmark information of Web pages, and it uses text patterns to extract and filter large quantities of Web text. Experimental results show that the new method achieves higher accuracy and expressiveness in information extraction.
%K information extraction
%K wrapper
%K landmark
%K text pattern
%K induction
%K 信息抽取
%K 包装器
%K 标志
%K 文本模式
%K 归纳学习
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=C4D8C2E45CF28501&yid=37904DC365DD7266&vid=96C778EE049EE47D&iid=38B194292C032A66&sid=04FC77FB58A9B53A&eid=F86C3AE4543FDF24&journal_id=1001-9081&journal_name=计算机应用&referenced_num=5&reference_num=13