%0 Journal Article
%T Optimized Web information extraction based on XQuery
一种基于XQuery的优化Web信息抽取方法
%A ZHI Zong-liang
%A CHEN Shao-fei
%A
支宗良
%A 陈少飞
%J 计算机应用
%D 2008
%I
%X Due to lack of the analysis of the adaptability of the Web page's characteristics, the current typical systems can hardly provide robust extraction rules. This paper proposed an optimized Web information extraction method which divided rules into three associated layers, suggested an optimized algorithm for extraction rules from the view of the precision and recall ratio through analyzing the adaptability of the page's characteristics, and expressed the complicated object rule in standard XQuery. Experiments indicate that our approach enhances the robustness and usability of the rules.
%K Information extraction
%K XPath
%K XQuery
%K Rule optimizing
信息抽取
%K 规则优化
%K XPath
%K XQuery
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=EE722DA074E55A4AE0809F0569D050D3&yid=67289AFF6305E306&vid=D3E34374A0D77D7F&iid=CA4FD0336C81A37A&sid=04445C1D2BDA24EE&eid=303A4CC37123AF07&journal_id=1001-9081&journal_name=计算机应用&referenced_num=0&reference_num=8