Computer Science, 2009
Research and Implementation of Web Table Positioning Technology
Abstract:
Web table positioning, an important research topic within Web table extraction, is receiving growing attention. Based on the structural markup of Web tables and a set of self-defined heuristic rules, table positioning is accomplished by resolving the <TABLE> nesting problem, judging the completeness of data tables, and traversing the DOM tree.

Keywords: DOM tree; table positioning; heuristic rules; <TABLE> nesting; traversal

Revised: 1/7/2009

Citation: LIAO Tao, LIU Zong-tian, SUN Rong. Research and Implementation of Web Table Positioning Technology[J]. Computer Science, 2009, 36(9): 227-230.

Authors: LIAO Tao, LIU Zong-tian, SUN Rong

Affiliation: Department of Computer Science and Engineering, Anhui University of Science & Technology, Huainan 232001, China; School of Computer Engineering and Science, Shanghai University, Shanghai 200072, China

Abstract: Web table positioning technology is considered an essential component of Web table information extraction, and it is attracting more and more attention. This paper realizes table positioning according to Web table structure labels and user-defined heuristic rules, which includes the solution of the nesting problem, the determination of table data completeness,
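
The abstract names three steps (resolving <TABLE> nesting, judging whether a table is a complete data table, and traversing the DOM tree) but does not give the actual rules. The following Python sketch only illustrates the general idea under assumptions of its own and is not the authors' method. It assumes that a table containing another table is a layout table and is skipped, and that a data table has at least `min_rows` rows with at least `min_cols` cells in each. The names `Node`, `TreeBuilder`, `locate_data_tables`, `min_rows`, and `min_cols` are hypothetical, and the tree builder assumes markup whose tags are explicitly closed.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not become "current".
VOID_TAGS = {"br", "hr", "img", "input", "meta", "link", "col", "area", "base", "wbr"}


class Node:
    """Minimal DOM-like node (hypothetical helper, not a real DOM API)."""

    def __init__(self, tag, parent=None, text=""):
        self.tag = tag
        self.parent = parent
        self.text = text
        self.children = []


class TreeBuilder(HTMLParser):
    """Builds a simple tree; assumes tags are explicitly closed."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open ancestor with this tag name, if there is one.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent

    def handle_data(self, data):
        stripped = data.strip()
        if stripped:
            self.current.children.append(Node("#text", self.current, stripped))


def iter_nodes(node):
    """Depth-first, pre-order traversal of the tree."""
    yield node
    for child in node.children:
        yield from iter_nodes(child)


def contains_table(table):
    """True if another <table> is nested anywhere inside this one."""
    return any(d.tag == "table" for c in table.children for d in iter_nodes(c))


def rows_of(node):
    """Rows that belong to this table itself, not to a nested table."""
    rows = []
    for child in node.children:
        if child.tag == "tr":
            rows.append(child)
        elif child.tag != "table":  # descend through thead/tbody/tfoot
            rows.extend(rows_of(child))
    return rows


def cell_texts(row):
    """Text of each <td>/<th> cell in a row, in document order."""
    cells = [c for c in row.children if c.tag in ("td", "th")]
    return [" ".join(n.text for n in iter_nodes(c) if n.tag == "#text") for c in cells]


def locate_data_tables(html, min_rows=2, min_cols=2):
    """Return the cell text of every table that passes the assumed heuristics."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    found = []
    for node in iter_nodes(builder.root):
        # Assumed nesting rule: an outer table that wraps other tables is layout.
        if node.tag != "table" or contains_table(node):
            continue
        rows = [cell_texts(r) for r in rows_of(node)]
        # Assumed completeness rule: enough rows, and enough cells in every row.
        if len(rows) >= min_rows and all(len(r) >= min_cols for r in rows):
            found.append(rows)
    return found
```

With these assumed rules, an outer layout table that wraps one two-column price list and one single-cell menu table yields only the price list. Real pages often omit closing tags for `<td>` and `<tr>`, so a production version would need a more tolerant parser.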