全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Research and Realization of a Web Information Extraction and Knowledge Presentation System
Web信息抽取及知识表示系统的研究与实现

Keywords: web information extraction,knowledge presentation,data intensive web pages,ontology-based keyword library
Web信息提取
,知识表示,数据密集型Web页面,基于本体的关键词库

Full-Text   Cite this paper   Add to My Lib

Abstract:

The Web Information Extraction and Knowledge Presentation System is proposed to extract information from data intensive web pages. It downloads dynamic web pages, based on a knowledge database, changes them to XML documents after preprocessing, finds repeated patterns from them, by using a PAT-array based Pattern Discovery Algorithm, recognizes their data display structure models, automatically based on the repeated patterns and an ontology-based keyword library, and then extracts the data and stores them in the knowledge database with the object-relational mapping technology of XML. Through these steps, web data is extracted automatically, and the knowledge database is also expanded automatically. Experiments on the traffic information auto-extraction and mixed traffic travel schemes auto-creation system showed that the system has high precision and is adaptive to web pages in different domains with different structures.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133