OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

计算机系统应用 2011

Web Medicine Information Extraction Algorithm Based on Semantics
基于语义的互联网药品信息抽取算法

SHEN Yuan-Yi,ZHENG Xiao-Qing,GU Yi-Ling,
沈元一,郑骁庆,顾轶灵

Keywords: Web information extraction,semantic dictionary,DOM,information entropy,XPath,medical E-business
Web,信息抽取,语义词典,DOM,信息熵,Xpath,医药电子商务

Full-Text Cite this paper Add to My Lib

Abstract:

This article addresses defects of current Web information extraction technology such as low accuracy, low coverage, and manual intervention required, proposes a novel extraction algorithm of web medicine information. The algorithm sets up a three-dimentional semantic dictionary by introduction of the semantics technology, masks the isomerisms of the web page contents and structures, and at the same time, taking advantage of the fact that the attributes of the target medicine tend to have a character of aggregation, designs a way of intellectually locating and extracting the target information based on the theory of information entropy. Through related experiments proves that the algorithm is able to reduce the requirement of manual intervention of the information extraction, and has a high accuracy and recall rate. The application of this algorithm can automatically, comprehensively, and accurately obtain Internet medicine information in real time, offers abundant basis of supervision for the medicine supervision department, and therefore has a significant practical meaning of normalizing medical e-business and ensuring secure medication.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Web Medicine Information Extraction Algorithm Based on Semantics基于语义的互联网药品信息抽取算法

Web Medicine Information Extraction Algorithm Based on Semantics
基于语义的互联网药品信息抽取算法