Research on Deep Web Classification Based on Domain Feature Text

Keywords: Feature text, Domain classification, Vector space model, Deep Web

Abstract:

Automatic Deep Web classification is the basis for building a Deep Web data integration system. This paper proposes an approach to classifying Deep Web sources based on domain feature text. First, ontology knowledge is used to extract concepts that express the same semantics from different texts. Then, domain correlation is defined as a quantitative criterion for feature text selection, avoiding the subjectivity and uncertainty of manual selection. In constructing the interface vector space model, an improved weighting method named W-TFIDF is proposed to evaluate the different roles of feature texts. Finally, a KNN algorithm is used to classify the interface vectors. Comparative experiments indicate that the feature text selected by this method is accurate and effective, and that the new weighting method improves classification precision significantly and shows good stability in KNN classification.
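
The last two steps of the pipeline described above (building interface vectors in a vector space model, then classifying them with KNN) can be illustrated with a minimal sketch. The abstract does not give the formula of its improved weighting method, so this sketch uses standard TF-IDF as a stand-in; it is not the authors' weighting scheme. The function names, the toy interface feature texts, and the domain labels `books` and `flights` are all hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def build_idf(docs):
    """Compute a smoothed IDF value for every term in the training texts."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Adding 1.0 keeps terms that occur in every document from dropping to zero.
    return {term: math.log(n / count) + 1.0 for term, count in df.items()}


def vectorize(tokens, idf):
    """Turn one tokenized feature text into a sparse {term: tf * idf} vector.

    Standard TF-IDF is an assumption here; the paper's improved weighting
    is not specified in the abstract. Terms unseen in training are ignored.
    """
    total = len(tokens) or 1
    tf = Counter(t for t in tokens if t in idf)
    return {term: (count / total) * idf[term] for term, count in tf.items()}


def cosine(u, v):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def knn_classify(query, train_vectors, labels, k=3):
    """Return the majority label among the k most similar training vectors."""
    ranked = sorted(
        zip(train_vectors, labels),
        key=lambda pair: cosine(query, pair[0]),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical feature texts extracted from four query interfaces.
docs = [
    ["title", "author", "isbn", "publisher"],
    ["author", "title", "keyword", "price"],
    ["departure", "destination", "date", "passengers"],
    ["origin", "destination", "date", "class"],
]
labels = ["books", "books", "flights", "flights"]

idf = build_idf(docs)
vectors = [vectorize(doc, idf) for doc in docs]

query = vectorize(["author", "isbn", "price"], idf)
print(knn_classify(query, vectors, labels, k=3))
```

A domain-specific weighting such as the one the abstract proposes would replace the body of `vectorize`; the similarity measure and the KNN vote would stay unchanged.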
