全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
软件学报  2003 

A Method to Query Document Database by Content and Structure
一种通过内容和结构查询文档数据库的方法

Keywords: document database,information retrieval,passage retrieval,structured document
文档数据库
,信息检索,段落检索,结构化文档

Full-Text   Cite this paper   Add to My Lib

Abstract:

Structured documents are made up of a few logical components, such as title, sections, subsections andparagraphs. The components in each structured document can be represented by an ordered tree model, which canalso be viewed as a hierarchical concept relationship. To meet the user's requirements for more precise andconcentrated search results, the retrieval techniques should allow the user to retrieve document components withvarying granularity. This paper presents a method to query document database by content and structure. The keyidea is to construct a more comprehensive similarity function by taking advantage of the inherent hierarchicalstructure in documents. This work combines Information Retrieval techniques, semi-structured data query andproximate search for document documents. The proposed method is evaluated on the Encarta encyclopediadocument set and the experimental results show that it can provide more accurate and focused answers thantraditional document retrieval methods.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133