全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Content Analysis - A Corpus Based Model

Keywords: Discourse Analysis , Information Retrieval , Natural Language Processing

Full-Text   Cite this paper   Add to My Lib

Abstract:

An important step to understand text is to build the discourse structure through cohesion and coherence. However, to build the discourse structure in turn depends on the full understanding of texts, so that many efforts on this line are not automatic and not successful. A corpus-based model based on 1) repetition of words, 2) importance of words, and 3) collocational semantics for texts is proposed in this paper. It focuses on association norms of noun-noun relations and noun-verb relations defined on discourse level and sentence level, respectively. According to this model, a text partition algorithm is proposed to determine the boundaries of discourse structures and a topic identification algorithm is also presented. The results of a series of experiments show that the proposed model is Promising.[Article content in Chinese]

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133