全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Fast dictionary mechanism for Chinese word segmentation
一种快速中文分词词典机制

Keywords: text real-time processing,Chinese word segmentation,lexicon mechanism,double-character-and-long-word-Hash-indexing(DCLWHI)
文本实时处理
,中文分词,词典法分词,双字词-长词哈希机制

Full-Text   Cite this paper   Add to My Lib

Abstract:

With the development of global networking through Internet, the amount of articles in Chinese or other native languages is increasing rapidly. As the lack of explicit separator, word segmentation is a precondition for the processing of these character-based languages and thus it affects the whole system in performance. In this paper, we propose a new solution for Chinese word segmentation problem based on Lexicon named double-character-and-long-word-hash-indexing (DCLWHI).Compared with traditional lexicon mechanism, DCLWHI improves the speed and efficiency of word segmentation without extra memory spending and gains the same accuracy.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133