全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

The Research of Character-Position-Based Chinese Word Segmentation
基于字位信息的中文分词方法研究*

Keywords: Chinese word segmentation Character-position Maximum entropy Unknown word recognition
中文分词
,字位,最大熵,未登录词识别

Full-Text   Cite this paper   Add to My Lib

Abstract:

This paper analyses the actuality and introduces several different representative approaches of Chinese word segmentation,then brings out a character-position-based segmentation method which takes the Chinese character as the least unit.It indicates the probability distribution of a word through the probability distribution of Chinese character,so it plays much better than other approaches in unknown word recognition.This idea takes a machine-learning method called maximum entropy for implementation and two experiments for comparing and analyzing the results.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133