


A New Error-driven Learning Approach for Chinese Word Segmentation
一种新的错误驱动学习方法在中文分词中的应用

Keywords: Chinese word segmentation, rule template, word class, word internal structure, transformation-based learning (TBL)


Abstract:

A well-known problem in Chinese word segmentation (CWS) is that there is no unique definition of a word, so different standards can yield different segmentation outputs. Developing a separate CWS system for each application or standard is impractical, so it is important to be able to adapt the output of an existing CWS system flexibly to different standards or applications. This paper presents a linguistically enriched transformation-based learning (TBL) approach that performs CWS adaptation as a postprocessor. Unlike other transformation-based learning methods used in CWS, the approach draws on linguistic information, introducing word class and word internal structure into the rule templates and transformations. Its performance is evaluated on four test sets representing four different standards, and it proves comparable to several state-of-the-art approaches that perform Chinese word segmentation under a single standard.
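
The abstract does not list the paper's rule templates, so the following is only a minimal sketch of the general idea of transformation-based learning used as a segmentation postprocessor, not the authors' method. Everything in it is an assumption made for illustration: the two toy templates (merge a word of a given class with a following word, or split a specific word in two), the `word_class` function that knows only a "NUM" class, and the names `learn` and `adapt`. Errors are counted as word boundaries that differ from the target standard, and the learner greedily picks the rule that removes the most errors.

```python
def boundaries(words):
    """Return the set of character offsets at which a word ends."""
    cuts, pos = set(), 0
    for w in words:
        pos += len(w)
        cuts.add(pos)
    return cuts

def word_class(word):
    """Toy word classes (an assumption): digit strings vs. everything else."""
    return "NUM" if word.isdigit() else "OTHER"

def apply_rule(rule, words):
    """Apply one transformation left to right over a segmented sentence."""
    kind, a, b = rule
    out, i = [], 0
    while i < len(words):
        if (kind == "merge" and i + 1 < len(words)
                and word_class(words[i]) == a and words[i + 1] == b):
            out.append(words[i] + words[i + 1])  # join class-a word with b
            i += 2
        elif kind == "split" and words[i] == a + b:
            out.extend([a, b])                   # cut the word into a and b
            i += 1
        else:
            out.append(words[i])
            i += 1
    return out

def errors(cur, gold):
    """Count boundaries present in one segmentation but not the other."""
    return len(boundaries(cur) ^ boundaries(gold))

def candidates(cur, gold):
    """Propose rules only at positions where cur disagrees with gold."""
    gold_cuts, rules, pos = boundaries(gold), set(), 0
    for i, w in enumerate(cur):
        for k in range(1, len(w)):
            if pos + k in gold_cuts:             # gold cuts inside this word
                rules.add(("split", w[:k], w[k:]))
        pos += len(w)
        if pos not in gold_cuts and i + 1 < len(cur):
            rules.add(("merge", word_class(w), cur[i + 1]))
    return rules

def learn(corpus, max_rules=10):
    """corpus: list of (initial_segmentation, gold_segmentation) pairs."""
    current = [list(init) for init, _ in corpus]
    golds = [gold for _, gold in corpus]
    learned = []
    for _ in range(max_rules):
        base = sum(errors(c, g) for c, g in zip(current, golds))
        pool = set()
        for c, g in zip(current, golds):
            pool |= candidates(c, g)
        best, best_gain = None, 0
        for rule in sorted(pool):                # sorted for determinism
            total = sum(errors(apply_rule(rule, c), g)
                        for c, g in zip(current, golds))
            if base - total > best_gain:
                best, best_gain = rule, base - total
        if best is None:                         # no rule reduces errors
            break
        learned.append(best)
        current = [apply_rule(best, c) for c in current]
    return learned

def adapt(words, rules):
    """Postprocess a segmentation by applying learned rules in order."""
    for rule in rules:
        words = apply_rule(rule, words)
    return words

# Hypothetical training pairs: (existing system output, target standard).
corpus = [
    (["1998", "年", "我", "来"], ["1998年", "我", "来"]),
    (["2001", "年"], ["2001年"]),
    (["中国人", "好"], ["中国", "人", "好"]),
]
rules = learn(corpus)
adapted = adapt(["1999", "年", "中国人"], rules)
```

Because the merge rule is conditioned on a word class rather than on a specific word, it generalizes to the unseen year "1999", which hints at why class-level templates can help when adapting to a new standard with limited data.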
