全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

面向英语文章的词性标注算法

DOI: 10.13190/j.jbupt.2014.06.025, PP. 120-124

Keywords: 词性标注,学生英语文章,特征,词聚类

Full-Text   Cite this paper   Add to My Lib

Abstract:

面向英语文章的词性标注是对英语文章实现自动批改的基础,虽然研究者对英语词性标注做了大量有益的研究,但是大多数的研究都面向英语为第一语言的用户,而面向英语为第二语言用户的相关研究则很少.为此,对以英语为第二语言用户的英语文章进行了人工标注,在此基础上提出了一种面向英语文章的词性标注算法,融合了词聚类、无标语料统计信息、单词发音等特征.实验结果表明,该算法能有效提高词性标注性能,标注正确率从94.49%可提高到97.07%.

References

[1]  Kristina Toutanova, Dan Klein, Christopher D Manning, et al. Feature-rich part-of-speech tagging with a cyclic dependency network[C]//Proceedings of NAACL-HLT 2003. Los Angeles, California: Association for Computational Linguistics, 2003, 1: 173-180.
[2]  Ana Daz-Negrillo, Detmar Meurers, Salvador Valera, et al. Towards interlanguagepos annotation for effective learner corpora in sla and flt[J]. Language Forum, 2010, 36: 1-15.
[3]  Mitchell Marcus, Beatrice Santorini, Mary Ann Marcinkiewicz. Building a large annotated corpus of English: the penn treebank[J]. Computational Linguistics, 1993, 19: 313-330.
[4]  李红. 大学生英语写作常见错误归类分析[J]. 当代教育论坛: 学科教育研究, 2006, 8: 120-121. Li Hong. The common errors analysis of college englishwriting[J]. Forum on Contemporary Education, 2006, 8: 120-121.
[5]  张雷刚. 英语写作常见错误分析及教学建议[J]. 经济研究导刊, 2010, 19: 243-244. Zhang Leigang. Common errors in English writing and teaching suggestions[J]. Economic Research Guide, 2010, 19: 243-244.
[6]  Yoav Goldberg, Michael Elhadad. An efficient algorithm for easy-first non-directional dependency parsing[C]//Proceedings of NAACL 2010. Los Angeles, California: Association for Computational Linguistics, 2010: 742-750.
[7]  Jun'ichi Kazama, Jun'ichi Tsujii. Evaluation and extension of maximum entropy models with inequality constraints[C]//Proceedings of EMNLP 2003. Honolulu, Hawaii: Association for Computational Linguistics, 2003: 137-144.
[8]  Peter F. Brown, Peter V. Desouza, Robert L. Mercer, et al. Vincent J. Della Pietra, and Jenifer C. Lai. Class-based n-gram models of natural language[J]. Computational Linguistics, 1992, 18(4): 467-479.
[9]  Alan Ritter, Sam Clark, Mausam, et al. Named entity recognition in tweets: an experimental study[C]//Proceedings of EMNLP 2011. Honolulu, Hawaii: Association for Computational Linguistics, 2011: 1524-1534.
[10]  Olutobi Owoputi, Brendan O'Connor, Chris Dyer, et al. Improved part-of-speech tagging for online conversational text with word clusters[C]//Proceedings of NAACL-HLT 2013. Los Angeles, California: Association for Computational Linguistics, 2013: 380-390.
[11]  Adwait Ratnaparkhi. A maximum entropy model for part-of-speech tagging[C]//Proceedings of EMNLP 1997. Honolulu, Hawaii: Association for Computational Linguistics, 1996, 1: 133-142.
[12]  Lawrence Phillips. Hanging on the Metaphone[J]. Computer Language, 1990, 7(12): 39-44.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133