全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

融合C4.5与SVM算法的汉语句义类型识别方法

Keywords: 自然语言处理,语义分析,句义结构,句义类型识别

Full-Text   Cite this paper   Add to My Lib

Abstract:

选择50个词法和句法特征,进行了大量特征筛选实验,并基于筛选后的特征组合提出了一种融合C4.5和SVM的句义类型识别方法.该方法充分利用C4.5对多重句义的高精度识别和SVM对简单句义、复杂句义的高精度识别的特点,将C4.5与SVM分别识别的结果进行融合处理.给出最终的句义类型识别结果.识别结果表明,在BFS-CTC汉语标注语料库中,选取了4500个句子,经十折交叉验证,句义类型的识别准确率达到92.1%.

References

[1]  贾彦德.汉语语义学[M].北京:北京大学出版社,2005. Jia Yande. Chinese semantics[M]. Beijing: Peking University Press, 2005. (in Chinese)
[2]  张涛.基于HNC理论的句子语义分析 .北京:北京理工大学出版社,2010. Zhang Tao. Semantic analysis of sentences based on the HNC theory . Beijing: Beijing Institute of Technology Press, 2010.(in Chinese)
[3]  李伟.现代汉语句型自动识别的研究 .厦门:厦门大学出版社,2007. Li Wei. Research on automatic recognition of sentence patterns of modern Chinese . Xiamen: Xiamen University Press,2007.(in Chinese)
[4]  程琪龙.系统功能语法导论[M].汕头:汕头大学出版社, 1994. Cheng Qilong. Introduction to systemic functional grammar[M]. Shantou: Shantou University Press,1994.(in Chinese)
[5]  徐昌火.试论现代汉语核心句的句义结构类型[J].南京师大学报:社会科学版,2002(5):125-131. Xu Changhuo. On the semantic structural patterns of chinese core-sentences[J]. Journal of Nanjing Normal University: Social Science ed, 2002(5):125-131. (in Chinese)
[6]  Quinlan J R. Induction of decision trees[J]. Machine Learning, 1986(1):81-106.
[7]  Vapnik V N. The nature of statistical learning[M]. New York: Theory Springer, 1995
[8]  Li S Z, Guo Guodong. Content-based audio classification and retrieval using SVM learning //Proceedings of ICME(IEEE International Conference on Multimedia and Expo). Tokyo, Japan: IEEE Computer Society 2001 Contents, 2001:749-752.
[9]  罗森林,刘盈盈,冯扬,等.BFS-CTC汉语句义结构标注语料库[J].北京理工大学学报,2012,32(3):311-315. Luo Senlin, Liu Yingying, Feng Yang, et al. Method of building BFS-CTC a Chinese tagged corpus of sentential semantic structure[J]. Transactions of Beijing Institute of Technology, 2012,32(3):311-315. (in Chinese)
[10]  Xue N, Palmer M. Annotating the propositions in the penn Chinese treebank //Proceedings of the 2nd SIGHAN Workshop on Chinese Language Processing. Sapporo, Japan: , 2003:47-54.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133