全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

基于词袋绑定的问句新特征自动生成

Keywords: 问答系统,问句分类,特征提取,词袋绑定

Full-Text   Cite this paper   Add to My Lib

Abstract:

针对中文问句分类缺乏丰富的句法语义特征,提出一种基于词袋绑定的问句新特征自动生成方法.在词袋(BOW)、词性(POS)和词义(WS)等基本特征的基础上,通过将词性、词义等与词袋分别进行绑定,自动获取一类新的问句特征即词袋绑定特征.采用SVM分类器在哈工大中文问句集上实验,结果表明与原来单个的POS、WS等基本特征相比,对应的W/POS、W/WS等词袋绑定特征在分类精度上均获得了显著的提升;而且对这些词袋绑定特征进行启发式组合以后,在77个小类问题类别的总体分类精度达到82.333%,取得了较好的分类效果.说明在基本特征基础上借助词袋绑定操作进一步构造问句新特征的方法简单而有效.

References

[1]  文勖,张宇,刘挺,等.基于句法结构分析的中文问题分类[J].中文信息学报,2006,20(2):33-39. Wen Xu, Zhang Yu, Liu Ting, et al. Syntactic structure parsing based Chinese question classification[J]. Journal of Chinese Information Processing, 2006,20(2):33-39. (in Chinese)
[2]  张志昌,张宇,刘挺,等.基于线索词识别和训练集扩展的中文问题分类[J].高技术通讯,2009,19(2):111-118. Zhang Zhichang, Zhang Yu, Liu Ting, et al. Chinese question classification based on identification of cue words and extension of training set[J]. Chinese High Technology Letters, 2009,19(2):111-118. (in Chinese)
[3]  余正涛,樊孝忠,郭剑毅.基于支持向量机的汉语问句分类[J].华南理工大学学报:自然科学版,2005,33(9):25-29. Yu Zhengtao, Fan Xiaozhong, Guo Jianyi.Chinese question classification based on support vector machine[J]. Jounal of South China University of Technology: Natural Science ed, 2005,33(9):25-29. (in Chinese)
[4]  贾可亮,樊孝忠,许进忠.基于KNN的汉语问句分类[J].微电子学与计算机,2008,25(1):111-118. Jia Keliang, Fan Xiaozhong, Xu Jinzhong. Chinese question classification based on KNN[J]. Microelectronics & Computer, 2008,25(1):111-118. (in Chinese)
[5]  孙景广,蔡东风,吕德新,等.基于知网的中文问题自动分类[J].中文信息学报,2007,21(1):90-96. Sun Jingguang, Cai Dongfeng, Lü Dexin, et al. HowNet based Chinese question automatic classification[J]. Journal of Chinese Information Processing, 2007,21(1):90-96. (in Chinese)
[6]  张志昌,张宇,刘挺,等.开放域问答技术研究进展[J].电子学报,2009,37(5):1058-1069. Zhang Zhichang, Zhang Yu, Liu Ting, et al. Advances in open-domain question answering[J]. Acta Electronica Sinica,2009,37(5):1058-1069. (in Chinese)
[7]  Moldovan D, Pasca M, Harabagiu S, et al. Performance issues and error analysis in an open-domain question answering system[J].ACM Transactions on Information Systems, 2003,21(2):133-154.
[8]  Li X, Roth D. Learning question classifiers //Proceedings of the 19th International Conference on Computational Linguistics (COLING2002). Taipei: Association for Computational Linguistics, 2002:1-7.
[9]  Li X, Roth D. Learning question classifiers: the role of semantic information[J]. Journal of Natural Language Engineering, 2006,12(3):229-250.
[10]  Zhang D, Lee W. Question classication using support vector machines //Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Informaion Retrieval (SIGIR 2003). Toronto, Canada: ACM, 2003:26-32.
[11]  Huang Zhiheng, Thint M, Qin Zengchang. Question classification using head words and their hypernyms //Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing(EMNLP). Honolulu: Association for Computational Linguistics, 2008:927-936.
[12]  Huang Zhiheng, Thint M, Celikyilmaz A. Investigation of question classifier in question answering //Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing(EMNLP). Singapore: Association for Computational Linguistics, 2009:543-550.
[13]  Li Fangtao, Zhang Xian, Yuan Jinhui, et al. Classifying what-type questions by head noun tagging //Proceedings of the 22nd International Conference on Computational Linguistics (COLING). Manchester: Association for Computational Linguistics, 2008:481-488.
[14]  李鑫,黄萱菁,吴立德.基于错误驱动算法组合分类器及其在问题分类中的应用[J].计算机研究与发展,2008,45(3):535-541. Li Xin, Huang Xuanjing, Wu Lide. Combined multiple classifiers based on TBL algorithm and their application in question classification[J]. Journal of Computer Research and Development, 2008,45(3):535-541. (in Chinese)
[15]  Wu Youzheng, Zhao Jun, Xu Bo. Chinese question classification from approach and semantic views //Proceedings of the Second Asia Information Retrieval Symposium. Ieju Island, Korea: , 2005:485-490.
[16]  张宇,刘挺,文勖.基于改进贝叶斯模型的问题分类[J].中文信息学报,2005,19(2):100-105. Zhang Yu, Liu Ting, Wen Xu. Modified Bayesian model based question classification[J]. Journal of Chinese Information Processing, 2005,19(2):100-105. (in Chinese)
[17]  Yu Zhengtao, Su Lei, Li Lina, et al. Question classification based on co-training style semi-supervised learning[J].Pattern Recognition Letters, 2010,31:1975-1980.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133