- 2015
Design and Implementation of a Spoken Dialogue System for Flight Booking
Abstract:
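This paper presents the design and implementation of a spoken dialogue system for flight booking, which lets users query and book flights in spoken Mandarin. The focus is on the spoken language understanding (SLU) component. To keep speech recognition errors from propagating into semantic understanding errors, a two-stage Chinese SLU method based on word confusion networks is proposed. In the first stage, N-grams selected from the word confusion network serve as classification features for topic classification, and a semantic classification model then parses the utterance to obtain the corresponding semantic tree structure. In the second stage, a rule-based slot filler extracts the attribute-value pairs of the semantic slots. The method is data-driven, and its training data are relatively easy to annotate. Experiments in the Chinese flight booking domain show that, when the character error rate of speech recognition is high, the method is more robust than a traditional grammar-rule-based language understanding method and clearly improves semantic understanding accuracy.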
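The two stages described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The confusion network layout (one list of `(word, posterior)` alternatives per position), the pruning threshold `min_weight`, the city lexicon, the slot names, and the regular-expression rules are all assumptions made for illustration. The topic and semantic classifiers that would consume the N-gram features are omitted.

```python
import re
from itertools import product
from typing import Dict, List, Tuple

# Assumed layout: one list of (word, posterior) alternatives per position.
ConfusionNetwork = List[List[Tuple[str, float]]]

# Toy network for "从北京到上海" (from Beijing to Shanghai) with ASR confusions.
EXAMPLE_WCN: ConfusionNetwork = [
    [("从", 0.9), ("冲", 0.1)],
    [("北京", 0.7), ("背景", 0.3)],
    [("到", 0.8), ("倒", 0.2)],
    [("上海", 0.6), ("商海", 0.4)],
]


def ngram_features(wcn: ConfusionNetwork, max_n: int = 2,
                   min_weight: float = 0.05) -> Dict[Tuple[str, ...], float]:
    """Stage 1 (features only): collect N-grams over adjacent positions.

    Each N-gram is weighted by the product of its word posteriors, and
    N-grams whose weight falls below min_weight are pruned.
    """
    features: Dict[Tuple[str, ...], float] = {}
    for n in range(1, max_n + 1):
        for start in range(len(wcn) - n + 1):
            for path in product(*wcn[start:start + n]):
                weight = 1.0
                for _, posterior in path:
                    weight *= posterior
                if weight >= min_weight:
                    words = tuple(word for word, _ in path)
                    features[words] = features.get(words, 0.0) + weight
    return features


def best_path(wcn: ConfusionNetwork) -> str:
    """Join the highest-posterior word at every position."""
    return "".join(max(slot, key=lambda alt: alt[1])[0] for slot in wcn)


# Hypothetical city lexicon and slot rules, for illustration only.
CITY = "北京|上海|广州"
SLOT_RULES = {
    "from_city": re.compile(f"从({CITY})"),
    "to_city": re.compile(f"(?:到|飞)({CITY})"),
}


def fill_slots(text: str) -> Dict[str, str]:
    """Stage 2: rule-based extraction of slot attribute-value pairs."""
    slots: Dict[str, str] = {}
    for name, pattern in SLOT_RULES.items():
        match = pattern.search(text)
        if match:
            slots[name] = match.group(1)
    return slots
```

In this sketch, `ngram_features(EXAMPLE_WCN)` keeps competing hypotheses such as `("从", "背景")` as lower-weighted features rather than committing to a single recognition result, which is the motivation for working on confusion networks. The features would then be passed to a classifier, and `fill_slots(best_path(EXAMPLE_WCN))` would extract the departure and arrival cities.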