OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

自动化学报 2012

A Novel Large Vocabulary Continuous Speech Recognition Algorithm Combined with Language Recognition
一种联合语种识别的新型大词汇量连续语音识别算法

SHAN Yu-Xiang,DENG Yan,LIU Jia,
单煜翔,邓妍,刘加

Keywords: Speech recognition,language recognition,out-of-language problem,phone lattice reconstruction
语音识别,语种识别,集外语种问题,音素格重构

Full-Text Cite this paper Add to My Lib

Abstract:

In this paper, a novel large vocabulary continuous speech recognition (LVCSR) algorithm combined with language recognition is proposed, and a real-time processing system is developed. This algorithm can make full use of phonetic hypotheses collected during decoding, and identify language types simultaneously. In a multilingual environment, this algorithm can not only take the place of a standalone language recognizer at a lower system overall computational cost, but also effectively cope with the case where target and non-target languages mix in a single utterance. It can significantly reduce speech recognition error introduced by non-target language, and avoid error accumulation which may mislead the subsequent decoding procedure. In order to tightly combine the content and language recognition into a unified decoding procedure, three different phone lattice reconstruction algorithms are also proposed to eliminate pronunciation and grammar restrictions introduced by the target language's dictionary and language model of the LVCSR decoder, and to encode lattices with richer phonetic information. Experiments show that the lattice reconstruction algorithms can significantly improve language recognition accuracy in the combined recognition. Evaluated on a Mandarin/English mixed conversational telephone speech corpus where Mandarin is the target language, the proposed algorithms reduced the recognition error introduced by non-target language by 91.76%, and achieved a character error rate of 54.98%.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

A Novel Large Vocabulary Continuous Speech Recognition Algorithm Combined with Language Recognition一种联合语种识别的新型大词汇量连续语音识别算法

A Novel Large Vocabulary Continuous Speech Recognition Algorithm Combined with Language Recognition
一种联合语种识别的新型大词汇量连续语音识别算法