OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

模式识别与人工智能 2006

融合段长信息的中、英文语种辨识*

, PP. 567-571

孙健,王作英

Keywords: 语种辨识,基于段长分布的隐含Markov模型(DDBHMM),Gauss混合模型,连续音素识别,大词汇量连续语音识别(LVCSR)

Full-Text Cite this paper Add to My Lib

Abstract:

状态的段长信息反映语言发音变化速率的信息.不同语言的发音速率也存在着差异,因此状态的段长信息可以作为区分语种的信息之一.本文在建立基于段长分布的隐含Markov模型(DDBHMM)的音素识别系统和大词汇量连续语音识别(LVCSR)系统的基础上进行中、英文语种辨识,表明DDBHMM可以准确描述状态的段长信息,改善中、英文语种的辨识性能.

References

[1]	Zissman M A, Berkling K M. Automatic Language Identification. Speech Communication, 2001, 35(1/2): 115-124
[2]	Zissman M A. Automatic Language Identification Using Gauss Mixture and Hidden Markov Models // Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Minneapolis, USA, 1993, Ⅱ: 399-402
[3]	House A S, Neuburg E P. Toward Automatic Identification of the Language of an Utterance. I. Preliminary Methodological Considerations. Journal of Acoustical Society of America, 1977, 62(3): 708-713
[4]	Muthusam Y K, Jain N, Cole R A. Perceptual Benchmarks for Automatic Language Identification // Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Adelaide, Australia, 1994, Ⅰ: 333-336
[5]	Lamel L F, Gauvain J L. Cross-Lingual Experiments with Phone Recognition // Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Minneapolis, USA, 1993, Ⅱ: 507-510
[6]	Kwan H K, Hirose K. Use of Recurrent Network for Unknown Language Rejection in Language Identification System // Proc of the 5th European Conference on Speech Communication and Technology. Rhodes, Greece, 1997, Ⅰ: 63-67
[7]	Wang Zuoying, Gao Hongge. An Inhomogeneous HMM Speech Recognition Algorithm. Chinese Journal of Electronic. 1998, 7(1): 73-77
[8]	Dalsgaard P, Andersen O. Identification of Mono-and Poly-Phonemes Using Acoustic-Phonetic Features Derived by a Self-Organizing Neural Network // Proc of the International Conference on Spoken Language Processing. Banff, Canada, 1992: 547-550
[9]	Kadambe S, Hieronymus J L. Language Identification with Phonological and Lexical Models // Proc of the IEEE International Conference on Acoustic, Speech, and Signal Processing. Detroit, USA, 1995, Ⅴ: 3507-3511
[10]	Mendoza S, Gillick L, Ito Y, et al. Automatic Language Identification Using Large Vocabulary Continuous Speech Recognition // Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Atlanta, USA, 1996, Ⅱ: 785-788
[11]	Schultz T, Rogina I, Waibel A. LVCSR-Based Language Identification // Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Atlanta, USA, 1996, Ⅱ: 781-784
[12]	Schultz T, Waibel A. Language Independent and Language Adaptive Large Vocabulary Speech Recognition // Proc of the International Conference on Spoken Language Processing. Sydney, Australia, 1998, Ⅴ: 1819-1823
[13]	Hieronymus J L, Kadamebe S. Robust Spoken Language Identification Using Large Vocabulary Speech Recognition // Proc of the IEEE International Conference on Acoustics, Speech, and Signal Processing. Munich, Germany, 1997, Ⅱ: 1111-1114
[14]	Wang Zuoying, Xiao Xi. Duration Distribution Based HMM Speech Recognition Models. Acta Electronica Sinica, 2004, 32(1): 46-50 (in Chinese) (王作英,肖熙. 基于段长分布的HMM语音识别模型. 电子学报, 2004, 32(1): 46-50)

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133