Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2012

Factor Analysis in Language Identification Based on Phoneme Recognition

pp. 105-110

Zhong Haibing, Song Yan, Dai Lirong

Keywords: automatic language identification, factor analysis, phoneme recognizer


Abstract:

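In a language identification system based on phoneme recognition, the phoneme recognition result for a given speech segment is affected by nuisance factors such as the speaker and the channel. To address this, the paper builds a feature vector for each speech segment from its phonotactic (phoneme co-occurrence) relations. In this vector space, factor analysis is used to build a mathematical model of the noise subspace, which is then removed during both language model training and recognition. On the NIST LRE 2007 test task, the method effectively improves performance over the phoneme-recognition-based baseline system. In the 30 s test condition, the equal error rates of the phoneme-recognition-based language model and the phoneme-recognition-based support vector machine model are reduced by 14.4% and 12.9% relative, respectively.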
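
The two ideas in the abstract, a phonotactic feature vector per segment and removal of a nuisance subspace from it, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the toy two-phone inventory, and the hand-picked nuisance direction are all hypothetical. In the paper the subspace is estimated by factor analysis; here it is simply assumed to be given, and removal is done by plain orthogonal projection.

```python
from collections import Counter
from itertools import product
import math


def bigram_vector(phones, inventory):
    """Relative frequencies of all phone bigrams, in a fixed order."""
    counts = Counter(zip(phones, phones[1:]))
    total = sum(counts.values()) or 1  # avoid division by zero on short input
    return [counts[pair] / total for pair in product(inventory, repeat=2)]


def orthonormalize(directions):
    """Gram-Schmidt: turn nuisance directions into an orthonormal basis."""
    basis = []
    for d in directions:
        v = list(d)
        for b in basis:
            dot = sum(x * y for x, y in zip(v, b))
            v = [x - dot * y for x, y in zip(v, b)]
        norm = math.sqrt(sum(x * x for x in v))
        if norm > 1e-12:  # skip directions already spanned by the basis
            basis.append([x / norm for x in v])
    return basis


def remove_nuisance(vec, basis):
    """Subtract the component of vec that lies in the span of basis."""
    out = list(vec)
    for b in basis:
        dot = sum(x * y for x, y in zip(vec, b))
        out = [o - dot * y for o, y in zip(out, b)]
    return out


# Toy example: a decoded phone string over a hypothetical inventory.
inventory = ["a", "b"]
vec = bigram_vector(["a", "b", "a", "b", "b"], inventory)

# Assumed nuisance direction (hypothetical, not estimated from data).
basis = orthonormalize([[0.0, 1.0, -1.0, 0.0]])
cleaned = remove_nuisance(vec, basis)
print(vec, cleaned)
```

The same projection would be applied to both training and test vectors before they reach the language model or SVM, mirroring the abstract's statement that the subspace is removed in both training and recognition.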
