Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2015

A Term-Specific Threshold Method Based on an Improved Score Distribution

DOI: 10.16451/j.cnki.issn1003-6059.201505007, pp. 437-442

Lu Lihua (陆梨花), Zhang Lianhai (张连海)

Keywords: score distribution, term-specific threshold, K-means clustering, spoken term detection

Abstract:

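To improve the accuracy of spoken term detection systems, a term-specific threshold method based on an improved score distribution is proposed. At the decision stage, a different threshold is set for each query term according to the distribution of its posterior scores. The posterior score distribution is described by an exponential mixture model whose parameters are estimated with the unsupervised expectation-maximization (EM) algorithm, and the threshold is then computed under the Bayes minimum-risk criterion. Because EM is sensitive to its initial values, K-means clustering replaces random initialization: the scores of the candidate results are first split into two clusters, then the prior of each cluster is computed and the initial values of the model parameters are estimated by maximum likelihood. Experimental results show that the proposed threshold method achieves better retrieval performance.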
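
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions, since the abstract does not give them: false-alarm scores are modelled as a decaying exponential in s, correct-hit scores as a decaying exponential in (1 - s), the densities are not renormalized to [0, 1], and the two error costs default to equal values. All function names (`kmeans_1d_two_clusters`, `init_from_kmeans`, `fit_em`, `bayes_threshold`) are hypothetical, and the scores are synthetic.

```python
import numpy as np

EPS = 1e-6  # numerical guard used throughout (assumption, not from the paper)

def kmeans_1d_two_clusters(scores, n_iter=50):
    """Split 1-D scores into a low (0) and a high (1) cluster with Lloyd iterations."""
    centers = np.array([scores.min(), scores.max()], dtype=float)
    for _ in range(n_iter):
        labels = (np.abs(scores - centers[1]) < np.abs(scores - centers[0])).astype(int)
        new_centers = centers.copy()
        for k in (0, 1):
            if np.any(labels == k):
                new_centers[k] = scores[labels == k].mean()
        if np.allclose(new_centers, centers):
            break
        centers = new_centers
    return labels

def init_from_kmeans(scores):
    """Cluster priors plus maximum-likelihood exponential rates per cluster."""
    labels = kmeans_1d_two_clusters(scores)
    low, high = scores[labels == 0], scores[labels == 1]
    pi1 = float(np.clip(len(high) / len(scores), EPS, 1 - EPS))
    lam0 = 1.0 / max(low.mean(), EPS) if len(low) else 1.0
    lam1 = 1.0 / max((1.0 - high).mean(), EPS) if len(high) else 1.0
    return pi1, lam0, lam1

def fit_em(scores, n_iter=100, tol=1e-8):
    """EM for the assumed mixture: (1-pi1)*lam0*exp(-lam0*s) + pi1*lam1*exp(-lam1*(1-s))."""
    pi1, lam0, lam1 = init_from_kmeans(scores)
    for _ in range(n_iter):
        # E-step: posterior probability that each score comes from a correct hit
        p0 = (1 - pi1) * lam0 * np.exp(-lam0 * scores)
        p1 = pi1 * lam1 * np.exp(-lam1 * (1 - scores))
        r = p1 / (p0 + p1 + 1e-300)
        # M-step: weighted maximum-likelihood updates, clipped for stability
        new = (
            float(np.clip(r.mean(), EPS, 1 - EPS)),
            float(np.clip((1 - r).sum() / max(((1 - r) * scores).sum(), EPS), EPS, 1 / EPS)),
            float(np.clip(r.sum() / max((r * (1 - scores)).sum(), EPS), EPS, 1 / EPS)),
        )
        converged = np.allclose(new, (pi1, lam0, lam1), rtol=tol, atol=tol)
        pi1, lam0, lam1 = new
        if converged:
            break
    return pi1, lam0, lam1

def bayes_threshold(pi1, lam0, lam1, cost_fa=1.0, cost_miss=1.0):
    """Score at which the expected cost of a false alarm equals that of a miss."""
    log_ratio = np.log(cost_fa * (1 - pi1) * lam0) - np.log(cost_miss * pi1 * lam1)
    return float(np.clip((lam1 + log_ratio) / (lam0 + lam1), 0.0, 1.0))

# Synthetic candidate scores for a single query term (illustrative only).
rng = np.random.default_rng(0)
false_alarms = np.clip(rng.exponential(0.1, size=300), 0.0, 1.0)
correct_hits = np.clip(1.0 - rng.exponential(0.1, size=100), 0.0, 1.0)
scores = np.concatenate([false_alarms, correct_hits])

pi1, lam0, lam1 = fit_em(scores)
theta = bayes_threshold(pi1, lam0, lam1)
print(f"pi1={pi1:.3f} lam0={lam0:.2f} lam1={lam1:.2f} threshold={theta:.3f}")
```

In this sketch a candidate is accepted when its score exceeds `theta`, and the fit is repeated per query term so that each term receives its own threshold. Raising `cost_fa` relative to `cost_miss` moves the threshold upward, which is one way to trade false alarms against misses under the minimum-risk rule.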

