%0 Journal Article %T A Robust Front-end for Speech Recognition Based on Computational Auditory Scene Analysis and Speaker Model
基于计算听觉场景分析和语者模型信息的语音识别鲁棒前端研究 %A GUAN Yong LI Peng LIU Wen-Ju XU Bo National Laboratory of Pattern Recognition %A Institute of Automation %A Chinese Academy of Sciences %A Beijing Nokia Research Center %A Beijing Digital Content Technology Research Center %A Beijing %A
关勇 %A 李鹏 %A 刘文举 %A 徐波 %J 自动化学报 %D 2009 %I %X Conventional noise robust speech recognition system does not work well when human speech is presented in the background. In this paper, a computational auditory scene analysis (CASA) and speaker model based speech segregation system is proposed to solve this problem. By utilizing speaker model and factorial-max vector quantization (MAXVQ) to estimate real-value masks in CASA framework, a robust front-end for speech recognition is constructed. Evaluations on speech separation challenge (SSC) showed that the proposed system won 15.68% improvement over the baseline system. The results of evaluation also proved the validity of the multi-speaker recognition and the real-value mask estimation module. %K Computational auditory scene analysis (CASA) %K speech segregation %K robust speech recognition %K factorial-max vector quantization (MAXVQ) %K speaker recognition
计算听觉场景分析 %K 语音分离 %K 鲁棒语音识别 %K 因子最大矢量量化 %K 语者识别 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=E76622685B64B2AA896A7F777B64EB3A&aid=EB9ACF64409212549DCBCECBD2C4E641&yid=DE12191FBD62783C&vid=6209D9E8050195F5&iid=E158A972A605785F&sid=FED44C0135DC1D9C&eid=1FF3CD54EFC256A1&journal_id=0254-4156&journal_name=自动化学报&referenced_num=1&reference_num=0