%0 Journal Article %T Dynamic Channel Compensation Based on Statistical Model for Mandarin Speech Recognition over Telephone
电话语音识别中基于统计模型的动态通道 %A Han Zhao-bing %A Zhang Hua-yun %A Zhang Shu-wu %A Xu Bo %A
韩兆兵 %A 张化云 %A 张树武 %A 徐波 %J 电子与信息学报 %D 2004 %I %X Automatic speech recognition in telecommunications environment still has a lower correct rate compared to its desktop pairs. Improving the performance of telephonequality speech recognition is an urgent problem for its application in those practical fields.Previous works have shown that the main reason for this performance degradation is the var ational mismatch caused by different telephone channels between the testing and training sets. In this paper, they propose an efficient implementation to dynamically compensate this mismatch based on a phone-conditioned prior statistic model for the channel bias.This algorithm uses Bayes' rule to estimate telephone channels and dynamically follows the time-variations within the channels. In their experiments on mandarin Large Vocabulary Continuous Speech Recognition (LVCSR) over telephone lines, the average Character Error Rate (CER) decreases more than 27% when applying this algorithm; in short utterance test,the Word-Error-Rate(WER) relatively reduced 30%. At the same time, the structural delay and computational consumptions required by this algorithm are limited. The average delay is about 200 ms. So it could be embedded into practical telephone-based applications. %K Telephone speech recognition %K Dynamic channel compensation %K Maximum-Likelihood(ML)estimation %K Maximum A Posteriori(MAP)estimation
电话语音识别 %K 动态通道补偿 %K 最大似然估计 %K 最大后验估计 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=1319827C0C74AAE8D654BEA21B7F54D3&jid=EFC0377B03BD8D0EF4BBB548AC5F739A&aid=F7636FB55F4B86CC&yid=D0E58B75BFD8E51C&vid=96C778EE049EE47D&iid=708DD6B15D2464E8&sid=7128E4A5513059D9&eid=4E17F6A5D7499FF3&journal_id=1009-5896&journal_name=电子与信息学报&referenced_num=0&reference_num=10