Acta Electronica Sinica, 2014

A Speech Enhancement Method Based on AR-HMM with Online Energy Adjustment

DOI: 10.3969/j.issn.0372-2112.2014.10.019, PP. 1991-1997

He Yuwen, Bao Changchun, Xia Bingyin

Keywords: speech enhancement, non-stationary noise, hidden Markov model, Gaussian mixture model

Abstract:

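Single-channel speech enhancement techniques track non-stationary noise inaccurately and therefore suppress it poorly. To address this problem, this paper proposes a speech enhancement method based on online energy adjustment. The method uses normalized critical-band energies as features and classifies the background noise with a Gaussian mixture model (GMM). Using the auto-regressive hidden Markov model (AR-HMM) of the identified noise type together with the AR-HMM of clean speech, it estimates the power spectra of speech and noise under the minimum mean square error (MMSE) criterion. Because the training and test sets differ in non-stationary environments, the energies in the speech model and the noise model must be adjusted online. The energy of the speech model is adjusted with an iterative expectation-maximization (EM) algorithm. The energy of the noise model is adjusted with the energy re-estimation method used during model training, and the initial value of the noise energy adjustment is determined by the minima controlled recursive averaging (MCRA) algorithm. The algorithm was evaluated under the ITU-T G.160 standard. The results show that the proposed method tracks non-stationary noise relatively well, achieves relatively large noise attenuation, and has a relatively short convergence time.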
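The noise-classification step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the band edges, the GMM parameters, and the function names (`normalized_band_energies`, `gmm_log_likelihood`, `classify_noise`) are all hypothetical, and the bands here are coarse FFT-bin ranges rather than true critical bands. The sketch only shows the general idea of scoring normalized band-energy features against one diagonal-covariance GMM per noise type and picking the best-scoring type. The AR-HMM estimation and online energy adjustment stages are not covered.

```python
import numpy as np


def normalized_band_energies(frame, band_edges):
    """Sum the power spectrum over each band and normalize so the bands sum to 1.

    band_edges is a hypothetical list of FFT-bin boundaries standing in for
    critical-band edges.
    """
    power = np.abs(np.fft.rfft(frame)) ** 2
    energies = np.array(
        [power[lo:hi].sum() for lo, hi in zip(band_edges[:-1], band_edges[1:])]
    )
    return energies / max(energies.sum(), 1e-12)


def gmm_log_likelihood(x, weights, means, variances):
    """Log-likelihood of one feature vector under a diagonal-covariance GMM."""
    diff = x - means  # shape (K, D): one row per mixture component
    log_comp = (
        np.log(weights)
        - 0.5 * np.sum(np.log(2.0 * np.pi * variances), axis=1)
        - 0.5 * np.sum(diff ** 2 / variances, axis=1)
    )
    peak = log_comp.max()  # log-sum-exp for numerical stability
    return peak + np.log(np.exp(log_comp - peak).sum())


def classify_noise(features, models):
    """Return the noise type whose GMM gives the highest total log-likelihood."""
    scores = {
        name: sum(gmm_log_likelihood(x, *params) for x in features)
        for name, params in models.items()
    }
    return max(scores, key=scores.get)


# Illustrative single-component models (made-up parameters, not trained).
models = {
    "low_freq": (
        np.array([1.0]),
        np.array([[0.90, 0.05, 0.03, 0.02]]),
        np.full((1, 4), 0.01),
    ),
    "high_freq": (
        np.array([1.0]),
        np.array([[0.02, 0.03, 0.05, 0.90]]),
        np.full((1, 4), 0.01),
    ),
}

band_edges = [0, 8, 16, 32, 65]  # hypothetical bands for a 128-sample frame
frame = np.sin(2.0 * np.pi * 4 * np.arange(128) / 128)  # energy in FFT bin 4
print(classify_noise([normalized_band_energies(frame, band_edges)], models))
```

In a system like the one the abstract describes, the selected label would then choose which pre-trained noise AR-HMM is paired with the clean-speech AR-HMM. In practice the GMMs would be trained on recorded noise rather than specified by hand.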
