
Acta Electronica Sinica (电子学报), 2012

A Compressed-Domain Speech Enhancement Method Based on Gaussian Mixture Models

DOI: 10.3969/j.issn.0372-2112.2012.10.022, PP. 2031-2038

Liang Yan (梁岩), Bao Changchun (鲍长春), Xia Bingyin (夏丙寅), He Yuwen (何玉文), Zhou Xuan (周璇), Li Na (李娜)

Keywords: speech enhancement, parameter domain, Gaussian mixture model, Bayesian estimation, discontinuous transmission, frame erasure


Abstract:

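To make effective use of prior knowledge about the immittance spectral frequencies (ISFs) of clean speech, this paper proposes a compressed-domain speech enhancement method based on Gaussian mixture models (GMMs) for the ITU-T G.722.2 wideband speech coding standard. First, the ISFs of the noisy speech, the ISFs of the clean speech, and the corresponding gain adjustment factor are combined into a feature vector, and a GMM is fitted to its probability density. Then, the feature parameters of the clean speech are obtained by optimal Bayesian estimation under the minimum mean-square error (MMSE) criterion. To remain compatible with the discontinuous transmission (DTX) mode of the codec, when the signal being processed is non-speech, the algorithm leaves the spectral envelope parameters of the noise frame unchanged and adjusts the logarithmic frame energy by a fixed ratio. If a frame erasure occurs, the algorithm does not modify the received bitstream; instead, it adjusts the recovered parameters in the same way as for a normal frame in order to update the relevant history. Performance was tested according to ITU-T G.160. The results show that, compared with the reference method, the proposed method achieves greater noise attenuation while preserving its SNR improvement, and that the enhanced speech has better objective quality.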
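
The estimation step described in the abstract can be illustrated with a minimal sketch. It assumes a joint GMM with full covariances over a stacked vector z = [x; y], where x stands for the observed (noisy) features and y for the clean features to be estimated; under the MMSE criterion the estimate is the conditional mean E[y | x]. The function name `gmm_mmse_estimate`, the argument layout, and the toy parameters are hypothetical and chosen for illustration only; the paper's actual feature dimensions, training procedure, and parameter values are not given in the abstract.

```python
import numpy as np


def gmm_mmse_estimate(x, weights, means, covs):
    """Return E[y | x] under a joint GMM over z = [x; y] (illustrative only).

    x       : observed part, shape (dx,)
    weights : mixture weights, shape (K,)
    means   : joint means, shape (K, dx + dy)
    covs    : joint full covariances, shape (K, dx + dy, dx + dy)
    """
    x = np.asarray(x, dtype=float)
    dx = x.shape[0]
    log_resp = []
    cond_means = []
    for w, mu, cov in zip(weights, means, covs):
        mu = np.asarray(mu, dtype=float)
        cov = np.asarray(cov, dtype=float)
        mu_x, mu_y = mu[:dx], mu[dx:]
        s_xx = cov[:dx, :dx]   # covariance of the observed block
        s_yx = cov[dx:, :dx]   # cross-covariance between y and x
        diff = x - mu_x
        # Solve s_xx * a = diff instead of forming an explicit inverse.
        a = np.linalg.solve(s_xx, diff)
        _, logdet = np.linalg.slogdet(s_xx)
        # Log of w_k * N(x; mu_x, s_xx): unnormalised posterior of component k.
        log_resp.append(
            np.log(w) - 0.5 * (diff @ a + logdet + dx * np.log(2.0 * np.pi))
        )
        # Per-component conditional mean of y given x.
        cond_means.append(mu_y + s_yx @ a)
    log_resp = np.array(log_resp)
    # Normalise in the log domain for numerical stability.
    resp = np.exp(log_resp - log_resp.max())
    resp /= resp.sum()
    # MMSE estimate: responsibility-weighted sum of the conditional means.
    return resp @ np.array(cond_means)


if __name__ == "__main__":
    # Toy two-component model with scalar x and scalar y (made-up numbers).
    weights = [0.5, 0.5]
    means = [[-1.0, -1.0], [1.0, 1.0]]
    covs = [np.eye(2), np.eye(2)]
    print(gmm_mmse_estimate([0.3], weights, means, covs))
```

With a single mixture component the estimate reduces to ordinary linear regression of y on x; with several components, each component contributes its own linear predictor, weighted by how well it explains the observed x.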

