全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

A Noise Suppression Method for Speech Signal by Jointly Using Bayesian Estimation and Fuzzy Theory

DOI: 10.4236/jsea.2021.1412037, PP. 631-645

Keywords: Air- and Bone-Conducted Speeches, Noise Suppression, Bayesian Estimation, Fuzzy Data

Full-Text   Cite this paper   Add to My Lib

Abstract:

Speech recognition systems have been applied to inspection and maintenance operations in industrial factories to recording and reporting routines at construction sites, etc. where hand-writing is difficult. In these actual circumstances, some countermeasure methods for surrounding noise are indispensable. In this study, a new method to remove the noise for actual speech signal was proposed by using Bayesian estimation with the aid of bone-conducted speech and fuzzy theory. More specifically, by introducing Bayes’ theorem based on the observation of air-conducted speech contaminated by surrounding background noise, a new type of algorithm for noise removal was theoretically derived. In the proposed noise suppression method, bone-conducted speech signal with the reduced high-frequency components was regarded as fuzzy observation data, and a stochastic model for the bone-conducted speech was derived by applying the probability measure of fuzzy events. The proposed method was applied to speech signals measured in real environment with low SNR, and better results were obtained than an algorithm based on observation of only air-conducted speech.

References

[1]  Yamashita, K. and Shimamura, T. (2005) Nonstationary Noise Estimation Using Low-Frequency Regions for Spectral Subtraction. IEEE Signal Processing Letters, 12, 465-468.
https://doi.org/10.1109/LSP.2005.847864
[2]  Plapous, C., Marro, C. and Scalart, P. (2006) Improved Signal-to-Noise Ratio Estimation for Speech Enhancement. IEEE Transactions on Speech and Audio Processing, 14, 2098-2108.
https://doi.org/10.1109/TASL.2006.872621
[3]  McCowan, I.A. and Bourlard, H. (2003) Microphone Array Post-Filter Based on Noise Field Coherence. IEEE Transactions on Speech and Audio Processing, 11, 709-716.
https://doi.org/10.1109/TSA.2003.818212
[4]  Kawamura, A., Fujii, K., Itoh, Y. and Fukui, Y. (2002) A Noise Reduction Method Based on Linear Prediction Analysis. IEICE Transactions on Fundamentals, J85-A, 415-423.
https://doi.org/10.1109/ICASSP.2002.1004860
[5]  Kawamura, A., Fujii, K. and Itoh, Y. (2005) A Noise Reduction Method Based on Linear Prediction with Variable Step-Size. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E88-A, 855-861.
https://doi.org/10.1093/ietfec/e88-a.4.855
[6]  Kim, W. and Ko, H. (2001) Noise Variance Estimation for Kalman Filtering of Noisy Speech. IEICE Transactions on Information and Systems, E84-D, 155-160.
[7]  Li, H., Wang, X., Dai, B. and Lu, W. (2007) A Kalman Smoothing Algorithm for Speech Enhancement Based on the Properties of Vocal Tract Varying Slowly. Proceedings of Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, Qingdao, 30 July-1 Aug. 2007, 832-836.
[8]  Tanabe, N., Furukawa, T. and Tsuji, S. (2008) Robust Noise Suppression Algorithm with the Kalman Filter Theory for White and Colored Disturbance. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E91-A, 818-829.
https://doi.org/10.1093/ietfec/e91-a.3.818
[9]  Jia, H., Zhang, X. and Jin, C. (2009) A Modified Speech Enhancement Algorithm Based on the Subspace. Proceedings of 2009 Second International Symposium on Knowledge Acquisition and Modeling, Wuhan, 30 November-1 December 2009, 344-347.
https://doi.org/10.1109/KAM.2009.19
[10]  Candy, J.V. (2009) Bayesian Signal Processing: Classical, Modern, and Particle Filtering Methods. John Wiley & Sons Ltd., Hoboken.
https://doi.org/10.1002/9780470430583
[11]  Ikuta, A. and Orimoto, H. (2011) Adaptive Noise Suppression Algorithm for Speech Signal Based on Stochastic System Theory. IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences, E94-A, 1618-1627.
https://doi.org/10.1587/transfun.E94.A.1618
[12]  Ikuta, A., Orimoto, H. and Gallagher, G. (2018) Noise Suppression Method by Jointly Using Bone- and Air-Conducted Speech Signals. Noise Control Engineering Journal, 66, 472-488.
https://doi.org/10.3397/1/376640
[13]  Orimoto, H., Ikuta, A. and Hasegawa, K. (2021) Speech Signal Detection Based on Bayesian Estimation by Observing Air-Conducted Speech under Existence of Surrounding Noise with the Aid of Bone-Conducted Speech. Intelligent Information Management, 13, 199-213.
https://doi.org/10.4236/iim.2021.134011
[14]  Shin, H.S., Kang, H.G. and Fingscheidt, T. (2012) Survey of Speech Enhancement Supported by a Bone Conduction Microphone. Proceedings of 10th ITG Conference on Speech Communication, Braunschweig, 26-28 September 2012, 47-50.
[15]  Ikuta, A. and Orimoto, H. (2014) Fuzzy Signal Processing of Sound and Electromagnetic Environment by Introducing Probability Measure of Fuzzy Events. Proceedings of International Conference on Fuzzy Computation Theory and Applications, Rome, 22-24 October 2014, 5-13.
[16]  Orimoto, H. and Ikuta, A. (2012) Prediction of Response Probability Distribution by Considering Additive Property of Energy and Evaluation in Decibel Scale for Sound Environment System with Unknown Structure. Transactions of the Society of Instrument and Control Engineers, 48, 830-836.
https://doi.org/10.9746/sicetr.48.830
[17]  Orimoto, H. and Ikuta, A. (2019) State Estimation for Sound Environment System with Nonlinear Observation Characteristics by Introducing Wide-Sense Particle Filter. Intelligent Information Management, 11, 87-101.
https://doi.org/10.4236/iim.2019.116008

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133