OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

北京理工大学学报 2014

融合GMM及SVM的特定音频事件高精度识别方法

罗森林,王坤,谢尔曼,潘丽敏,李金玉

Keywords: 音频识别,高斯混合模型（GMM）,支持向量机（SVM）,Mel频率倒谱系数（MFCC）,特定音频事件

Full-Text Cite this paper Add to My Lib

Abstract:

针对特定音频事件识别中持续时间特别短的音频事件漏检概率高、识别速度较慢的问题，提出一种融合高斯混合模型（GMM）及支持向量机（SVM）的特定音频事件识别算法.该方法利用GMM的统计分布描述能力和SVM的推广泛化能力，将GMM和SVM分别识别的结果进行融合处理，以手枪、步枪、机关枪等10类以上枪声为实验数据，无需针对每种枪声生成相应的识别模板，仅需训练生成2个识别模板.实验结果表明，识别准确率达到92.71%.该方法模板数量少，不需要多次训练，算法复杂度较低，不仅便于应用而且可大幅提升识别效率.

References

[1]	Xiong Ziyou, Radhakrishnan R, Divakaran A, et al. Audio events detection based highlights extraction from baseball, golf and soccer games in a unified framework[C]//Proceedings of 2003 IEEE International Conference on Multimedia and Expo. Hong Kong: ICME, 2003:401-404.
[2]	Shirazi J, Ghaemmaghami S. Audio classification based on sinusoidal model: a new feature[C]//Proceedings of Conference on TENCON 2008. Hyderabad: IEEE, 2008:1-5.
[3]	Trancoso I, Pellegrini T, Portelo J, et al. Audio contributions to semantic video search[C]//Proceedings of 2009 IEEE International Conference on Multimedia and Expo (ICME). New York, 2009:630-633.
[4]	Li Lu, Ge Fengpei, Zhao Qingwei, et al. A SVM-based audio event detection system[C]//Proceedings of 2010 International Conference on Electrical and Control Engineering (ICECE 2010). Wuhan, China: Electrical and Control Engineering (ICECE), 2010:292-295.
[5]	Dhanalakshmi P, Palanivel S, Ramalingam V. Classification of audio signals using SVM and RBFNN[J]. Expert Systems with Applications, 2009, 36:6069-6075.
[6]	Pikrakis A, Giannakopoulos T, Theodoridis S. Gunshot detection in audio streams from movies by means of dynamic programming and Bayesian networks[C]//Proceedings of Acoustics, Speech, and Signal Processing. Piscataway, USA:[s.n.], 2008:21-24.
[7]	罗森林, 李金玉, 潘丽敏.特定类型音频流泛化识别方法[J].北京理工大学学报, 2011(10):1231-1235. Luo Senlin, Li Jinyu, Pan Limin. A generic method of recognition specific type audio stream[J]. Transactions of Beijing Institute of Technology, 2011(10):1231-1235. (in Chinese)
[8]	杨行峻, 迟惠生.语音信号数字处理[M].北京:电子工业出版社, 1994. Yang Xingjun, Chi Huisheng. Digital processing of speech signals[M]. Beijing: Publishing House of Electronics Industry, 1994. (in Chinese)
[9]	吕霄云, 王宏霞.基于MFCC和GMM的异常声音识别算法研究[D].成都:西南交通大学, 2010. Lü Xiaoyun, Wang Hongxia. Rearch on abnormal audio recognition algorithm based on MFCC and GMM[D]. Chengdu: Southwest Jiaotong University, 2010. (in Chinese)
[10]	牛滨, 孔令志, 罗森林, 等.基于MFCC和GMM的个性音乐推荐模型[J].北京理工大学学报, 2009(4):351-355. Niu Bin, Kong Lingzhi, Luo Senlin, et al. Individuality music recommendation model based on MFCC and GMM[J]. Transactions of Beijing Institute of Technology, 2009(4):351-355. (in Chinese)
[11]	Schapire Robert E, Singer Yoram. Improved boosting algorithms using confidence-rated predictions[J]. Machine Learning, 1999, 37:297-336.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133