OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

科技导报 2015

基于组合核函数SVM的说话人识别方法

DOI: 10.3981/j.issn.1000-7857.2015.01.016, PP. 90-94

樊持杰,司巧梅,徐岩,张丹,蔡春华,于旭

Keywords: 说话人识别,支持向量机,组合核函数,多重网格搜索

Full-Text Cite this paper Add to My Lib

Abstract:

鉴于应用支持向量机进行说话人识别过度依赖于选择核函数的问题,提出一种基于组合核函数支持向量机(SVM)的说话人识别方法.对多项式核函数、径向基核函数进行线性加权,构建既具有全局核函数优点又具有局部核函数优点的组合核函数,并通过多重网格搜索调节权重系数使组合核函数适用于当前数据分布,确定组合核函数SVM的最优参数,实现对说话人的有效识别.对TIMIT数据集和含噪声数据集的仿真实验显示,基于组合核函数SVM的说话人识别性能明显优于单一的多项式核函数、径向基核函数和线性核函数.

References

[1]	Vapnik V. The nature of statistical learning theory[M]. Berlin: Springer Publishing Company, 2000.
[2]	兰均, 施化吉, 李星毅, 等. 基于特征词复合权重的关联网页分类[J]. 计算机科学, 2011, 38(3): 187-190. Lan Jun, Shi Huaji, Li Xingyi, et al. Associative web document classification based on word mixed weight[J]. Computer Science, 2011, 38(3): 187-190.
[3]	Reynolds D A, Rose R C. Robust text-independent speaker identification using Gaussian mixture speaker models[J]. IEEE Transactions on Speech and Audio Processing, 1995, 3(1): 72-83.
[4]	Gish H, Schmidt M. Text-independent speaker identification[J]. IEEE Signal Processing Magazine, 1994, 11(4): 18-32.
[5]	张亮. 说话人识别中语音增强算法的研究和系统实现[D]. 重庆: 重庆大学, 2009. Zhang Liang. Speech enhancement algorithm research and system implementation for speaker recognition[D]. Chongqing: Chongqing University, 2009.
[6]	Kinnunen T, Li H. An overview of text-independent speaker recognition: From features to supervectors[J]. Speech Communication, 2010, 52(1): 12-40.
[7]	Sakoe H, Chiba S. Dynamic programming algorithm optimization for spoken word recognition[J]. IEEE Transactions on Acoustics, Speech and Signal Processing, 1978, 26(1): 43-49.
[8]	Togneri R, Pullella D. An overview of speaker identification: Accuracy and robustness issues[J]. IEEE Circuits and Systems Magazine, 2011, 11 (2): 23-61.
[9]	Rosenberg A, Soong F. Evaluation of a vector quantization talker recognition system in text independent and text dependent modes[J]. Computer Speech and Language, 1987, 22(4): 143-157.
[10]	HigginsA L, Bahler L G, Porter J E. Voice identification using nearestneighbor distance measure[C]. IEEE International Conference on the Acoustics, Speech, and Signal Processing, Minneapolis, USA, April 27- 30, 1993.
[11]	Wang G W, Luo S X, He L, et al. Application BP neural network in the speaker recognition based on chaos particle swarm optimization algorithm[J]. Advanced Materials Research, 2013, 765: 2805-2808.
[12]	刘雪燕, 李明, 张亚芬. 基于PCA和多约简SVM的多级说话人辨识[J]. 计算机应用, 2008, 28(1): 127-130. Liu Xueyan, Li Ming, Zhang Yafen. Hierarchical speaker identification based on PCA and multi- reduced SVM[J]. Computer Applications, 2008, 28(1): 127-130.
[13]	You C H, Lee K A, Li H. GMM-SVM kernel with a Bhattacharyyabased distance for speaker recognition[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2010, 18(6): 1300-1312.
[14]	Fisher W M, Zue V, Bernstein J, et al. An acoustic-phonetic data base[J]. Journal of the Acoustical Society of America, 1987, 81(Suppl 1): 92-93.
[15]	Kohavi R. A study of cross- validation and bootstrap for accuracy estimation and model selection[C]. 14th International Joint Conference on Artificial Intelligence, Adelaide, Australia, December 10-14, 1995.
[16]	Nakagawa S, Wang L, Ohtsuka S. Speaker identification and verification by combining MFCC and phase information[J]. IEEE Transactions on Audio, Speech, and Language Processing, 2012, 20 (4): 1085-1095.
[17]	Hsu C W, Lin C J. A comparison of methods for multiclass support vector machines[J]. IEEE Transactions on Neural Networks, 2002, 13 (2): 415-425.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133