Pattern Recognition and Artificial Intelligence (模式识别与人工智能), 2006

A Speaker Voice Conversion Method Based on Phoneme-Tied Codebook Mapping

pp. 300-306

Wang Zixiang (王子祥), Dai Lirong (戴礼荣), Wang Yuping (王玉平), Wang Renhua (王仁华)

Keywords: voice conversion, codebook mapping, decision tree


Abstract:

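This paper introduces the framework of a speaker voice conversion system and discusses the traditional voice conversion method based on codebook mapping. Because the traditional method converts the spectrum as a weighted sum over all codewords, the converted spectrum is over-smoothed, which degrades the quality of the converted speech. To overcome this problem, the paper proposes a phoneme-tied weighted codebook combination for spectral conversion and uses decision trees for prosody conversion. Experiments show that, even with a small amount of training data, the method converts speaker identity well and yields high speech quality.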
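The contrast the abstract draws between weighting over all codewords and phoneme-tied weighting can be illustrated with a small sketch. This is not the authors' implementation: the exponential distance weighting, the `gamma` parameter, the function name `convert_frame`, and the toy two-dimensional codebooks are all assumptions chosen only for illustration.

```python
import math


def convert_frame(frame, src_codebook, tgt_codebook,
                  labels=None, phoneme=None, gamma=0.1):
    """Map one source spectral vector to the target speaker's space.

    Each target codeword is weighted by how close the frame lies to the
    paired source codeword. The exponential weighting is an assumption,
    not taken from the paper. If `phoneme` is given, only codewords whose
    label matches it contribute (the "phoneme-tied" case).
    """
    if phoneme is None:
        idx = list(range(len(src_codebook)))
    else:
        idx = [i for i, lab in enumerate(labels) if lab == phoneme]
    if not idx:
        raise ValueError("no codewords available for this phoneme")

    dists = [math.dist(frame, src_codebook[i]) for i in idx]
    d_min = min(dists)  # subtract the minimum for numerical stability
    raw = [math.exp(-gamma * (d - d_min)) for d in dists]
    total = sum(raw)
    weights = [r / total for r in raw]

    dim = len(tgt_codebook[0])
    return [sum(w * tgt_codebook[i][k] for w, i in zip(weights, idx))
            for k in range(dim)]


# Hypothetical toy codebooks: two codewords for phoneme "a", one for "i".
src_codebook = [[0.0, 0.0], [1.0, 0.0], [10.0, 10.0]]
tgt_codebook = [[0.0, 1.0], [1.0, 1.0], [20.0, 20.0]]
labels = ["a", "a", "i"]
frame = [0.5, 0.0]  # a frame assumed to belong to phoneme "a"

all_weighted = convert_frame(frame, src_codebook, tgt_codebook)
phoneme_tied = convert_frame(frame, src_codebook, tgt_codebook,
                             labels=labels, phoneme="a")
print("all codewords:", all_weighted)
print("phoneme-tied: ", phoneme_tied)
```

In this toy setup the unrestricted sum is pulled toward the unrelated "i" codeword, while the phoneme-tied sum stays between the two "a" codewords. That is the kind of averaging across dissimilar spectra that the abstract describes as over-smoothing.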
