Trainable unit selection speech synthesis under statistical framework

Keywords: speech synthesis, unit selection and waveform concatenation, statistical modeling, maximum likelihood criterion

Abstract:

This paper proposes a trainable unit selection speech synthesis method built on a statistical modeling framework. In the training stage, acoustic features are extracted from the training database and a statistical model is estimated for each feature. During synthesis, the database is searched for the optimal candidate unit sequence under a maximum likelihood criterion derived from the trained models, and the waveforms of the selected units are then concatenated to produce synthetic speech. Experimental results show that, compared with the conventional unit selection synthesis method, this method makes system construction more automatic and improves the naturalness of the synthetic speech. The paper further presents a minimum unit selection error criterion for model training, motivated by the characteristics of unit selection speech synthesis, and adopts discriminative training for model parameter estimation. This criterion makes system construction fully automatic and further improves the naturalness of the synthetic speech.

Supported by the National Natural Science Foundation of China (Grant Nos. 60475015 and 60610298) and the National Hi-Tech Research and Development Program of China (Grant Nos. 2006AA01Z137 and 2006AA010104).
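The maximum likelihood search described in the abstract can be illustrated with a small sketch. The abstract does not specify the form of the statistical models, so everything below is an assumption made for illustration only: each unit carries a single scalar feature scored by a one-dimensional Gaussian target model, joins are scored by a zero-mean Gaussian on the mismatch between adjacent unit edges, and the names `gaussian_loglik`, `select_units`, and `join_var` are hypothetical. It is not the paper's implementation.

```python
import math


def gaussian_loglik(x, mean, var):
    """Log-density of a 1-D Gaussian N(mean, var) evaluated at x."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def select_units(target_models, candidates, join_var=1.0):
    """Viterbi search for the candidate sequence with the highest total log-likelihood.

    target_models: one (mean, var) pair per target position (assumed model form).
    candidates:    candidates[t] is a list of (feature, left_edge, right_edge)
                   tuples, one per database unit available at position t.
    Returns (path, score), where path[t] indexes into candidates[t].
    """
    # best[t][j]: best log-likelihood of any path ending in candidate j at position t.
    best = [[gaussian_loglik(c[0], *target_models[0]) for c in candidates[0]]]
    back = []
    for t in range(1, len(candidates)):
        mean, var = target_models[t]
        scores, pointers = [], []
        for feat, left, _right in candidates[t]:
            # Join score: likelihood of the edge mismatch with each predecessor.
            totals = [
                best[-1][i] + gaussian_loglik(left - prev[2], 0.0, join_var)
                for i, prev in enumerate(candidates[t - 1])
            ]
            i_best = max(range(len(totals)), key=totals.__getitem__)
            scores.append(totals[i_best] + gaussian_loglik(feat, mean, var))
            pointers.append(i_best)
        best.append(scores)
        back.append(pointers)

    # Backtrack from the best final candidate.
    j = max(range(len(best[-1])), key=best[-1].__getitem__)
    score = best[-1][j]
    path = [j]
    for pointers in reversed(back):
        j = pointers[j]
        path.append(j)
    path.reverse()
    return path, score


# Toy data: at position 0, candidate 0 matches the target best but joins badly.
target_models = [(1.0, 1.0), (2.0, 1.0)]
candidates = [
    [(1.0, 0.0, 5.0), (1.2, 0.0, 2.0)],
    [(2.0, 2.0, 0.0), (2.0, 9.0, 0.0)],
]
path, score = select_units(target_models, candidates)
print(path, round(score, 4))
```

In this toy example the search picks candidate 1 at the first position even though candidate 0 fits the target model slightly better, because candidate 1 joins smoothly with the following unit. The selected units would then be concatenated at the waveform level, a step this sketch leaves out.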
