Chou F C, Tseng C Y, Lee L S, et al. Automatic generation of prosodic structure for high quality Mandarin speech synthesis//Proceeding of the Fourth International Conference on Spoken Language Processing. Philadelphia: , 1996: 1624-1627.
[2]
Chu Min, Lu S N. A text-to-speech system with high intelligibility and high naturalness for Chinese[J]. Chinese Journal of Acoustics, 1996, 15(1): 81-90.
[3]
Niu Zhengyu, Chai Peiqi. Segmentation of prosodic phrases for improving the naturalness of synthesized mandarin Chinese speech//ICSLP 2000. Beijing: , 2000: 350-353.
[4]
Mao Xinnian, Dong Yuan, Han Jinyu, et al. Inequality maximum entropy classifier with character featrues for polyphone disambiguation in mandarin TTS systems//IEEE International Conference on Acoustics, Speech, and Signal Processing. Honolulu: , 2007: IV-705-IV-708.
[5]
Wang M, Hirschberg J. Automatic classification of intonational phrase boundaries[J]. Computer Speech and Language, 1992, 16(6): 175-196.
[6]
Taylor P, Black A W. Assigning phrase breaks from part-of-speech sequences[J]. Computer Speech and Language, 1998, 12(2): 99-117.
[7]
Jia Yuxiang, Huang Dezhi, Liu Wu, et al. Text normalization in mandarin text-to-speech system//IEEE International Conference on Acoustics, Speech, and Signal Processing. Las Vegas: , 2008: 4693-4696.
[8]
Lafferty J, McCallum A, Pereira F C N. Conditional random fields: probabilistic models for segmenting and labeling sequence data//Proc of the 18th ICML. San Francisco: , 2001: 282-289.
[9]
Mao Xinnian, Dong Yuan, He Saike, et al. Chinese word segmentation and named entity recognition based on conditional random fields//IJCNLP 2008. Hyderabad: , 2008: 90-93.
[10]
Brill E. Transformation-based error-driven learning and natural language processing: a case study in part of speech tagging[J]. Computational Linguistics, 1995, 21(4): 543-565.