OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

- 2018

基于瓶颈特征的藏语拉萨话连续语音识别研究
Study on Continuous Speech Recognition Based on Bottleneck Features for Lhasa-Tibetan Dialect

DOI: 10.13209/j.0479-8023.2017.154

周楠,赵悦,李要嫱,徐晓娜,才旺拉姆,吴立成

Keywords: 藏语拉萨话,连续语音识别,高斯混合–隐马尔科夫模型,瓶颈特征,深度神经网络
Lhasa-Tibetan,continuous speech recognition,GMM-HMM,bottleneck features,deep neural network (DNN)

Full-Text Cite this paper Add to My Lib

Abstract:

摘要基于从深度神经网络提取的瓶颈特征具有语音长时相关性和紧凑表示的特点, 将瓶颈特征及其与MFCC的复合特征用于藏语连续语音识别任务中, 可以代替传统的MFCC特征进行GMM-HMM声学建模。在藏语拉萨话连续语音识别任务中的实验表明, 瓶颈特征的复合特征取得比深度神经网络后验特征和单瓶颈特征更好的识别表现。
Abstract The bottleneck features extracted from deep neural network not only have long term contextdependence and compact representation of speech signal, but also can replace the traditional MFCC features for GMM-HMM acoustic modeling. The authors apply bottleneck features and their concatenated features with MFCC into Lhasa-Tibetan continuous speech recognition. The experiments in Lhasa-Tibetan continuous speech recognition show that the concatenated features of bottleneck features and MFCC achieve better performance than the posterior features of deep neural network and mono-bottleneck features.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

基于瓶颈特征的藏语拉萨话连续语音识别研究Study on Continuous Speech Recognition Based on Bottleneck Features for Lhasa-Tibetan Dialect

基于瓶颈特征的藏语拉萨话连续语音识别研究
Study on Continuous Speech Recognition Based on Bottleneck Features for Lhasa-Tibetan Dialect