全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
-  2017 

基于DTW的俄语短指令语音识别
Speech recognition of Russian short instructions based on DTW

DOI: 10.6040/j.issn.1671-9352.0.2017.253

Keywords: 端点检测,俄语语音识别,跨语言语音识别,DTW算法,
Russian speech recognition
,endpoint detection,DTW algorithm,cross language speech recognition

Full-Text   Cite this paper   Add to My Lib

Abstract:

摘要: 面向训练语料有限的语音识别任务,基于动态时间规整(dynamic time warping, DTW)算法对俄语语音进行识别。首先,以跨语言标注的语音语料为资源基础,研究融合音字转换和机器翻译的语音识别方法。其次,结合俄语语音特点,以元音为中心设置动态门限阈值,实现精确至音节的端点检测,识别速度提高了34.4%,准确率提高了14%。然后,综合时域、频域分析,提取反映语音静态特征和动态变化的参数模板。另外,引入全局限制和早弃策略改进DTW算法,避免病态匹配,缩小计算规模,使速度提高了19.7%,准确率提高了4.8%。在俄语短指令语音集上做五折交叉验证,识别准确率达到74.9%。
Abstract: Focus on speech recognition task with limited training corpus, this paper makes research of Russian speech recognition based on DTW(dynamic time warping)algorithm. Firstly, we study methods for combining speech recognition and machine translation with the speech corpus which annotating tags of cross language text. Secondly, based on the characteristics of Russian speech, in order to detected syllable endpoint, we set dynamic threshold according to the central vowel, which increased the speed by 34.4% and increased the accuracy by 14%. Finally, we extract the parameters of the static and dynamic characteristics by analyzing speech features of time domain and frequency domain. In addition, the DTW algorithm is improved to overcome the ill condition and reduce the computation scale with global restrictions and early discard strategies, which increased the speed by 4.8% and increased the accuracy by 19.7%. Experiments on the Russian short instruction set with 5 fold cross validation, and the accuracy of speech recognition reached 74.9%

References

[1]  宋知用.MATLAB在语音信号分析与合成中的应用[M]. 北京:北京航空航天大学出版社,2013. SONG Zhiyong. MATLAB application in speech signal analysis and synthesis[M]. Beijing: Beihang University Press, 2013.
[2]  孙宏伟.基于DTW距离的时间序列相似性查询和shapelets分类算法研究[D]. 太原:太原理工大学,2016. SUN Hongwei. Research of time series distance similarity and shapelets classification algorithm based on DTW[D]. Taiyuan: Taiyuan University of Technology, 2016.
[3]  韩纪庆,张磊,郑铁然,等.语音信号处理[M]. 北京:清华大学出版社,2013. HAN Jiqing, ZHANG Lei, ZHENG Tieran, et al. Processing of speech signal[M]. Beijing:Tsinghua University Press, 2013.
[4]  马延周.基于标注新闻语料的俄语连续语音识别研究[D]. 郑州:解放军外国语学院, 2015. MAYanzhou. A study of Russian continuous speech recognition based on tagged news corpus[D]. Zhengzhou: PLA University of Foreign Language, 2015.
[5]  YOO IC,YOOK D. Robust voice activity detection using the spectral peaks of vowel sounds[J]. ETRI Journal, 2009, 31(4):451-453.
[6]  ORTIZ P D, VILLA LF, SALAZAR C, et al. A simple but efficient voice activity detection algorithm through Hilbert transform and dynamic threshold for speech pathologies[J]. Journal of Physics: Conference Series, 2016, 705(1):012037.
[7]  于俊婷,刘伍颖,易绵竹,等.国内语音识别研究综述[J]. 计算机光盘软件与应用,2014(10):76-78. YU Junting, LIU Wuying, YI Mianzhu, et al. Review of speech recognition in China[M].Computer CD Software and Applications, 2014(10):76-78.
[8]  胡航.语音信号处理[M]. 哈尔滨:哈尔滨工业大学出版社,2009. HU Hang. Processing of speech signal[M]. Harbin: Harbin Institute of Technology Press, 2009.
[9]  SAKOE H, CHIBA S. Dynamic programming algorithm optimization for spoken word recognition[J]. IEEE Transactions on Acoustics, Speech and Signal Process, 1978, 26(1):43-49.
[10]  ZHAO Fangli. Russian pronunciation analysis utilized praat software[J]. Computer Engineering and Applications, 2012, 48(11):133-136.
[11]  张雪英.数字语音处理及Matlab仿真[M]. 北京:电子工业出版社,2016. ZHANG Xueying. Processing of digital speech and Matlab simulation[M]. Beijing: Electronics Industry Press, 2016.
[12]  徐来娣.俄语音节理论研究与俄语音节切分优化方案[J]. 中国俄语教学,2009, 28(4):69-72. XU Laidi. Russian syllable theory study andoptimization of Russian syllable segmentation[J]. Teaching Russian in China, 2009, 28(4):69-72.
[13]  徐来娣.俄汉语流重音声学实验对比研究[J]. 中国俄语教学,2016, 35(2):75-81. XU Laidi. A contrastive study of Russian and Chinese acoustics stream stress[J]. Teaching Russian In China, 2016, 35(2):75-81.
[14]  赵芳丽.中国人说俄语声学特征的实验分析及训练对策[J]. 中国俄语教学,2011, 30(3):76-79. ZHAO Fangli. Experimental analysis and training strategies of Russian acoustic features by Chinese[J]. Teaching Russian In China, 2011, 30(3):76-79.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133