计算机应用研究 (Application Research of Computers), 2009
Articulatory feature based on audio visual speech recognition model
Abstract:
This paper presents an articulatory feature (AF)-based multi-stream dynamic Bayesian network (DBN) model (AF_AV_DBN) for audio-visual speech recognition. The conditional probability of each node and the degree of asynchrony between AFs are defined, and speech recognition experiments are carried out on an audio-visual connected-digit database. Comparison with two single-stream DBN models (an audio-only model and a video-only model) shows that AF_AV_DBN performs the best when the signal-to-noise ratio on the audio s...