OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

自动化学报 2008

Audio-visual Speaker Tracking Based on Dynamic Bayesian Network
基于动态贝叶斯网络的音视频联合说话人跟踪

JIN Nai-Gao,YIN Fu-Liang,CHEN Zhe,
金乃高,殷福亮,陈喆

Keywords: Speaker tracking,dynamic Bayesian network,particle filter,microphone array
说话人跟踪,动态贝叶斯网络,粒子滤波,麦克风阵列

Full-Text Cite this paper Add to My Lib

Abstract:

Multi-sensor data fusion technique is applied to speaker tracking problem,and a novel audio-visual speaker tracking approach based on dynamic Bayesian network is proposed.Based on the complementarity and redundancy between speech and image of a speaker,three kinds of perception methods,including sound source localization based on microphone array,face detection based on skin color information,and maximization mutual information based on audio-visual synchronization,are proposed to acquire the tracking information.In the framework of dynamic Bayesian network,particle filtering is used to fuse the tracking information,and perception management is achieved to improve the tracking efficiency by information entropy theory.Experiments using real-world data demonstrate that the proposed method can robustly track the speaker even in the presence of perturbing factors such as high room reverberation and video occlusions.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133

Audio-visual Speaker Tracking Based on Dynamic Bayesian Network基于动态贝叶斯网络的音视频联合说话人跟踪

Audio-visual Speaker Tracking Based on Dynamic Bayesian Network
基于动态贝叶斯网络的音视频联合说话人跟踪