OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

模式识别与人工智能 2007

一种自动的唇部定位及唇轮廓提取、跟踪方法*

, PP. 485-491

王晓平,郝玉峰,付德刚,袁春伟

Keywords: 唇读,人脸检测,颜色空间,Fisher变换,定位,变形模板,轮廓提取,跟踪

Full-Text Cite this paper Add to My Lib

Abstract:

实现一种结合CbCr颜色空间、Fisher变换及变形模板的自动唇部定位及唇轮廓提取、跟踪方法.首先在CbCr空间建立肤色模型进行人脸检测、定位，并由人脸几何特征进行唇部粗定位.然后结合唇色模型进行Fisher变换使肤、唇色差别明显化，提出根据亮度信息对变换结果预处理后用Otsu法进行图像分割，经唇色模型进一步验证后实现唇部精定位.再使用变形模板来进行嘴唇轮廓特征提取，为增强内轮廓定位的鲁棒性，本文提出对经亮度预处理和唇色模型验证得到的口腔区域边缘图进行曲线拟合来实现内轮廓定位.最后，将唇读图像序列中上一帧的唇部定位结果拓展后作为当前帧的预测区域再进行处理来实现唇动跟踪.

References

[1]	Hennecke M E, Prasad K V, Stork D G. Automatic Speech Recognition System Using Acoustic and Visual Signals // Proc of the 29th Asilomar Conference on Signals, Systems and Computers. Pacific Grove, USA, 1995, Ⅱ: 12141218
[2]	Lanitis A, Taylor C J, Cootes T F. An Automatic Face Identification System Using Flexible Appearance Models. Image and Vision Computing, 1995, 13(5): 393401
[3]	Turk M, Pentland A. Eigenfaces for Recognition. Journal of Cognitive Neuroscience, 1991, 3(1): 7186
[4]	Yang Jie, Waibel A. A Realtime Face Tracker // Proc of the 3rd IEEE Workshop on Applications of Computer Vision. Sarasota, USA, 1996: 142147
[5]	Chai D, Ngan K N. Face Segmentation Using SkinColor Map in Videophone Application. IEEE Trans on Circuits and Systems for Video Technology, 1999, 9(4): 551564
[6]	Bregler C, Konig Y. Eigenlips for Robust Speech Recognition // Proc of the IEEE International Conference on Acoustics, Speech and Signal Processing. Adelaide, Australia, 1994: 669672
[7]	Iwano K, Tamura S, Furui S. Bimodal Speech Recognition Using Lip Movement Measured by OpticalFlow Analysis // Proc of the International Workshop on HandsFree Speech Communication. Kyoto, Japan, 2001: 187190
[8]	Luettin J, Thacker N A, Beet S W. Speechreading Using Shape and Intensity Information // Proc of the IEEE International Conference on Spoken Language. Philadelphia, USA, 1996, Ⅰ: 5861
[9]	Cootes T F, Edwards G J, Taylor C J. Active Appearance Models. IEEE Trans on Pattern Analysis and Machine Intelligence, 2001, 23(6): 681685
[10]	Aleksic P S, Katsaggelos K. Comparison of MPEG4 Facial Animation Parameter Groups with Respect to AudioVisual Speech Recognition Performance // Proc of the IEEE International Conference on Image Processing. Genoa, Italy, 2005, Ⅲ: 501504
[11]	Xue Yi. The Principles and Methods for Optimization. Beijing, China: Beijing Industry University Press, 2001 (in Chinese) (薛毅.最优化原理与方法.北京:北京工业大学出版社, 2001)
[12]	Gonzalez R C, Woods R E. Digital Image Processing. 2nd Edition. Upper Saddle River, USA: Prentice Hall, 2002
[13]	Lin Fuzong. Fundamentals of Multimedia Technology. 2nd Edition. Beijing, China: Tsinghua University Press, 2002 (in Chinese) (林福宗.多媒体技术基础.第2版.北京:清华大学出版社, 2002)
[14]	Yao Hongxun, Liu Mingbao, Gao Wen, et al. Method of Face Locating and Tracking Based on Chromatic Coordinates Transformation of Color Images. Chinese Journal of Computers, 2000, 23(2): 158165 (in Chinese) (姚鸿勋,刘明宝,高文,等.基于彩色图像的色系坐标变换的面部定位与跟踪法.计算机学报, 2000, 23(2): 158165)
[15]	Wang Rui, Gao Wen, Ma Jiyong. An Approach to Robust and Fast Locating of Lip Motion. Chinese Journal of Computers, 2001, 24(8): 866871 (in Chinese) (王瑞,高文,马继涌.一种快速、鲁棒的唇动检测与定位方法.计算机学报, 2001, 24(8): 866871)
[16]	Bian Zhaoqi, Zhang Xuegong. Pattern Recognition. 2nd Edition. Beijing, China: Tsinghua University Press, 2002 (in Chinese) (边肇祺, 张学工.模式识别.第2版.北京:清华大学出版社, 2002)
[17]	Otsu N. A Threshold Selection Method from GrayLevel Histogram. IEEE Trans on Systems, Man and Cybernetics, 1979, 9(1): 6266
[18]	Hennecke M E, Prasad K V, Stork D G. Using Deformable Templates to Infer Visual Speech Dynamics // Proc of the 28th Annual Asilomar Conference on Signals, Systems and Computers. Pacific Grove, USA, 1994, Ⅰ: 578582
[19]	Kass M, Witkin A, Terzopoulus D. Snakes: Active Contour Models. International Journal of Computer Vision, 1988, 1(4): 321331

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133