Facial expressions are widely used in the behavioral interpretation of emotions, cognitive science, and social interactions. In this paper, we present a novel method for fully automatic facial expression recognition in facial image sequences. As the facial expression evolves over time facial landmarks are automatically tracked in consecutive video frames, using displacements based on elastic bunch graph matching displacement estimation. Feature vectors from individual landmarks, as well as pairs of landmarks tracking results are extracted, and normalized, with respect to the first frame in the sequence. The prototypical expression sequence for each class of facial expression is formed, by taking the median of the landmark tracking results from the training facial expression sequences. Multi-class AdaBoost with dynamic time warping similarity distance between the feature vector of input facial expression and prototypical facial expression, is used as a weak classifier to select the subset of discriminative feature vectors. Finally, two methods for facial expression recognition are presented, either by using multi-class AdaBoost with dynamic time warping, or by using support vector machine on the boosted feature vectors. The results on the Cohn-Kanade (CK+) facial expression database show a recognition accuracy of 95.17% and 97.35% using multi-class AdaBoost and support vector machines, respectively.
References
[1]
Valstar, M.F.; Mehu, M.; Jiang, B.; Pantic, M.; Scherer, K. Meta-analysis of the first facial expression recognition challenge. IEEE Trans. Syst. Man. Cybern. B Cybern. 2012, 42, 966–979.
[2]
Mehrabian, A. Communication without words. Psychol. Today 1968, 2, 53–56.
[3]
Ekman, P. Strong evidence of universal in facial expressions: A reply to Russell's mistaken critique. Psychol. Bull. 1994, 115, 268–287.
[4]
Samal, A.; Iyenger, P.A. Automatic recognition and analysis of human faces and facial expressions: A survey. Pattern Recognit. 1992, 25, 65–77.
[5]
Pantic, M.; Rothkrantz, L. Automatic analysis of facial expressions: The state of the art. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1424–1445.
[6]
Fasel, B.; Luettin, J. Automatic facial expression analysis: A survey. Pattern Recognit. 2003, 36, 259–275.
[7]
Zhao, G.; Pietik?inen, M. Dynamic texture recognition using local binary patterns with an application to facial expressions. IEEE Trans. Pattern Anal. Mach. Intell. 2007, 29, 915–928.
[8]
Shan, C.; Gong, S.; McOwan, P.W. Facial expression recognition based on local binary patterns: A compressive study. Image Vision Comput. 2009, 27, 803–816.
[9]
Zhao, X.; Zhang, S. Facial expression recognition based on local binary patterns and kernel discriminant isomap. Sensors 2011, 11, 9573–9588.
[10]
Moore, S.; Bowden, R. Local binary patterns for multi-view facial expression recognition. Comput. Vision Image Underst. 2011, 115, 541–558.
[11]
Zhao, G.; Huang, X.; Taini, M.; Li, S.Z.; Pietik?inen, M. Facial expression recognition from near-infrared videos. Image Vision Comput. 2011, 11, 607–619.
[12]
Ghimire, D.; Lee, J. Histogram of orientation gradient feature-based facial expression classification using bagging with extreme learning machine. Adv. Sci. Lett. 2012, 17, 156–161.
[13]
Dhall, A.; Asthana, A.; Goecke, R.; Gedeon, T. Emotion Recognition Using PHOG and LPQ Features. Proceedings of the IEEE International Conference on Face and Gesture Recognition and Workshop, Santa Barbara, CA, USA, 21–25 March 2011; pp. 878–883.
[14]
Jabid, T.; Kabir, Md.H.; Chae, O. Robust facial expression recognition based on local directional pattern. ETRI J. 2010, 32, 784–794.
[15]
Zhi, R.; Flierl, M.; Ruan, Q.; Kleijn, W.B. Graph-preserving sparse nonnegative matrix factorization with application to facial expression recognition. IEEE Trans. Syst. Man Cybernet.-Part. B Cybernet. 2011, 41, 38–52.
[16]
Kotsia, I.; Buciu, I.; Pitas, I. An analysis of facial expression recognition under partial facial image occlusion. Image Vision Comput. 2008, 26, 1052–1067.
[17]
Lin, D.-T. Facial expression classification using PCA and hierarchical radial basic function network. J. Inf. Sci. Eng. 2006, 22, 1033–1046.
[18]
Wang, Z.; Ruan, Q. Facial Expression Recognition Based Orthogonal Local Fisher Discriminant Analysis. Proceedings of the International Conference on Signal Processing (ICSP), Beijing, China, 24–28 October 2010; pp. 1358–1361.
[19]
Littlewort, G.; Bartlett, M.S.; Fasel, I. Dynamics of facial expression extracted automatically from video. Image Vision Comput. 2006, 24, 615–625.
[20]
Choi, H.-C.; Oh, S.-Y. Realtime Facial Expression Recognition Using Active Appearance Model and Multilayer Perceptron. Proceedings of the International Joint Conference SICE-ICASE, Busan, Korea, 18–21 October 2006; pp. 5924–5927.
[21]
Asthana, A.; Saragih, J.; Wagner, M.; Goecke, R. Evaluating AAM Fitting Methods for Facial Expression Recognition. Proceedings of the International Conference on Affective Computing and Intelligent Interaction, Amsterdam, The Netherlands, 10–12 September 2009; pp. 1–8.
[22]
Kotsia, I.; Pitas, I. Facial expression recognition in image sequence using geometric deformation features and support vector machines. IEEE Trans. Image Process. 2007, 16, 172–187.
Sung, J.; Kim, D. Real-time facial expression recognition using STAAM and layered GDA classifiers. Image Vision Comput. 2009, 27, 1313–1325.
[25]
Chang, Y.; Hu, C.; Feris, R.; Turk, M. Manifold based analysis of facial expression. Image Vision Comput. 2006, 24, 605–614.
[26]
Sandbach, G.; Zafeiriou, S.; Pantic, M.; Rueckert, D. Recognition of 3D facial expressions dynamics. Image Vision Comput 2012, 762–773.
[27]
Rudovic, O.; Pantic, M.; Patras, I. Coupled Gaussian processes for pose-invariant facial expression recognition. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 35, 1357–1369.
[28]
Lyons, M.; Skamatsu, S. Coding Facial Expressions with Gabor Wavelets. Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition, Nara, Japan, 14–16 April 1998; pp. 200–205.
[29]
Zhang, Z.; Lyons, M.; Schuster, M.; Skamatsu, S. Comparison between Geometric-Based and Gabor-Wavelets-Based Facial Expression Recognition Using Multi-Layer Perceptron. Proceedings of the IEEE International Conference on Automatic Face and Gesture Recognition, Nara, Japan, 14–16 April 1998; pp. 454–459.
[30]
Huang, X.; Zhao, G.; Pietik?inen, M.; Zheng, W. Dynamic Facial Expression Recognition Using Boosted Component-Based Spatiotemporal Features and Multi-Classifier Fusion. Proceedings of Advanced Concepts for Intelligent Vision Systems, Sydney, Australia, 13–16 December 2010; pp. 312–322.
[31]
Sakoe, H.; Chiba, S. Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Signal. Process. 1978, 26, 43–49.
[32]
Lemire, D. Faster retrieval with a two-pass dynamic-time-warping lower bound. Pattern Recognit. 2009, 42, 2169–2180.
[33]
Lades, M.; Vorbüggen, J.C.; Buhmann, J.; Lange, J.; Malsburg, C.; Würtz, R.P.; Konen, W. Distortion invariant object recognition in the dynamic link architecture. IEEE Trans. Comput. 1993, 42, 300–311.
[34]
Wiskott, L.; Fellous, J.-M.; Krüger, N. Face recognition by elastic bunch graph matching. IEEE Trans. Pattern Anal. Mach. Intell. 1997, 19, 775–779.
[35]
Blome, D.S. Elastic Bunch Graph Matching. M.Sc. Thesis, Colorado State University, Fort Collins, CO, USA, 22 May 2003.
[36]
Viola, P.; Jones, M.J. Robust real-time face detection. Int. J. Comput. Vision 2004, 57, 137–154.
[37]
Freund, Y.; Schapire, R.E. A decision-theoretic generalization of on-line learning and an application to boosting. J. Comput. Syst. Sci. 1997, 55, 119–139.
[38]
Silapachote, P.; Karuppiah, D.R.; Hanson, A.R. Feature Selection Using AdaBoost for Face Expression Recognition. Proceedings of the International Conference on Visualization, Image and Image Processing, Marbella, Spain, 6–8 April 2004; pp. 84–89.
[39]
Lajevardi, S.M.; Lech, M. Facial Expression Recognition from Image Sequence Using Optimized Feature Selection. Proceedings of the International Conference on Image and Vision Computing, Christchurch, New Zealand, 26–28 November 2008; pp. 1–6.
[40]
Zhu, J.; Zou, H.; Rosset, S.; Hastie, T. Multi-class AdaBoost. Stat Interface 2009, 2, 349–360.
[41]
Lucey, P.; Cohn, J.F.; Kanade, T.; Saragih, J.; Ambadar, Z. The Extended Cohn-Kanade Dataset (CK+): A Complete Dataset for Action Unit and Emotion-Specified Expression. Proceedings of the 3rd IEEE Workshop on CVPR for Human Communication Behavior Analysis, San Francisco, CA, USA, 13–18 June 2010; pp. 94–101.
[42]
Chang, C.-C.; Lin, C.-J. LIBSVM: A library for support vector machines, 2001. Available online: http://www.csie.ntu.edu.tw/~cjlin/libsvm (accessed on 9 June 2013).
[43]
Hsu, C.-W.; Chang, C.-C.; Lin, C.-J. A Practical Guide to Support Vector Classification; Technical Report; Department of Computer Science, National Taiwan University: Taiwan, 2010.
[44]
Zafeiriou, S.; Pitas, I. Discriminant graph structures for facial expression recognition. IEEE Trans. Multimed. 2008, 10, 1528–1540.
[45]
Zhang, S.; Zhao, X.; Lei, B. Robust facial expression recognition via compressive sensing. Sensors 2012, 12, 3748–3761.