Due to the widening semantic gap of videos, computational tools to classify these videos into different genre are highly needed to narrow it. Classifying videos accurately demands good representation of video data and an efficient and effective model to carry out the classification task. Kernel Logistic Regression (KLR), kernel version of logistic regression (LR), proves its efficiency as a classifier, which can naturally provide probabilities and extend to multiclass classification problems. In this paper, Weighted Kernel Logistic Regression (WKLR) algorithm is implemented for video genre classification to obtain significant accuracy, and it shows accurate and faster good results. 1. Introduction Recently, video usage gains increasing importance, especially with the advance of recent internet technology and digital media; people have access to huge amount of data through internet and television. It is difficult for people to find videos of interest among these tremendous amounts of data and use them when there is a need, and it is not feasible to watch all the videos searching for the one of interest. Exploiting content-based of video footage is continuously needed for many applications, for example, for retrieving video sequence, creating automatic video summarization, or detecting specific action or activities in a video surveillance [1]. Many works are done dealing with video classification problem by categorizing videos in certain categories or genre, bridging the wide semantic gap between computed low-level features and high-level concepts and helping people to find their videos of interest within narrow domain. To get good understanding of video content, many different techniques have been developed and different video features have been identified for better video representation. Many techniques are used for video classification such as Bayesian, Hidden Markov Model (HMM), Gaussian Mixture Model (GMM), Neural-Network (NN), and Support Vector Machine (SVM). SVM is considered a state-of-the-art algorithm for classifying binary data through its implementation of kernels. Oh et al. developed general framework to perform the fundamental tasks for video data mining which are temporal segmentation of video sequences and feature extraction and how to capture the location of motions occurring in a segment, how to cluster those segmented pieces, and how to find whether a segment has normal or abnormal events [2]. Chen et al. proposed a multimedia data mining framework for extraction of soccer goal events in soccer videos, by using combined multimodal
References
[1]
A. Hervieu, P. Bouthemy, and J. P. L. Cadre, “Video event classification and detection using 2D trajectories,” in Proceedings of the 3rd International Conference on Computer Vision Theory and Applications (VISAPP '08), pp. 158–166, January 2008.
[2]
J. H. Oh, J. Lee, and S. Kote, “Multimedia data mining framework for raw video sequences,” in Mining Multimedia and Complex Data, pp. 18–35, 2003.
[3]
S. C. Chen, M.-L. Shyu, M. Chen, and C. Zhang, “A decision tree-based multimodal data mining framework for soccer goal detection,” in Proceedings of IEEE International Conference on Multimedia and Expo (ICME), pp. 265–268, IEEE, June 2004.
[4]
B. Li, J. H. Errico, H. Pan, and I. Sezan, “Bridging the semantic gap in sports video retrieval and summarization,” Journal of Visual Communication and Image Representation, vol. 15, no. 3, pp. 393–424, 2004.
[5]
D. Zhong and S. F. Chang, “Real-time view recognition and event detection for sports video,” Journal of Visual Communication and Image Representation, vol. 15, no. 3, pp. 330–347, 2004.
[6]
J. Fan, H. Luo, J. Xiao, and L. Wu, “Semantic video classification and feature subset selection under context and concept uncertainty,” in Proceedings of the 4th ACM/IEEE Joint Conference on Digital Libraries; Global Reach and Diverse Impact (JCDL '04), pp. 192–201, IEEE, June 2004.
[7]
V. A. Petrushin, “Mining rare and frequent events in multi-camera surveillance video using self-organizing maps,” in Proceedings of the 11th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD '05), pp. 794–800, ACM, August 2005.
[8]
A. Adam, E. Rivlin, I. Shimshoni, and D. Reinitz, “Robust real-time unusual event detection using multiple fixed-location monitors,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 30, no. 3, pp. 555–560, 2008.
[9]
K. Shirahama, Y. Matsuoka, and K. Uehara, “Video event retrieval from a small number of examples using rough set theory,” in Advances in Multimedia Modeling, pp. 96–106, 2011.
[10]
M. Maalouf, T. B. Trafalis, and I. Adrianto, “Kernel logistic regression using truncated Newton method,” Computational Management Science, vol. 8, no. 4, pp. 415–428, 2011.
[11]
M. Maalouf and T. B. Trafalis, “Robust weighted kernel logistic regression in imbalanced and rare events data,” Computational Statistics & Data Analysis, vol. 55, no. 1, pp. 168–183, 2011.
[12]
G. King and L. C. Zeng, “Explaining rare events in international relations,” International Organization, vol. 55, no. 3, pp. 693–715, 2001.
[13]
D. Brezeale and D. J. Cook, “Automatic video classification: a survey of the literature,” IEEE Transactions on Systems, Man and Cybernetics C, vol. 38, no. 3, pp. 416–430, 2008.
[14]
A. A. M. Hamed, Z. Xiaoming, C. Xu, et al., “Video genre classification using support vector machine ensemble,” International Journal of Digital Content Technlogy and Its Applications, vol. 6, no. 15, pp. 191–200, 2012.
[15]
H. Zhou, T. Hermans, A. V. Karandikar, and J. M. Rehg, “Movie genre classification via scene categorization,” in Proceedings of the International Conference on Multimedia (MM '10), pp. 747–750, ACM, October 2010.
[16]
Z. Rasheed, Y. Sheikh, and M. Shah, “On the use of computable features for film classification,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 15, no. 1, pp. 52–64, 2005.
[17]
A. Girgensohn and J. Foote, “Video classification using transform coefficients,” in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '99), pp. 3045–3048, IEEE, March 1999.
[18]
S. Dabbaghchian, M. P. Ghaemmaghami, and A. Aghagolzadeh, “Feature extraction using discrete cosine transform and discrimination power analysis with a face recognition technology,” Pattern Recognition, vol. 43, no. 4, pp. 1431–1440, 2010.
[19]
C. A. Dhawale and S. Jain, “A novel approach towards keyframe selection for video summarization,” Asian Journal of Information Technology, vol. 7, no. 4, pp. 133–137, 2008.
[20]
M. Heckmann, K. Kroschel, C. Savariaux, et al., DCT-based video features for audio-visual speech recognition, 2002.
[21]
L. Q. Xu and Y. Li, “Video classification using spatial-temporal features and PCA,” in Proceedings of the International Conference on Multimedia and Expo (ICME '03), vol. 3, pp. 485–488, IEEE, July 2003.
[22]
M. Mentzelopoulos and A. Psarrou, “Key-frame extraction algorithm using entropy difference,” in Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval (MIR '04), pp. 39–45, ACM, October 2004.
[23]
D. Rim, K. Hassan, and C. Pal, “Semi supervised learning in wild faces and videos,” in Proceedings of the British Machine Vision Conference, J. Hoey, S. McKenna, and E. Trucco, Eds., pp. 3.1–3.12, September 2011.