Slow-motion replays are content full segments of broadcast soccer videos. In this paper, we propose an efficient method for detection of slow-motion shots produced by high-speed cameras in soccer broadcasts. A rich set of color, motion, and cinematic features are extracted from compressed video by partial decoding of the MPEG-1 bitstream. Then, slow-motion shots are modeled by SVM classifiers for each shot class. A set of six full-match soccer games is used for training and evaluation of the proposed method. Our algorithm presents satisfactory results along with high speed for slow-motion detection in soccer videos. 1. Introduction Replays in soccer broadcasts cover most important contents of the video. Quick development of video compression techniques led to huge compressed video archives. Compressed domain video analysis on these archived videos can result in efficient video processing frameworks. However, noisy features are main challenge in compressed video analysis. Nowadays, most sports broadcasters use logo transitions before and after replay shots [1–3]; however, detection of replay shots by slow-motion detection can bring robustness and generality to prevailing systems. Slow-motion replays could be produced by standard or high-speed cameras. Several approaches are proposed for slow-motion detection based on each production style. A slow-motion replay from a standard camera can be generated by repeating some normal frames or inserting morphed frames between two consecutive frames [4, 5]. Repeated or inserted frames result in special patterns in frame difference feature and could be detected easily in spatial [4–6] or compressed domain [7–9]. Recently, majority of broadcasters are using high-speed cameras for slow-motion generation to achieve finer presentation of fast movements. When high-speed cameras are used with recording frame rate higher than desired slow-motion frame rate, some frames must be dropped. Dropped frames could be detected by plentiful fluctuations in frame difference feature [10]. In addition, slow-motion replays generated by dropped frames has higher mean of absolute frame difference than slow-motion replays generated by normal cameras [4]. With a high-speed camera, the slow-motion effect can also be generated by simply playing out the video at the normal speed. We call these slow-motion replays generated by high-speed cameras HISM replays. Detection of HISMs is very challenging because they do not result in any special pattern in visual features. Han et al. in [11] tried to detect HISMs by exploiting camera motion patterns
References
[1]
Q. Huang, J. Hu, W. Hu, T. Wang, H. Bai, and Y. Zhang, “A reliable logo and replay detector for sports video,” in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME '07), pp. 1695–1698, July 2007.
[2]
Z. Dang, J. Du, Q. Huang, and S. Jiang, “Replay detection based on semi-automatic logo template sequence extraction in sports video,” in Proceedings of the ICME 4th International Conference on Image and Graphics (ICIG '07), pp. 839–844, August 2007.
[3]
Z. Zhao, J. Shuqiang, H. Qingming, and Z. Guangyu, “Highlight summarization in sports video based on replay detection,” in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME '06), pp. 1613–1616, July 2006.
[4]
E. J. Farn, L. H. Chen, and J. H. Liou, “A new slow-motion replay extractor for soccer game videos,” International Journal of Pattern Recognition and Artificial Intelligence, vol. 17, no. 8, pp. 1467–1481, 2003.
[5]
Y. Q. Yang, W. Gu, and Y. D. Lu, “An improved slow-motion detection approach for soccer video,” in Proceedings of the International Conference on Machine Learning and Cybernetics (ICMLC '05), pp. 4593–4598, August 2005.
[6]
A. Ekin, A. M. Tekalp, and R. Mehrotra, “Automatic soccer video analysis and summarization,” IEEE Transactions on Image Processing, vol. 12, no. 7, pp. 796–807, 2003.
[7]
W.-H. Chuang, D.-Y. Hsiao, S.-C. Pei, and H. Chen, “A rapid scheme for slow-motion replay segment detection,” in Proceedings of the 5th Pacific Rim Conference on Multimedia, vol. 3331 of Lecture Notes in Computer Science, pp. 239–246, 2004.
[8]
V. Kobla, D. DeMenthon, and D. Doermann, “Identifying sports videos using replay, text, and camera motion features,” in Storage and Retrieval for Media Database, vol. 3972 of Proceedings of SPIE, pp. 332–343, 2000.
[9]
V. Kobla, D. DeMenthon, and D. Doermann, “Detection of slow-motion replay sequences for identifying sports videos,” in Proceedings of the IEEE 3rd Workshop on Multimedia Signal Processing, pp. 135–140, 1999.
[10]
H. Pan, P. Van Beek, and M. I. Sezan, “Detection of slow-motion replay segments in sports video for highlights generation,” in Proceedings of the IEEE Interntional Conference on Acoustics, Speech, and Signal Processing, pp. 1649–1652, May 2001.
[11]
B. Han, Y. Yan, Z. Chen, C. Liu, and W. Wu, “A general framework for automatic on-line replay detection in sports video,” in Proceedings of the 17th ACM International Conference on Multimedia (MM '09), pp. 501–504, October 2009.
[12]
L. Wang, X. Liu, S. Lin, G. Xu, and H. Y. Shum, “Generic slow-motion replay detection in sports video,” in Proceedings of the International Conference on Image Processing (ICIP '04), pp. 1585–1588, October 2004.
[13]
Y. Yang, S. Lin, Y. Zhang, and S. Tang, “A statistical framework for replay detection in soccer video,” in Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS '08), pp. 3538–3541, May 2008.
[14]
J. Wang, E. Chng, and C. Xu, “Soccer replay detection using scene transition structure analysis,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '05), pp. 433–436, March 2005.
[15]
C. Lang, D. Xu, and Y. Jiang, “Shot type classification in sports video based on visual attention,” in Proceedings of the International Conference on Computational Intelligence and Natural Computing (CINC '09), pp. 336–339, June 2009.
[16]
N. Zhenxing, G. Xinbo, T. Dacheng, and L. Xuelong, “Semantic video shot segmentation based on color ratio feature and SVM,” in Proceedings of the International Conference on Cyberworlds (CW '08), pp. 157–162, September 2008.
[17]
Y. H. Zhou, Y. D. Cao, L. F. Zhang, and H. X. Zhang, “An SVM-based soccer video shot classification,” in Proceedings of the International Conference on Machine Learning and Cybernetics (ICMLC '05), vol. 9, pp. 5398–5403, August 2005.
[18]
B. Han, Y. Hu, G. Wang, W. Wu, and T. Yoshigahara, “Enhanced sports video shot boundary detection based on middle level features and a unified model,” IEEE Transactions on Consumer Electronics, vol. 53, no. 3, pp. 1168–1176, 2007.
[19]
U. Srinivasan, S. Pfeiffer, S. Nepal, M. Lee, L. Gu, and S. Barrass, “A survey of MPEG-1 audio, video and semantic analysis techniques,” Multimedia Tools and Applications, vol. 27, no. 1, pp. 105–141, 2005.
[20]
J. L. Mitchell, W. B. Pennebaker, C. E. Fogg, and D. J. LeGall, “MPEG-1 video syntax,” in MPEG Video Compression Standard, pp. 1–470, Kluwer Academic Publishers, 1st edition, 2002.
[21]
H. S. Wong, H. H. S. Ip, L. P. L. Iu, K. K. T. Cheung, and L. Guan, “Transformation of compressed domain features for content-based image indexing and retrieval,” Multimedia Tools and Applications, vol. 26, no. 1, pp. 5–26, 2005.
[22]
Z. L. Tao, X. L. Huang, and X. Wang, “Shot boundary detection based on macroblock and DC image,” in Proceedings of the International Conference on Management and Service Science (MASS '09), pp. 1–3, September 2009.
[23]
X. Wang, S. Wang, and H. Chen, “An effective soccer video shot detection algorithm,” in Proceedings of the International Conference on Computer Science and Software Engineering (CSSE '08), pp. 214–217, December 2008.
[24]
K. Hoashi, M. Sugano, M. Naito, K. Matsumoto, F. Sugaya, and Y. Nakajima, “Shot boundary determination on MPEG compressed domain and story segmentation experiments for TRECVID 2003,” in Proceedings of the TREC Video Retrieval Evaluation (TRECVID '04), 2004.
[25]
C. Grana and R. Cucchiara, “Linear transition detection as a unified shot detection approach,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 17, no. 4, pp. 483–489, 2007.
[26]
V. Kiani and H. R. Pourreza, “Robust GME in encoded MPEG video,” in Proceedings of the 9th International Conference on Advances in Mobile Computing and Multimedia Processing (MoMM '11), pp. 147–154, 2011.
[27]
M. H. Kolekar, K. Palaniappan, S. Sengupta, and G. Seetharaman, “Semantic concept mining based on hierarchical event detection for soccer video indexing,” Journal of Multimedia, vol. 4, no. 5, pp. 298–312, 2009.