全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

视频摘要技术综述

DOI: 10.11834/jig.20141201

Keywords: 视频内容分析,摘要生成,实时视频摘要,多视角视频摘要,视频语义获取

Full-Text   Cite this paper   Add to My Lib

Abstract:

目的类似于文本摘要,视频摘要是对视频内容的总结。为了合理地评估视频摘要领域的研究进展,正确导向视频摘要的继续研究,本文归纳总结视频摘要技术的主要研究方法和显著性成果,对视频摘要技术进行综述。方法依据视频摘要的两个主要生成步骤:视频内容分析和摘要生成分别介绍视频摘要的主要研究方法。同时,分析了近5年视频摘要领域的研究状况,对视频摘要发展的新趋势:实时视频摘要和多视角视频摘要进行了阐述。最后,还对视频摘要的评价系统进行了分类总结。结果对视频摘要进行综述,对摘要中的语义获取难题提出了2种指导性建议。并依据分析结果,展望了视频摘要技术未来的发展方向。结论视频摘要技术作为视频内容理解的重要组成部分,有较大研究价值。而目前,视频摘要在视频语义表达和摘要评价系统方面并不精确完善,还需进一步的深入研究。

References

[1]  Maybury M T. Broadcast news understanding and navigation[C]//Proceedings of the Fifteenth Conference on Innovative Applications of Artificial Intelligence. Trier, German: DBLP,2003: 117-122.
[2]  Pfeiffer S, Lienhart R, Kühne G, et al. The MoCA project.[M]//Informatik\'98. Berlin, Heidelberg: Springer, 1998: 329-338.
[3]  Chang S F, Chen W, Meng H J, et al. VideoQ: an automated content based video search system using visual cues[C]//Proceedings of the 5th ACM International Conference on Multimedia.New York, USA:ACM, 1997: 313-324.
[4]  Snoek C G M, Worring M.Time interval maximum entropy based event indexing in soccer [C]// Proceedings of IEEE International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2003:481-484.
[5]  Uchihashi S, Foote J, Girgensohn A, et al. Video manga: generating semantically meaningful video summaries[C]//Proceedings of the seventh ACM International Conference on Multimedia (Part 1). New York, USA:ACM, 1999: 383-392.
[6]  Wu L Q, Li G H. Video\'s structured browsering and querying system: Videowser[J]. Mini-micro Systems, 2001, 22(1): 112-115.[吴玲琦, 李国辉. 视频结构化浏览和查询系统: Videowser[J]. 小型微型计算机系统, 2001,22(1):112-115.][DOI:10.3969/j.issn.1000-1220.2001.01.030]
[7]  Zhuang Y, Rui Y, Huang T S, et al. Adaptive key frame extraction using unsupervised clustering[C]// Proceedings of International Conference on Image Processing. Washington DC,USA: IEEE, 1998, 1: 866-870.[DOI:10.1109/ICIP.1998.723655]
[8]  Almeida J, Torres R D S, Leite N J. Rapid video summarization on compressed video[C]// IEEE International Symposium on Multimedia. Washington DC,USA: IEEE, 2010: 113-120.[DOI:10.1109/ISM.2010.25]
[9]  Coldefy F, Bouthemy P. Unsupervised soccer video abstraction based on pitch, dominant color and camera motion analysis[C]//Proceedings of the 12th Annual ACM International Conference on Multimedia. New York, USA:ACM, 2004: 268-271.
[10]  Wolf W. Key frame selection by motion analysis[C]// Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. Washington DC,USA: IEEE, 1996, 2: 1228-1231.[DOI:10.1109/ICASSP.1996.543588]
[11]  Chau W S, Au O C, Chong T S. Key frame selection by macroblock type and motion vector analysis[C]// Proceedings of International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2004, 1: 575-578.[DOI:10.1109/ICME.2004.1394257]
[12]  Mei T, Tang L X, Tang J, et al. Near-lossless semantic video summarization and its applications to video analysis[J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2013, 9(3): #16.
[13]  Kim J G, Chang H S, Kim J, et al. Efficient camera motion characterization for MPEG video indexing[C]// Proceedings of International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2000, 2: 1171-1174.[DOI:10.1109/ICME.2000.871569]
[14]  Lienhart R, Pfeiffer S, Effelsberg W. Video abstracting [J]. Communications of the ACM, 1997, 40(12): 54-62.
[15]  Zhang D, Chang S F. Event detection in baseball video using superimposed caption recognition[C]//Proceedings of the 10th ACM International conference on Multimedia. New York, USA:ACM, 2002: 315-318.
[16]  Taskiran C M, Pizlo Z, Amir A, et al. Automated video program summarization using speech transcripts [J]. IEEE Transactions on Multimedia, 2006, 8(4): 775-791.
[17]  Lee S, Kim H. News keyword extraction for topic tracking[C]// Proceedings of the 4th International Conference on Networked Computing and Advanced Information Management. Washington DC,USA: IEEE, 2008, 2: 554-559.[DOI:10.1109/NCM.2008.199]
[18]  Evangelopoulos G, Zlatintsi A, Potamianos A, et al. Multimodal saliency and fusion for movie summarization based on aural, visual, and textual attention[C] //IEEE Transactions on Multimedia, Washington DC,USA: IEEE, 2013:1553-1568.[DOI:10. 1109/TMM.2013.2267205]
[19]  Jiang W, Cotton C, Loui A C. Automatic consumer video summarization by audio and visual analysis[C]// Proceedings of International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2011: 1-6.[DOI:10.1109/ICME.2011.6011841]
[20]  Xu S, Feng B, Xu B. Multi-modal topic unit segmentation in videos using conditional random fields[C]// Proceedings of International Conference on Acoustics, Speech and Signal Processing. Washington DC,USA: IEEE, 2013: 2287-2291.[DOI:10. 1109/ICASSP.2013.6638062]
[21]  更多...
[22]  Fu W, Wang J, Zhao C, et al. Object-centered narratives for video surveillance[C]// Proceedings of the 19th IEEE International Conference on Image Processing. Washington DC,USA: IEEE, 2012: 29-32.[DOI:10.1109/ICIP.2012.64666787]
[23]  Lin C, Tsai C, Kang L, et al. Scene-based movie summarization via role-community networks [C]//IEEE Transactions on Circuits and Systems for Video Technology, Washington DC,USA: IEEE, 2013:1927-1940.[DOI:10.1109/TCSVT.2013.2269186]
[24]  Tamrakar A, Ali S, Yu Q,et al. Evaluation of low-level features and their combinations for complex event detection in open source videos[C]// Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition. Washington DC,USA: IEEE,2012; 3681-3688.[DOI:10.1109/CVPR.2012.6248114]
[25]  Babaguchi N, Kawai Y, Kitahashi T. Generation of personalized abstract of sports video[C]// Proceedings of International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2001: 619-622.[DOI:10.1109/ICME.2001.1237796]
[26]  Agnihotri L, Dimitrova N, Kender J R. Design and evaluation of a music video summarization system[C]// Proceedings of International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2004, 3: 1943-1946.[DOI:10.1109/ICME.2004.1394641]
[27]  Aizawa K, Ishijima K, Shiina M. Summarizing wearable video[C]// Proceedings of International Conference on Image Processing, Washington DC,USA: IEEE, 2001, 3: 398-401.[DOI:10.1109/ICIP.2001.958135]
[28]  Aizawa K, Tancharoen D, Kawasaki S, et al. Efficient retrieval of life log based on context and content[C]//Proceedings of the 1st ACM Workshop on Continuous Archival and Retrieval of Personal Experiences. New York, USA:ACM, 2004: 22-31.
[29]  Peng W T, Chu W T, Chang C H, et al. Editing by viewing: automatic home video summarization by viewing behavior analysis[J]. IEEE Transactions on Multimedia, 2011, 13(3): 539-550.
[30]  Yoshitaka A, Sawada K. Personalized Video summarization based on behavior of viewer[C]// Proceedings of the 8th International Conference on Signal Image Technology and Internet Based Systems. Washington DC,USA: IEEE, 2012: 661-667.[DOI:10.1109/SITIS.2012.100]
[31]  Syeda-Mahmood T, Ponceleon D. Learning video browsing behavior and its application in the generation of video previews[C]//Proceedings of the 9th ACM International Conference on Multimedia. New York, USA:ACM, 2001: 119-128.
[32]  Mongy S. A study on video viewing behavior: application to movie trailer miner[J]. The International Journal of Parallel, Emergent and Distributed Systems, 2007, 22(3): 163-172.
[33]  Jain A K, Murty M N, Flynn P J. Data clustering: a review [J]. ACM Computing Surveys, 1999, 31(3): 264-323.
[34]  Amiri A, Fathy M. Hierarchical keyframe-based video summarization using QR-decomposition and modified k-means clustering [J]. EURASIP Journal on Advances in Signal Processing, 2010: #102.
[35]  Guimar?es S J F, Gomes W. A static video summarization method based on hierarchical clustering[M]//Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Berlin, Heidelberg: Springer, 2010: 46-54.
[36]  Frey B J, Dueck D. Clustering by passing messages between data points [J]. Science, 2007, 315(5814): 972-976.
[37]  Xie X, Wu F. Automatic video summarization by affinity propagation clustering and semantic content mining[C]// The 2008 International Symposium on Electronic Commerce and Security. Washington DC,USA: IEEE, 2008: 203-208.[DOI:10.1109/ISECS.2008.118]
[38]  Shafeian H, Bhanu B. Integrated personalized video summarization and retrieval[C]// Proceedings of the 21st International Conference on Pattern Recognition. Washington DC,USA: IEEE, 2012: 996-999.
[39]  Latecki L J, DeMenthon D, Rosenfeld A. Extraction of key frames from videos by polygon simplification[J]. Proc. of Signal Processing and its Applications, 2001: 643-646.
[40]  Han S H, Kweon I S. Scalable temporal interest points for abstraction and classification of video events[C]// Proceedings of IEEE International Conference on Multimedia and Expo. Washington DC,USA: IEEE, 2005:670-673.[DOI:10.1109/ICME.2005.1521512]
[41]  Albanese M, Fayzullin M, Picariello A, et al. The priority curve algorithm for video summarization [J]. Information Systems, 2006, 31(7): 679-695.
[42]  Lavee G, Rivlin E, Rudzsky M. Understanding video events: a survey of methods for automatic interpretation of semantic occurrences in video [J]. IEEE Transactions on Systems, Man, and Cybernetics: Part C: Applications and Reviews, 2009, 39(5): 489-504.
[43]  Zawbaa H M, El-Bendary N, Abraham A. SVM-based soccer video summarization system[C]// Proceedings of the 3rd World Congress on Nature and Biologically Inspired Computing (NaBIC). Washington DC,USA: IEEE, 2011: 7-11.[DOI:10.1109/NaBIC.2011.6089409]
[44]  Huang C L, Chang C Y. Video summarization using hidden Markov model[C]// Proceedings of International Conference on Information Technology: Coding and Computing. Washington DC,USA: IEEE, 2001: 473-477.[DOI:10.1109/ITCC.2001.918841]
[45]  Furini M, Geraci F, Montangero M, et al. STIMO: STIll and MOving video storyboard for the web scenario [J]. Multimedia Tools and Applications, 2010, 46(1): 47-69.
[46]  Valdés V, Martínez J M. On-line video abstract generation of multimedia news [J]. Multimedia Tools and Applications, 2012, 59(3): 795-832.
[47]  Fu Y, Guo Y, Zhu Y, et al. Multi-view video summarization [J]. IEEE Transactions on Multimedia, 2010, 12(7): 717-729.
[48]  Leo C, Manjunath B S. Multicamera video summarization and anomaly detection from activity motifs [J]. ACM Transactions on Sensor Networks (TOSN), 2014, 10(2): #27. [DOI:10.1145/2530285]
[49]  Truong B T, Venkatesh S. Video abstraction: A systematic review and classification [J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2007, 3(1): #3.
[50]  Liu T, Zhang X, Feng J, et al. Shot reconstruction degree: a novel criterion for key frame selection [J]. Pattern Recognition Letters, 2004, 25(12): 1451-1457.
[51]  He L, Sanocki E, Gupta A, et al. Auto-summarization of audio-video presentations[C]//Proceedings of the 7th ACM International Conference on Multimedia: Part 1. New York, USA:ACM, 1999: 489-498.
[52]  Valdés V, Martínez J M. Automatic evaluation of video summaries [J]. ACM Transactions on Multimedia Computing, Communications, and Applications, 2012, 8(3): #25. [DOI:10.1145/2240136.2240138]

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133