全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Automatic TV Broadcast Structuring

DOI: 10.1155/2010/153160

Full-Text   Cite this paper   Add to My Lib

Abstract:

TV broadcast structuring is needed to precisely extract long useful programs. These can be either archived as part of our audio-visual heritage or used to build added-value novel TV services like TVoD or Catch-up-TV. First, the problem of digital TV content structuring is positioned. Related work and existing solutions are deeply and carefully analyzed. This paper presents then DealTV, our fully automatic system. It is based on studying repeated sequences in the TV stream in order to segment it. Segments are then classified using an inductive logic programming-based technique that makes use of the temporal relationships between segments. Metadata are finally used to label and extract programs using simple overlapping-based criteria. Each processing step of DealTV has been separately evaluated in order to carefully analyze its impact on the final results. The system has been proven on a real TV stream to be very effective. 1. Introduction Broadcasted digital TV contents have incredibly increased over the last few decades. The resulting huge and continuously growing content has given rise to many novel services around TV and video platforms like TV-on-Demand (TVoD), interactive TV, Catch-up-TV, Network Personal Video Records (NPVRs), and so forth. This content is also part of our audio-visual heritage and must be properly archived. Archiving digital TV content is generally achieved by national public institutions like INA in France, Beeld en Geluid in Netherlands, ORF in Austria or BBC archives in UK. Therefore, the digital TV content has to be analyzed and indexed in order to be used within services and to be easily retrieved from archives. Basically, analyzing and indexing a digital TV content consists in finding key instants in the content. These correspond to events of interest (e.g., goals in soccer footage or key scenes in a movie) users may like to directly find through either search engines (in the case of querying an archive) or services built on top of the content. These key instants could also be features and positions that allow structuring the content. Here, the objective is twofold: to properly prepare the content before archiving it in order to easily answer user queries later on; (2) to repurpose the content in another more convenient format to final users. In the case of TV structuring, the main key instants are the start and end times of each program in TV broadcasts. These times allow automatically recovering the structure of the TV stream. They are at the root of novel added-value services or any archiving service. They allow

References

[1]  ETSI EN 300 231, “Television systems; specification of the domestic video programme delivery control system,” European Standard (Telecommunications series), European Telecommunications Standards Institute, 2003.
[2]  S. A. Berrani, P. Lechat, and G. Manson, “TV broadcast macro-segmentation: metadata-based vs. content-based approaches,” in Proceedings of the ACM International Conference on Image and Video Retrieval, pp. 325–332, Amsterdam, The Netherlands, July 2007.
[3]  J. P. Poli and J. Carrive, “Television stream structuring with program guides,” in Proceedings of the IEEE International Symposium on Multimedia, pp. 329–334, San Diego, Calif, USA, December 2006.
[4]  I. Foster and R. L. Grossman, “Data integration in a bandwidth-rich world,” Communications of the ACM, vol. 46, no. 11, pp. 51–57, 2003.
[5]  A. Motro and P. Anokhin, “Fusionplex: resolution of data inconsistencies in the integration of heterogeneous information sources,” Information Fusion, vol. 7, no. 2, pp. 176–196, 2006.
[6]  J. P. Poli, “Predicting program guides for video structuring,” in Proceedings of the 17th IEEE International Conference on Tools with Artificial Intelligence, pp. 407–411, Hong-Kong, November 2005.
[7]  Z. Rasheed and M. Shah, “Detection and representation of scene in videos,” IEEE Transactions on Multimedia, vol. 7, no. 6, pp. 1097–1105, 2005.
[8]  J. R. Kender and B. L. Yeo, “Video scene segmentation via continuous video coherence,” in Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 367–373, Santa Barbara, Calif, USA, January 1998.
[9]  A. Hanjalic, R. L. Lagendijk, and J. Biemond, “Automated high-level movie segmentation for advanced video-retrieval systems,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 9, no. 4, pp. 580–588, 1999.
[10]  J. Wang, L. Duan, Q. Liu, H. Lu, and J. S. Jin, “A multimodal scheme for program segmentation and representation in broadcast video streams,” IEEE Transactions on Multimedia, vol. 10, no. 3, pp. 393–408, 2008.
[11]  J. Haitsma and T. Kalker, “A highly robust audio fingerprinting system,” in Proceedings of the 3rd International Symposium on Music Information Retrieval, Paris, France, 2002.
[12]  A. Joly, C. Frelicot, and O. Buisson, “Content-based video copy detection in large databases: a local fingerprints statistical similarity search approach,” in Proceedings of the IEEE International Conference on Image Processing, vol. 1, pp. 505–508, Genova, Italy, September 2005.
[13]  J. Oostveen, T. Kalker, and J. Haitsma, “Feature extraction and a database strategy for video fingerprinting,” in Proceedings of the 5th International Conference on Recent Advances in Visual Information Systems, pp. 117–128, Hsin Chu, Taiwan, 2002.
[14]  J. Barr, B. Bradley, and B. Hannigan, “Using digital watermarks with image signatures to mitigate the threat of the copy attack,” in Proceeedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP '03), vol. 3, pp. 69–72, Hong Kong, 2003.
[15]  B. Coskun and B. Sankur, “Robust video hash extraction,” in Proceedings of the IEEE 12th Signal Processing and Communications Applications Conference, pp. 292–295, Vienna, Austria, 2004.
[16]  X. Naturel, G. Gravier, and P. Gros, “Fast structuring of large television streams using program guides,” in Proceedings of the 4th International Workshop on Adaptive Multimedia Retrieval, pp. 223–232, Geneva, Switzerland, March 2006.
[17]  R. Lienhart, C. Kuhmunch, and W. Effelsberg, “On the detection and recognition of television commercials,” in Proceedings od the IEEE International Conference on Multimedia Computing and Systems, pp. 509–516, Ottawa, Canada, November 1997.
[18]  J. Gauch and A. Shivadas, “Identification of new commercials using repeated video sequence detection,” in Proceedings of the IEEE International Conference on Image Processing, vol. 3, pp. 1252–1255, Genova, Italy, 2005.
[19]  J. M. Gauch and A. Shivadas, “Finding and identifying unknown commercials using repeated video sequence detection,” Computer Vision and Image Understanding, vol. 103, no. 1, pp. 80–88, 2006.
[20]  M. Covell, S. Baluja, and M. Fink, “Advertisement detection and replacement using acoustic and visual repetition,” in Proceedings of the 8th IEEE International Workshop on Multimedia Signal Processing, pp. 461–466, Victoria, Canada, 2006.
[21]  C. Herley, “ARGOS: automatically extracting repeating objects from multimedia streams,” IEEE Transactions on Multimedia, vol. 8, no. 1, pp. 115–129, 2006.
[22]  A. Albiol, M. J. Fulla, A. Albiol, and L. Torres, “Detection of TV commercials,” in Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP '04), vol. 3, pp. 541–544, Montreal, Canada, 2004.
[23]  S. Fischer, R. Lienhart, and W. Effelsberg, “Automatic recognition of film genres,” in Proceedings of the 3rd ACM International Conference on Multimedia, pp. 295–304, San Francisco, Calif, USA, 1995.
[24]  A. G. Hauptmann and M. J. Witbrock, “Story segmentation and detection of commercials in broadcast news video,” in Proceedings of the IEEE Forum on Reasearch and Technology Advances in Digital Libraries, pp. 168–179, Santa Barbara, Calif, USA, April 1998.
[25]  B. T. Truong, S. Venkatesh, and C. Dorai, “Automatic genre identification for content-based video categorization,” in Proceedings of the 15th International Conference on Pattern Recognition, pp. 4230–4233, Barcelona, Spain, 2000.
[26]  S. A. Berrani, G. Manson, and P. Lechat, “A non-supervised approach for repeated sequence detection in TV broadcast streams,” Signal Processing: Image Communication, vol. 23, no. 7, pp. 525–537, 2008.
[27]  G. Manson and S. A. Berrani, “An inductive logic programming-based approach for TV stream segment classification,” in Proceedings of the IEEE International Symposium on Multimedia, pp. 130–135, Berkeley, Calif, USA, 2008.
[28]  T. Zhang, R. Ramakrishnan, and M. Livny, “Birch: an efficient data clustering method for very large databases,” in Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 103–114, Montreal, Canada, 1996.
[29]  S. Muggleton, “Inverse entailment and progol,” New Generation Computing, vol. 13, no. 3-4, pp. 245–286, 1995.
[30]  A. Srinivasan, “Aleph: a learning engine for proposing hypotheses,” 2007, http://web2.comlab.ox.ac.uk/oucl/research/areas/machlearn/Aleph/aleph.pl.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133