全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

新闻视频故事单元分割技术综述

DOI: 10.11834/jig.20071102

Keywords: 故事单元分割,基于内容的视频检索,新闻视频,上下文信息

Full-Text   Cite this paper   Add to My Lib

Abstract:

新闻视频的故事单元分割一般采用统计学或者信息沦的方法,将新闻节目分割成一系列有各自主题内容的故事单元。这些单元反映的是视频流的高层语义,是建立视频索引的最佳层次。该文对这一技术进行了综述,将现有方法根据利用信息的角度分为3类:单模态的分割方法、多模态融合的分割方法和基于上下文信息的分割方法,并且详细讨论了每一类方法的特点。此外,还分析了一些分割错误的原因和今后的发展趋势。

References

[1]  Smeaton A F,Kraaij W,Over P.TRECVID 2003-An Overview[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/tv3intro.paper.ps,2003-12-04.
[2]  Kraaij W,Smeaton A F,Over P,et al.TRECVID 2004-An Overview[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers04/tv4 overview.pdf,2005-02-23.
[3]  Hsu W,Kennedy L,Huang C W,et al.News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003[A].In:Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing[C],Montreal,Queboc,Canada,2004:Ⅲ645~Ⅲ648.
[4]  Janvier B,Bruno E,Marchand-Maillet S,et al.Performance evaluation of a contextual news story segmentation algorithm[A].In:Proceedings of International Conference on Multimedia Content Analysis,Management,and Retrieval 2006[C],San Jose,CA,US,2006:60730X-1~60730X-10.
[5]  Sugano M,Hoash K,Mutsumato K,et al.Shot Boundary Determination on MPEG Compressed Domain and Story Segmentation Experiments for TRECVID 2003[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/kddi.final2.paper.pdf.
[6]  Allan J,Carbonell J,Doddington G,et al.Topic detection and tracking pilot study final report[A].In:Proceedings of DARPA Broadcast News Transcription and Understanding Workshop[C],Lansdowne,Virginia,USA,1998:194~218.
[7]  Hsu W,Chang S F,Huang C W,et al.Discovery and fusion of salient multi-modal features towards news story segmentation[A].In:Proceedings of International Conference on Storage and Retrieval Methods and Applications for Multimedia 2004[C],San Jose,CA,USA,2004:244~258.
[8]  Gao X B,Li J,Yang B.A graph-theoretical clustering based anchorperson shot detection for news video indexing[A].In:International Conference on Computational Intelligence and Multimedia Applications[C],Xi\'an,China,2003:108~113.
[9]  Eichmann D,Park D J.Experiments in Boundaries Recognition at the University of Iowa[EB/OL].http://www.itl.nist.gov/iaui/894.02/projeets/tvpubs/tvpapers03/uiowa.paper.pdf.
[10]  Gargi U,Kasturi R,Strayer S H.Performance characterization of video-shot-change detection methods[J].IEEE Transactions on Circuits and Systems for Video Technology,2000,10(1):1~13.
[11]  Hanjalic Alan.Shot-boundary detection:unraveled and resolved[J].IEEE Transactions on Circuits and Systems for Video Technology,2002,12(2):90~105.
[12]  Zhang H J,Gong Y,Smoliar S W,et al.Automatic parsing of news video[A].In:Proceedings of the International Conference on Multimedia Computing and Systems[C],Boston,NJ,USA,1994:45~54.
[13]  Merlino A,Morey D,Maybury M.Broadcast news navigation using story segmentation[A].In:Proceedings of ACM Multimedia\' 97[C],Bedford,MA,USA,1997:381~391.
[14]  TDT-4 Corpus Annotation Specification[EB/OL].http://projects.ldc.Upenn.edu/TDT4/Annotation/annot_task_def_Vl.4.pdf,2002,11.
[15]  Arlandis J,Over P,Kraaij W.Boundary error analysis and categorization in the TRECVID news story segmentation task[A].In:Proceedings of International Conference on Image and Video Retrieval[C],Singapore,2005:103~112.
[16]  Gauvain J,Lamel L,Adda G.The LIMSI broadcast news transcription system[J]-Speech Communication,2002,37(1-2):89~108.
[17]  Chaisorn L,Chua T S,Koh C K,et al.A Two-Level Multi-Modal Approach for Story Segmentation of Large News Video Corpus[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/nus.final.paper.pdf.
[18]  Wang C,Wang Y,Liu H Y,et al.Automatic story segmentation of news video based on audio-visual features and text information[A].In:Proceedings of International Conference on Machine Learning and Cybernetics[C],Xi\'an,China,2003:3008~3011.
[19]  Qi W,Gu L,Jiang H,et al.Integrating visual,audio and text analysis for news video[A].In:Proceedings of International Conference on Mage Processing[C],Vancouver,BC,Canada,2000:520~523.
[20]  Chua T S,Chang S F,Chaisorn L,et al.Story Boundary Detection in Large Broadcast News Video Archives-Techniques,Experience and Trends[A].In:Proceedings of ACM Multimedia \' 2004[C].New York,US,2004:656~659.
[21]  Chaisorn L,Chua T S.The segmentation and classification of story boundaries in news video[A].In:Proceedings of International Conference on Visual and Multimedia Information Management[C],Brisbane,Australia,2002:95~109.
[22]  Yamron J P,Gillick L,Knecht S,et al.Statistical models for tracking and detection[A].In:Proceedings of the DARPA Topic Detection and Tracking Workshop[C].Gaithersburg,Maryland,US,2000:139~144.
[23]  Beeferman D,Berger A,Lafferty J.Statistical models for text segmentation[J].Machine Learning,1999,34(1):177~210.
[24]  Liu Z,Huang J C,Wang Y.Classification of TV programs based on audio information using hidden Markov model[A].In:Proceedings of IEEE Workshop on Multimedia Signal Processing[C],Redondo Beach,CA,USA,1998:27~32.
[25]  Lu L,Zhang H J,Li S Z.Content-based audio classification and segmentation by using support vector machines[J].Multimedia Systems,2003,8(6):482~492.
[26]  Hanjalic A,Lagensijk R L,Biemond J.Template-based detection of anchorperson shots in news programs[A].In:Proceedings of International Conference on Image Processing[C],Chicago,IL,US,1998:148~152.
[27]  更多...
[28]  Hsu W,Chang S F.Generative,discriminative,and ensemble learning on multi-modal perceptual fusion toward news video story segmentation[A].In:Proceedings of IEEE International Conference on Multimedia and Expo[C].Taipei,China,2006:1091~1094.
[29]  Shriberg E,Stolcke A,Hakkani-Tur D,et al.Prosody-based automatic segmentation of speech into sentences and topics[J].Speech Communication,2000,32(1):127~154.
[30]  Lan D J,Ma Y F,Zhang H J.Multi-level anchorperson detection using multimodal association[A].In:Proceedings of International Conference on Pattern Recognition[C],Cambridge,UK,2004:890~893.
[31]  Browne P,Czirjek C,Gaughan G,et al.Dublin City University Video Track Experiments for TREC 2003[EB/OL].http://www.nlpir.nist.gov/projects/tvpubs/tvpapers03/dublin.Lee.paper.pdf.
[32]  Hoashi K,Sugano M,Naito M,et al.Shot Boundary Determination on MPEG Compressed Domain and Story Segmentation Experiments for TRECVID\' 2004[EB/OL].http://www-24.nist.gov/projects/tvpubs/tvpapers04/kddi.pdf,2004,11.
[33]  Hsu W,Chang S F.A statistical framework for fusing mid-level perceptual features in news story segmentation[A].In:Proceedings of IEEE International Conference on Multimedia and Expo[C].Baltimore,MD,USA,2003:Ⅱ-413~416.
[34]  Slonim N.The Information Bottleneck:Theory and Applications[D].Jerusalem,Israel,Hebrew University,2002.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133