OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

中国图象图形学报 2007

新闻视频故事单元分割技术综述

DOI: 10.11834/jig.20071102

冀中,张春田,苏育挺

Keywords: 故事单元分割,基于内容的视频检索,新闻视频,上下文信息

Full-Text Cite this paper Add to My Lib

Abstract:

新闻视频的故事单元分割一般采用统计学或者信息沦的方法，将新闻节目分割成一系列有各自主题内容的故事单元。这些单元反映的是视频流的高层语义，是建立视频索引的最佳层次。该文对这一技术进行了综述，将现有方法根据利用信息的角度分为3类：单模态的分割方法、多模态融合的分割方法和基于上下文信息的分割方法，并且详细讨论了每一类方法的特点。此外，还分析了一些分割错误的原因和今后的发展趋势。

References

[1]	Smeaton A F,Kraaij W,Over P.TRECVID 2003-An Overview[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/tv3intro.paper.ps,2003-12-04.
[2]	Kraaij W,Smeaton A F,Over P,et al.TRECVID 2004-An Overview[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers04/tv4 overview.pdf,2005-02-23.
[3]	Hsu W,Kennedy L,Huang C W,et al.News video story segmentation using fusion of multi-level multi-modal features in TRECVID 2003[A].In:Proceedings of IEEE International Conference on Acoustics,Speech,and Signal Processing[C],Montreal,Queboc,Canada,2004:Ⅲ645～Ⅲ648.
[4]	Janvier B,Bruno E,Marchand-Maillet S,et al.Performance evaluation of a contextual news story segmentation algorithm[A].In:Proceedings of International Conference on Multimedia Content Analysis,Management,and Retrieval 2006[C],San Jose,CA,US,2006:60730X-1～60730X-10.
[5]	Sugano M,Hoash K,Mutsumato K,et al.Shot Boundary Determination on MPEG Compressed Domain and Story Segmentation Experiments for TRECVID 2003[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/kddi.final2.paper.pdf.
[6]	Allan J,Carbonell J,Doddington G,et al.Topic detection and tracking pilot study final report[A].In:Proceedings of DARPA Broadcast News Transcription and Understanding Workshop[C],Lansdowne,Virginia,USA,1998:194～218.
[7]	Hsu W,Chang S F,Huang C W,et al.Discovery and fusion of salient multi-modal features towards news story segmentation[A].In:Proceedings of International Conference on Storage and Retrieval Methods and Applications for Multimedia 2004[C],San Jose,CA,USA,2004:244～258.
[8]	Gao X B,Li J,Yang B.A graph-theoretical clustering based anchorperson shot detection for news video indexing[A].In:International Conference on Computational Intelligence and Multimedia Applications[C],Xi\'an,China,2003:108～113.
[9]	Eichmann D,Park D J.Experiments in Boundaries Recognition at the University of Iowa[EB/OL].http://www.itl.nist.gov/iaui/894.02/projeets/tvpubs/tvpapers03/uiowa.paper.pdf.
[10]	Gargi U,Kasturi R,Strayer S H.Performance characterization of video-shot-change detection methods[J].IEEE Transactions on Circuits and Systems for Video Technology,2000,10(1):1～13.
[11]	Hanjalic Alan.Shot-boundary detection:unraveled and resolved[J].IEEE Transactions on Circuits and Systems for Video Technology,2002,12(2):90～105.
[12]	Zhang H J,Gong Y,Smoliar S W,et al.Automatic parsing of news video[A].In:Proceedings of the International Conference on Multimedia Computing and Systems[C],Boston,NJ,USA,1994:45～54.
[13]	Merlino A,Morey D,Maybury M.Broadcast news navigation using story segmentation[A].In:Proceedings of ACM Multimedia\' 97[C],Bedford,MA,USA,1997:381～391.
[14]	TDT-4 Corpus Annotation Specification[EB/OL].http://projects.ldc.Upenn.edu/TDT4/Annotation/annot_task_def_Vl.4.pdf,2002,11.
[15]	Arlandis J,Over P,Kraaij W.Boundary error analysis and categorization in the TRECVID news story segmentation task[A].In:Proceedings of International Conference on Image and Video Retrieval[C],Singapore,2005:103～112.
[16]	Gauvain J,Lamel L,Adda G.The LIMSI broadcast news transcription system[J]-Speech Communication,2002,37(1-2):89～108.
[17]	Chaisorn L,Chua T S,Koh C K,et al.A Two-Level Multi-Modal Approach for Story Segmentation of Large News Video Corpus[EB/OL].http://www-nlpir.nist.gov/projects/tvpubs/tvpapers03/nus.final.paper.pdf.
[18]	Wang C,Wang Y,Liu H Y,et al.Automatic story segmentation of news video based on audio-visual features and text information[A].In:Proceedings of International Conference on Machine Learning and Cybernetics[C],Xi\'an,China,2003:3008～3011.
[19]	Qi W,Gu L,Jiang H,et al.Integrating visual,audio and text analysis for news video[A].In:Proceedings of International Conference on Mage Processing[C],Vancouver,BC,Canada,2000:520～523.
[20]	Chua T S,Chang S F,Chaisorn L,et al.Story Boundary Detection in Large Broadcast News Video Archives-Techniques,Experience and Trends[A].In:Proceedings of ACM Multimedia \' 2004[C].New York,US,2004:656～659.
[21]	Chaisorn L,Chua T S.The segmentation and classification of story boundaries in news video[A].In:Proceedings of International Conference on Visual and Multimedia Information Management[C],Brisbane,Australia,2002:95～109.
[22]	Yamron J P,Gillick L,Knecht S,et al.Statistical models for tracking and detection[A].In:Proceedings of the DARPA Topic Detection and Tracking Workshop[C].Gaithersburg,Maryland,US,2000:139～144.
[23]	Beeferman D,Berger A,Lafferty J.Statistical models for text segmentation[J].Machine Learning,1999,34(1):177～210.
[24]	Liu Z,Huang J C,Wang Y.Classification of TV programs based on audio information using hidden Markov model[A].In:Proceedings of IEEE Workshop on Multimedia Signal Processing[C],Redondo Beach,CA,USA,1998:27～32.
[25]	Lu L,Zhang H J,Li S Z.Content-based audio classification and segmentation by using support vector machines[J].Multimedia Systems,2003,8(6):482～492.
[26]	Hanjalic A,Lagensijk R L,Biemond J.Template-based detection of anchorperson shots in news programs[A].In:Proceedings of International Conference on Image Processing[C],Chicago,IL,US,1998:148～152.
[27]	更多...
[28]	Hsu W,Chang S F.Generative,discriminative,and ensemble learning on multi-modal perceptual fusion toward news video story segmentation[A].In:Proceedings of IEEE International Conference on Multimedia and Expo[C].Taipei,China,2006:1091～1094.
[29]	Shriberg E,Stolcke A,Hakkani-Tur D,et al.Prosody-based automatic segmentation of speech into sentences and topics[J].Speech Communication,2000,32(1):127～154.
[30]	Lan D J,Ma Y F,Zhang H J.Multi-level anchorperson detection using multimodal association[A].In:Proceedings of International Conference on Pattern Recognition[C],Cambridge,UK,2004:890～893.
[31]	Browne P,Czirjek C,Gaughan G,et al.Dublin City University Video Track Experiments for TREC 2003[EB/OL].http://www.nlpir.nist.gov/projects/tvpubs/tvpapers03/dublin.Lee.paper.pdf.
[32]	Hoashi K,Sugano M,Naito M,et al.Shot Boundary Determination on MPEG Compressed Domain and Story Segmentation Experiments for TRECVID\' 2004[EB/OL].http://www-24.nist.gov/projects/tvpubs/tvpapers04/kddi.pdf,2004,11.
[33]	Hsu W,Chang S F.A statistical framework for fusing mid-level perceptual features in news story segmentation[A].In:Proceedings of IEEE International Conference on Multimedia and Expo[C].Baltimore,MD,USA,2003:Ⅱ-413～416.
[34]	Slonim N.The Information Bottleneck:Theory and Applications[D].Jerusalem,Israel,Hebrew University,2002.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133