%0 Journal Article %T Fusing audio-words with visual features for adult video detection
融合音频单词与视觉特征的成人视频检测 %A Liu Yizhi %A Tang Sheng %A Wang Xiangdong %A Lin Shouxun %A Zhang Yongdong %A
刘毅志 %A 唐胜 %A 王向东 %A 林守勋 %A 张勇东 %J 中国图象图形学报 %D 2012 %I %X Multi-modality based adult video detection is an effective approach for filtering pornographic information. However, existing methods lack an accurate representation of audio semantics. Therefore, this paper presents a novel method that fuses audio-words with visual features for adult video detection. First, we propose a periodicity-based algorithm that segments audio streams into sequences of energy envelope (EE) units. Second, an audio semantics representation method based on EE and BoW (Bag-of-Words) is presented to describe the features of the EE units as the occurrence probabilities of audio-words. Integrated weighting methods are then used to fuse the detection results of audio-words and visual features. Furthermore, we propose a periodicity-based decision algorithm for judging adult videos, which complements the preceding periodicity-based segmentation algorithm and thus makes full use of the periodicity. Our experiments show that our approach remarkably improves detection performance compared with the method based on visual features. The true positive rate reaches 94.44% while the false positive rate is 9.76%. %K adult video detection %K multi-modality fusion %K audio-words %K visual features %K units of energy envelope
成人视频检测 %K 多模态融合 %K 音频单词 %K 视觉特征 %K 能量包络单元 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D06194629680C940ACE75262F54B9D85&aid=804D8F4691AB78563DA73D5F1687294C&yid=99E9153A83D4CB11&vid=BCA2697F357F2001&iid=DF92D298D3FF1E6E&sid=4198A31627C9B2A6&eid=97747634025A5F36&journal_id=1006-8961&journal_name=中国图象图形学报&referenced_num=0&reference_num=17