全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...

Logistic视频字幕增强模型

DOI: 10.11834/jig.20140505

Keywords: 复杂背景,字幕增强,Logistic模型,字幕检测与跟踪,时域特征

Full-Text   Cite this paper   Add to My Lib

Abstract:

目的为提高复杂背景下的视频字幕在光学字符识别(OCR)中的识别率,需要对提取的视频字幕进行有效地字幕增强。首次将Logistic模型应用到视频字幕增强中,提出了基于Logistic模型的融合多帧信息的视频字幕增强方法。方法对字幕进行检测与跟踪,将出现在连续多帧中的同一字幕片段进行对齐;通过分析字幕片段在多帧中信息,提出字幕背景在时域上的变化特征、背景和字幕文本的固有特征,并将3个特征进行量化与融合,构建适用于字幕增强的Logistic模型,实现对视频字幕的增强。结果对含阴影或描边效果的特殊复杂背景字幕、普通复杂背景字幕、单一背景字幕分别进行实验,增强后的字幕在OCR软件中的识别正确率分别为81.76%、97.13%、98.19%,与对比方法比较均有一定的提高。结论实验结果表明,本文方法既可以降低字幕背景的复杂度,又可以提高字幕背景与文本的对比度,从而可以对复杂背景和单一背景下的视频字幕进行有效地增强。

References

[1]  Otsu N. A threshold selection method from grey-level histograms [J]. IEEE Transactions on Systems,Man,and Cybernetics,1979,9(1):377-393.
[2]  Niblack W. An Introduction to Digital Image Processing [M]. Denmark:Strandberg Publishing Company Birkeroed, 1986,115-116.
[3]  Liu K. Extraction and recognition of video caption[D]. Beijing:Beijing Information Science and Technology University,2009. [刘坤.视频字幕的提取与识别研究[D].北京:北京信息科技大学, 2009.]
[4]  Wang Y D,Jiang X S. News video text segmentation algorithm based on gradient reinforcement [J]. Jounal of Computer-aided Design & Computer Graphics, 2009,21(8):1170-1174.[王一丁, 蒋小森.基于梯度增强的新闻字幕分割算法[J]. 计算机辅助设计与图形学学报, 2009,21(8):1170-1174.]
[5]  Li H,Doermann D. Text enhancement in digital video using multiple frame integration[C]//Proceedings of the seventh ACM international conference on Multimedia(Part 1). New York, USA:ACM,1999:19-22.[DOI:10.1145/319463.319466]
[6]  Yi J,Peng Y X,Xiao J G. Recognition of text in video based on color clustering and multiple frame integration [J].Journal of Software,2011,22(12):2919-2933.[易剑, 彭宇新, 肖建国.基于颜色聚类和多帧融合的视频文字识别方法[J].软件学报, 2011,22(12):2919-2933.]
[7]  Zhu C J,Li C,Xue L,et al. Video text enhancement using multiple frame information [J]. Journal of Image and Graphics,2008,13(9):1667-1672. [朱成军, 李超, 薛玲, 等.一种基于多帧视频的文本图像质量增强方法[J].中国图象图形学报, 2008,13(9):1667-1672.][DOI:10.11834/jig.20080907]
[8]  Mi C J,Li Y,Xue X Y. Video texts tracking and segmentation based on multiple frames[J]. Journal of Computer Research and Development,2006,43(9):1523-1529.[密聪杰,刘洋,薛向阳. 基于多帧图像的视频文字跟踪和分割算法[J]. 计算机研究与发展,2006,43(9):1523-1529.]
[9]  Shivakumara P,Dutta A,Tan C L,et al. Multi-oriented scene text detection in video based on wavelet and angle projection boundary growing [J]. Multimedia Tools and Applications,2013,2:1-25. [DOI:10.1007/s11042-013-1385-0]
[10]  Cao X X,Liu J,Yang X D,et al. A novel algorithm for the video caption extraction [J]. Acta Scientiarum Naturalium Universitatis Pekinensis,2013,49(2):197-202. [曹喜信,刘京,杨旭东,等. 一种新的视频字幕提取算法[J]. 北京大学学报:自然科学版,2013,49(2):197-202.]
[11]  Yang Z,Shi P. Caption detection and text recognition in news video[C]//Proceedings of the 5th International Congress on Image and Signal Processing. Washington DC,USA:IEEE Computer Society,2012:188-191. [DOI:10.1109/CISP. 2012. 6469754]
[12]  Sang L. Rolling and non-rolling news subtitle location and segmentation [D].Shanghai:Shanghai Jiao Tong University, 2012.[桑亮.滚动与非滚动新闻字幕的定位与分割[D].上海:上海交通大学, 2012.]
[13]  Bouaziz B,Mahdi W,Ardabilain M,et al. A new approach for texture features extraction:Application for text localization in video images[C]//Proceedings of the IEEE International Conference on Multimedia and Expo. Washington DC,USA:IEEE Computer Society,2006:1737-1740. [DOI:10.1109/ICME. 2006.262886]
[14]  Zhou C J. Video caption detection algorithm based on multiple instance learning [D]. Heilongjiang:Harbin Engineering University,2012. [周长建.基于多示例学习的视频字幕提取算法研究[D].黑龙江:哈尔滨工程大学, 2012.]
[15]  Xu H Y. Object recognition and motion tracking based on viedo [D]. Shanghai:Shanghai Jiao Tong University,2013. [许涵洋.基于视频的物体识别及运动跟踪[D].上海:上海交通大学, 2013.]

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133