OALib Journal期刊
ISSN: 2333-9721
费用：99美元

投递稿件

查看量	下载量

相关文章
更多...

中国图象图形学报 2014

Logistic视频字幕增强模型

DOI: 10.11834/jig.20140505

李钦瑞,吕学强,李卓,刘坤

Keywords: 复杂背景,字幕增强,Logistic模型,字幕检测与跟踪,时域特征

Full-Text Cite this paper Add to My Lib

Abstract:

目的为提高复杂背景下的视频字幕在光学字符识别（OCR）中的识别率，需要对提取的视频字幕进行有效地字幕增强。首次将Logistic模型应用到视频字幕增强中，提出了基于Logistic模型的融合多帧信息的视频字幕增强方法。方法对字幕进行检测与跟踪，将出现在连续多帧中的同一字幕片段进行对齐；通过分析字幕片段在多帧中信息，提出字幕背景在时域上的变化特征、背景和字幕文本的固有特征，并将3个特征进行量化与融合，构建适用于字幕增强的Logistic模型，实现对视频字幕的增强。结果对含阴影或描边效果的特殊复杂背景字幕、普通复杂背景字幕、单一背景字幕分别进行实验，增强后的字幕在OCR软件中的识别正确率分别为81.76%、97.13%、98.19%，与对比方法比较均有一定的提高。结论实验结果表明，本文方法既可以降低字幕背景的复杂度，又可以提高字幕背景与文本的对比度，从而可以对复杂背景和单一背景下的视频字幕进行有效地增强。

References

[1]	Otsu N. A threshold selection method from grey-level histograms [J]. IEEE Transactions on Systems,Man,and Cybernetics,1979,9(1):377-393.
[2]	Niblack W. An Introduction to Digital Image Processing [M]. Denmark:Strandberg Publishing Company Birkeroed, 1986,115-116.
[3]	Liu K. Extraction and recognition of video caption[D]. Beijing:Beijing Information Science and Technology University,2009. [刘坤.视频字幕的提取与识别研究[D].北京：北京信息科技大学, 2009.]
[4]	Wang Y D,Jiang X S. News video text segmentation algorithm based on gradient reinforcement [J]. Jounal of Computer-aided Design & Computer Graphics, 2009,21(8):1170-1174.[王一丁, 蒋小森.基于梯度增强的新闻字幕分割算法[J]. 计算机辅助设计与图形学学报, 2009,21(8):1170-1174.]
[5]	Li H,Doermann D. Text enhancement in digital video using multiple frame integration[C]//Proceedings of the seventh ACM international conference on Multimedia(Part 1). New York, USA:ACM,1999:19-22.[DOI:10.1145/319463.319466]
[6]	Yi J,Peng Y X,Xiao J G. Recognition of text in video based on color clustering and multiple frame integration [J].Journal of Software,2011,22(12):2919-2933.[易剑, 彭宇新, 肖建国.基于颜色聚类和多帧融合的视频文字识别方法[J].软件学报, 2011,22(12):2919-2933.]
[7]	Zhu C J,Li C,Xue L,et al. Video text enhancement using multiple frame information [J]. Journal of Image and Graphics,2008,13(9):1667-1672. [朱成军, 李超, 薛玲, 等.一种基于多帧视频的文本图像质量增强方法[J].中国图象图形学报, 2008,13(9):1667-1672.][DOI：10.11834/jig.20080907]
[8]	Mi C J,Li Y,Xue X Y. Video texts tracking and segmentation based on multiple frames[J]. Journal of Computer Research and Development,2006,43(9)：1523-1529.[密聪杰,刘洋,薛向阳. 基于多帧图像的视频文字跟踪和分割算法[J]. 计算机研究与发展,2006,43(9):1523-1529.]
[9]	Shivakumara P,Dutta A,Tan C L,et al. Multi-oriented scene text detection in video based on wavelet and angle projection boundary growing [J]. Multimedia Tools and Applications,2013,2:1-25. [DOI:10.1007/s11042-013-1385-0]
[10]	Cao X X,Liu J,Yang X D,et al. A novel algorithm for the video caption extraction [J]. Acta Scientiarum Naturalium Universitatis Pekinensis,2013,49(2):197-202. [曹喜信,刘京,杨旭东,等. 一种新的视频字幕提取算法[J]. 北京大学学报:自然科学版,2013,49(2):197-202.]
[11]	Yang Z,Shi P. Caption detection and text recognition in news video[C]//Proceedings of the 5th International Congress on Image and Signal Processing. Washington DC,USA:IEEE Computer Society,2012:188-191. [DOI:10.1109/CISP. 2012. 6469754]
[12]	Sang L. Rolling and non-rolling news subtitle location and segmentation [D].Shanghai:Shanghai Jiao Tong University, 2012.[桑亮.滚动与非滚动新闻字幕的定位与分割[D].上海：上海交通大学, 2012.]
[13]	Bouaziz B,Mahdi W,Ardabilain M,et al. A new approach for texture features extraction:Application for text localization in video images[C]//Proceedings of the IEEE International Conference on Multimedia and Expo. Washington DC,USA:IEEE Computer Society,2006:1737-1740. [DOI:10.1109/ICME. 2006.262886]
[14]	Zhou C J. Video caption detection algorithm based on multiple instance learning [D]. Heilongjiang:Harbin Engineering University,2012. [周长建.基于多示例学习的视频字幕提取算法研究[D].黑龙江：哈尔滨工程大学, 2012.]
[15]	Xu H Y. Object recognition and motion tracking based on viedo [D]. Shanghai:Shanghai Jiao Tong University,2013. [许涵洋.基于视频的物体识别及运动跟踪[D].上海：上海交通大学, 2013.]

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133