%0 Journal Article
%T Page Segmentation and Classification Algorithm for Document Images
文本页面图像的图文分割与分类算法
%A WANG Jia-jun
%A HUANG Xian-wu
%A Guo Wei-wei
%A ZHONG Xing-rong
%A
王加俊
%A 黄贤武
%A 郭玮玮
%A 仲兴荣
%J 中国图象图形学报
%D 2004
%I
%X In this paper, a system valid of the segmentation and classification of skewed document images with irregular graph regions and form regions is proposed. In this system, the skew angle of the document images is detected with a novel algorithm based on the morphological operation of Hit-or-Miss and the hierarchical Hough transform. The former(Hit-or-Miss operation) is for the detection of the baseline points while the latter(Hough transform) is for the detection of the skew angle of the baseline which is also of the page image. To make the system valid for the document images with irregular graph regions involved, we proposed to introduce a middle point cut process to the traditional projection profile cut algorithm so that the irregular graph regions can be approximated with a lot of small rectangles. The segmented regions are classified with two features of the black to white ratio and the cross correlation between adjacent pixels of the sub-blocks. Experimental results have proved the fastness and the reliability of the system proposed in this paper.
%K document image
%K morphological operation
%K image segmentation
%K hough
%K transform
文本图像
%K 图文分割
%K 分类算法
%K 形态学
%K 霍夫变换
%K 二值图像
%K 电子文件
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=D06194629680C940ACE75262F54B9D85&aid=8FA3EEEEAC55653F&yid=D0E58B75BFD8E51C&vid=9CF7A0430CBB2DFD&iid=94C357A881DFC066&sid=F16C0F639D87527E&eid=52B9DFFFCC2EB041&journal_id=1006-8961&journal_name=中国图象图形学报&referenced_num=2&reference_num=8