全部 标题 作者
关键词 摘要

OALib Journal期刊
ISSN: 2333-9721
费用:99美元

查看量下载量

相关文章

更多...
Sensors  2011 

An Approach to a Comprehensive Test Framework for Analysis and Evaluation of Text Line Segmentation Algorithms

DOI: 10.3390/s110908782

Keywords: document image processing, text line segmentation, algorithms, experiments framework, testing, signal detection theory

Full-Text   Cite this paper   Add to My Lib

Abstract:

The paper introduces a testing framework for the evaluation and validation of text line segmentation algorithms. Text line segmentation represents the key action for correct optical character recognition. Many of the tests for the evaluation of text line segmentation algorithms deal with text databases as reference templates. Because of the mismatch, the reliable testing framework is required. Hence, a new approach to a comprehensive experimental framework for the evaluation of text line segmentation algorithms is proposed. It consists of synthetic multi-like text samples and real handwritten text as well. Although the tests are mutually independent, the results are cross-linked. The proposed method can be used for different types of scripts and languages. Furthermore, two different procedures for the evaluation of algorithm efficiency based on the obtained error type classification are proposed. The first is based on the segmentation line error description, while the second one incorporates well-known signal detection theory. Each of them has different capabilities and convenience, but they can be used as supplements to make the evaluation process efficient. Overall the proposed procedure based on the segmentation line error description has some advantages, characterized by five measures that describe measurement procedures.

References

[1]  Likforman Sulem, L; Zahour, A; Taconet, B. Text Line Segmentation of Historical Documents: A Survey. IJDAR 2007, 9, 123–138.
[2]  Kavallieratou, E; Stamatatos, S. Discrimination of Machine-Printed from Handwritten Text Using Simple Structural Characteristics. Proceedings of the 17th International Conference on Pattern Recognition (ICPR’04), Cambridge, UK, 23–26 August 2004; pp. 437–440.
[3]  Amin, A; Wu, S. Robust Skew Detection in Mixed Text/Graphics Documents. Proceedings of International Conference on Document Analysis and Recognition (ICDAR), Seoul, Korea, 29 August–1 September 2005.
[4]  Li, Y; Zheng, Y; Doermann, D; Jaeger, S. Script-Independent Text Line Segmentation in Freestyle Handwritten Documents. Technical Report: LAMP-TR-136/CS-TR-4836/UMIACSTR-2006-51/CFAR-TR-1017;; University of Maryland: College Park, MD, USA, 2006.
[5]  Razak, Z; Zulkiflee, K; Idris, MYI; Tamil, EM; Noor, MNM; Salleh, R; Yaakob, M; Yusof, ZM; Yaacob, M. Off-Line Handwriting Text Line Segmentation: A Review. IJCSNS 2008, 8, 12–20.
[6]  Louloudis, G; Gatos, B; Pratikakis, I; Halatsis, C. Text Line and Word Segmentation of Handwritten Documents. Patt. Recogn 2009, 42, 3169–3183.
[7]  Li, Y; Zheng, Y; Doermann, D; Jaeger, S. Script-Independent Text Line Segmentation in Freestyle Handwritten Documents. IEEE Trans. Patt. Anal. Mach. Intell 2008, 30, 1313–1329.
[8]  Sanchez, A; Suarez, PD; Mello, CAB; Oliveira, ALI; Alves, VMO. Text Line Segmentation in Images of Handwritten Historical Documents. Proceedings of the First Workshops on Image Processing Theory, Tools and Applications (IPTA), Sousse, Tunisia, 23–26 November 2008; pp. 1–6.
[9]  Brodi?, D; Milivojevi?, DR; Milivojevi?, Z. Basic Test Framework for the Evaluation of Text Line Segmentation and Text Parameter Extraction. Sensors 2010, 10, 5263–5279.
[10]  Brodi?, D. Methodology for the Evaluation of the Algorithms for Text Line Segmentation Based on Extended Binary Classification. Meas. Sci. Rev 2011, 11, 71–78.
[11]  Brodi?, D; Milivojevi?, DR. Methodology for the Evaluation of the Algorithms for Text Line Segmentation. Proceeding of 10th International Scientific Conference (UNITECH), Gabrovo, Bulgaria, 19–20 November 2010; pp. 424–428.
[12]  Brodi?, D. The Evaluation of the Initial Skew Rate for Printed Text. J. Elect. Eng. Elektrotech. ?asopis 2011, 62, 134–140.
[13]  Mao, M; Peng, Y; Spring, M. Ontology Mapping: As a Binary Classification Problem. Proceedings of the 4th International Conference on Semantics, Knowledge and Grid, Beijing, China, 3–5 December 2008.
[14]  Abdi, H. Signal Detection Theory. In Encyclopedia of Measurement and Statistics; Salkind, NJ, Ed.; Sage Publications, Inc: Thousand Oaks, CA, USA, 2007; pp. 1–9.
[15]  Qian, X; Liu, G; Wang, H; Su, R. Text Detection, Localization, and Tracking in Compressed Video. Sign. Process. Image Commun 2007, 22, 752–768.
[16]  Bukhari, SS; Shafait, F; Bruesl, TM. Adaptive Binarization of Unconstrained Hand-Held Camera-Captured Document Images. J. Univ. Comput. Sci 2009, 15, 3343–3363.
[17]  Shi, Z; Govindaraju, V. Line Separation for Complex Document Images Using Fuzzy Runlength. Proceedings of the International Workshop on Document Image Analysis for Libraries, Palo Alto, CA, USA, 23–24 January 2004.
[18]  Basu, S; Chaudhuri, C; Kundu, M; Nasipuri, M; Basu, DK. Text Line Extraction from Multi-Skewed Handwritten Documents. Patt. Recogn 2007, 40, 1825–1839.
[19]  Brodi?, D; Milivojevi?, Z. New Approach to Water Flow Algorithm for Text Line Segmentation. J. Univ. Comput. Sci 2011, 17, 30–47.
[20]  Brodi?, D. Advantages of the Extended Water Flow Algorithm for Handwritten Text Line Segmentation. In Pattern Recognition and Machine Intelligence; Kuznetsov, SO, Mandal, DP, Kundu, MK, Pal, SK, Eds.; Springer: Berlin, Germany, 2011.
[21]  Brodi?, D; Milivojevi?, Z. Optimization of the Gaussian Kernel Extended by Binary Morphology for Text Line Segmentation. Radioengineering 2010, 19, 718–724.
[22]  Brodi?, D. Optimization of the Anisotropic Gaussian Kernel for Text Segmentation and Parameter Extraction. In Theoretical Computer Science; Callude, CS, Sassone, V, Eds.; Springer-Verlag: Berlin, Germany, 2011.

Full-Text

Contact Us

service@oalib.com

QQ:3279437679

WhatsApp +8615387084133