|
计算机应用研究 2004
A Normalization Method of Multi-font Printed Tibetan Characters
|
Abstract:
In an OCR (Optical Character Recognition) system,character normalization is a crucial step to eliminate variations in character size or position.In this paper,based on the detailed analysis of the characteristics of shape and stroke distribution of multi-font printed tibetan characters,a new normalization algorithm for tibetan OCR is proposed.Firstly character position is normalized combing profile information with the centroid of input character images.Then the 48×96 block is introduced to perform the size normalization by cubic B-spline.The effectiveness of proposed algorithm is demonstrated by experimental results.