计算机科学 (Computer Science), 2006
Removing of Preprinted Characters in Document Image Processing
Abstract:
Extraction of recognition items is an important preprocessing step in a document image analysis system. In practice, user fill-in data often cross or touch the preprinted lines and characters, which creates serious problems for recognition engines. In this paper, we propose a practical method for removing preprinted characters. An image matching algorithm is applied to locate the preprinted character, and the character image is then binarized by a mathematical morphology method combined with stroke width information. Finally, the preprinted character is removed based on the variation of the stroke contours. Experimental results on real-life check images demonstrate the efficiency of the proposed method.
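
To make the first step concrete, the following is a minimal sketch of locating a preprinted character by image matching. The abstract does not say which matching algorithm is used, so an exhaustive sum-of-squared-differences (SSD) search is assumed here purely for illustration. The function name `locate_template`, the list-of-rows image representation, and the toy glyph are all hypothetical and do not come from the paper.

```python
from typing import List, Tuple

# Grayscale image as rows of intensities (0 = black, 255 = white).
# This representation is an assumption made for the sketch.
Image = List[List[int]]


def locate_template(image: Image, template: Image) -> Tuple[int, int, int]:
    """Return (row, col, ssd) of the best placement of template in image.

    Exhaustive SSD search: an assumed stand-in for the paper's
    unspecified image matching algorithm. On ties, the first
    placement in row-major order is kept.
    """
    ih, iw = len(image), len(image[0])
    th, tw = len(template), len(template[0])
    if th > ih or tw > iw:
        raise ValueError("template is larger than image")

    best_row, best_col, best_ssd = 0, 0, -1
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            # Accumulate squared intensity differences over the window.
            ssd = 0
            for dr in range(th):
                for dc in range(tw):
                    d = image[r + dr][c + dc] - template[dr][dc]
                    ssd += d * d
            if best_ssd < 0 or ssd < best_ssd:
                best_row, best_col, best_ssd = r, c, ssd
    return best_row, best_col, best_ssd


# Toy example: a 3x3 "T"-shaped preprinted glyph on a blank 6x8 form.
glyph = [[0, 0, 0],
         [255, 0, 255],
         [255, 0, 255]]
form = [[255] * 8 for _ in range(6)]
for dr in range(3):
    for dc in range(3):
        form[2 + dr][3 + dc] = glyph[dr][dc]

print(locate_template(form, glyph))  # (2, 3, 0)
```

On a clean form the search returns the exact position with zero error. When handwriting overlaps the preprinted glyph, plain SSD can be pulled toward a wrong position, which is one reason a practical system would need a more robust matching criterion than this sketch.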