|
计算机应用研究 2008
Automatic proofreading techniques for texts digitization
|
Abstract:
Aiming at improving the performance of texts digitization system,with the characteristics of errors analyzed,an automatic proofreading method based on rules and statistics was proposed,making use of frequency statistical tree for error check model,segmentation information for interpunctions correction,Biao-Xing code and cache for correcting suggestions.The experiment results indicate that this method gets an 84.65% recall,a 78.89% precision,a 9.07% false correction ratio and can meet the digitization system requirements.