%0 Journal Article
%T Text Extraction Based on Maximum-Minimum Similarity Training Method
基于最大-最小相似度学习方法的文本提取
%A FU Hui
%A LIU Xia-Bi
%A JIA Yun-De
%A
付慧
%A 刘峡壁
%A 贾云得
%J 软件学报
%D 2008
%I
%X This paper proposes a maximum-minimum similarity training algorithm to optimize the parameters in the effective method of text extraction based on Gaussian mixture modeling of neighbor characters. The maximum-minimum similarity training (MMS) methods optimize recognizer performance through maximizing the similarities of positive samples and minimizing the similarities of negative samples. Based on this approach to discriminative training, it defines the objective function for text extraction, and uses the gradient descent method to search the minimum of the objective function and the optimum parameters for the text extraction method. The experimental results of text extraction show the effectiveness of MMS training in text extraction. Compared with the maximum likelihood estimation of parameters from expectation maximization (EM) algorithm, the training results after MMS has the performance of text extraction improved greatly. The recall rate of 98.55% and the precision rate of 93.56% are achieved. The experimental results also show that the maximum-minimum similarity (MMS) training behaves better than the commonly used discriminative training of the minimum classification error (MCE).
%K text extraction
%K Gaussian mixture modeling
%K discriminative training
%K maximum-minimum similarity training
%K minimum classification error training
文本提取
%K 高斯混合模型
%K 判别学习
%K 最大-最小相似度学习
%K 最小分类错误学习
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=7735F413D429542E610B3D6AC0D5EC59&aid=CBA5847D466A70DEE0963AEE7336CA7D&yid=67289AFF6305E306&vid=2A8D03AD8076A2E3&iid=38B194292C032A66&sid=BA3451F2C9E4FB70&eid=E4EC39E73004B593&journal_id=1000-9825&journal_name=软件学报&referenced_num=0&reference_num=26