%0 Journal Article
%T Quantitative Measurement of Training Sample Capacity for Chinese Statistical Language Models [汉语统计语言模型训练样本容量的定量化度量]
%A ZHANG Yang-sen (张仰森)
%J Computer Science (计算机科学)
%D 2009
%X Training the parameters of a statistical language model is the key step in language modeling. How many training samples are needed to keep the parameter estimation error within a required bound is one of the central problems of language modeling theory. We apply mathematical statistics to derive a method for estimating the lower bound on the training sample capacity for a Chinese language model, and propose a quantitative estimation formula. Using this formula, the corpus sample capacity needed to train the model parameters can be ...
%K Chinese statistical language model (汉语统计语言模型)
%K Training corpus sample (训练语料样本)
%K Sample capacity (样本容量)
%K Relative error (相对误差)
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=64A12D73428C8B8DBFB978D04DFEB3C1&aid=FE1423442F24F43B45023282B87A78F9&yid=DE12191FBD62783C&vid=933658645952ED9F&iid=F3090AE9B60B7ED1&sid=8B59EA573021D671&eid=A1266CF37D675CF1&journal_id=1002-137X&journal_name=计算机科学&referenced_num=0&reference_num=8