%0 Journal Article
%T Study on Chinese-English corpus construction toward multiple-domain resources
面向多领域资源的汉英双语语料库构建的研究
%A LI Xiao-guang
%A WANG Peng
%A ZHANG Wei
%A WANG Da-ling
%A
李晓光
%A 王鹏
%A 张威
%A 王大玲
%J 计算机应用
%D 2008
%I
%X With the consideration of the features of open, multiple-domain and layout regularity of bilingual resources on Web, a mixture probabilistic alignment model was proposed to reveal the domain-specific and position-specific characteristic for aligning texts. Compared to the traditional lengthen-based aligning model, the model in this paper achieves 37% and 40.4% improvement on precise and recall respectively with the extensive experiments.
%K statistic probabilistic alignment
%K mixture probability model
%K multiple-domain
统计概率对齐
%K 混合模型
%K 多领域
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=7AE7EADA8DCC910FB5442D12F9D76345&yid=67289AFF6305E306&vid=D3E34374A0D77D7F&iid=CA4FD0336C81A37A&sid=A020552C37306588&eid=856C2E13D1000DB7&journal_id=1001-9081&journal_name=计算机应用&referenced_num=0&reference_num=7