%0 Journal Article %T Study on Chinese-English corpus construction toward multiple-domain resources
面向多领域资源的汉英双语语料库构建的研究 %A LI Xiao-guang %A WANG Peng %A ZHANG Wei %A WANG Da-ling %A
李晓光 %A 王鹏 %A 张威 %A 王大玲 %J 计算机应用 %D 2008 %I %X With the consideration of the features of open, multiple-domain and layout regularity of bilingual resources on Web, a mixture probabilistic alignment model was proposed to reveal the domain-specific and position-specific characteristic for aligning texts. Compared to the traditional lengthen-based aligning model, the model in this paper achieves 37% and 40.4% improvement on precise and recall respectively with the extensive experiments. %K statistic probabilistic alignment %K mixture probability model %K multiple-domain
统计概率对齐 %K 混合模型 %K 多领域 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=831E194C147C78FAAFCC50BC7ADD1732&aid=7AE7EADA8DCC910FB5442D12F9D76345&yid=67289AFF6305E306&vid=D3E34374A0D77D7F&iid=CA4FD0336C81A37A&sid=A020552C37306588&eid=856C2E13D1000DB7&journal_id=1001-9081&journal_name=计算机应用&referenced_num=0&reference_num=7