%0 Journal Article %T Fast dictionary mechanism for Chinese word segmentation
一种快速中文分词词典机制 %A WU Jing-Jing %A JING Ji-Wu %A NIE Xiao-Feng %A WANG Ping-Jian %A
吴晶晶 %A 荆继武 %A 聂晓峰 %A 王平建 %J 中国科学院研究生院学报 %D 2009 %I %X With the growth of global networking over the Internet, the number of articles in Chinese and other native languages is increasing rapidly. Because these character-based languages lack explicit word separators, word segmentation is a prerequisite for processing them, and its performance therefore affects the whole system. In this paper, we propose a new lexicon-based solution to the Chinese word segmentation problem, named double-character-and-long-word-hash-indexing (DCLWHI). Compared with traditional lexicon mechanisms, DCLWHI improves the speed and efficiency of word segmentation without additional memory cost while achieving the same accuracy. %K real-time text processing %K Chinese word segmentation %K lexicon mechanism %K double-character-and-long-word-hash-indexing (DCLWHI)
文本实时处理 %K 中文分词 %K 词典法分词 %K 双字词-长词哈希机制 %U http://www.alljournals.cn/get_abstract_url.aspx?pcid=B5EDD921F3D863E289B22F36E70174A7007B5F5E43D63598017D41BB67247657&cid=B47B31F6349F979B&jid=67CDFDECD959936E166E0F72DE972847&aid=286EAD1F63D5621894C416E1A8A0C8C1&yid=DE12191FBD62783C&vid=96C778EE049EE47D&iid=94C357A881DFC066&sid=7D2B339649A57040&eid=FD6137FFCE59D193&journal_id=1002-1175&journal_name=中国科学院研究生院学报&referenced_num=3&reference_num=12