|
计算机科学 2010
Word Segmentation Approach in Military Text on the Basis of Word Combination
|
Abstract:
Since the unknown word in military texts is excessive,and the feature of some words is incomplete,the word segmentation method which is based on lexical chunk as the unit was provided,word segmentation was divided into some sections:bidirectional scanning in the text in the base of dictionary,marking the various and segment the words; deleting the stop-words which share the same segmentation results,then count words mutual information and adjacency frequency by the first time's word segmentation,according t...