|
现代图书情报技术 2007
A Design of Algorithm for Chinese Phrase Segmentation
|
Abstract:
This paper analyses the shortcoming of segmentation algorithm, designs a new algorithm for Chinese phrase segmentation. By building two levels index for Chinese thesaurus, we attain a highly efficient Chinese phrase segmentation thesaurus which supports hashing operation by means of the first Chinese character in a string and full binary search. Based on this thesaurus, we design a new algorithm for Chinese phrase segmentation.