|
计算机应用研究 2005
Segmentation of Named Entity Based on Dynamic Programming
|
Abstract:
This paper puts forward an approach to segment Chinese name when is put together.It carries out a statistical analysis of a great number of most commonly used named entity and sets up a corpus of name according to the position and the frequency of the occurrences of the characters used in named entity, and also transforms the frequency of the occurrences into evaluation value. Based on the Chinese characteristics used in Chinese name, it changes the segmentation of Chinese name into decision process of optimality. Dynamic programming is used to choose the optimum method. AS an example, Chinese names are used in our experiments to show that this approach works at a fairly high speed and with a high precision. The precision amounts to over 99.66% when 10 names written together are tested. Moreover, the approach can be further used in Chinese words segmentation.