|
计算机应用研究 2004
Design and Implementation of Theme-based Corpus System
|
Abstract:
This paper introduced the character and function of a theme- based Chinese corpus system,and put forward the design scheme and framework of the system.The corpus took the texts of "People's Daily Year 2001" as the raw material.With the corpus users can obtain KWIC concordance,wordlist,collocation analysis and theme words for specified topic, it provides researchers rich and real language environment for Web information mining and natural language processing studies.