|
计算机应用研究 2011
Protein-protein interaction extraction method using co-training
|
Abstract:
Abstract: In order to solve the problem of lack of manually labeled samples, a semi-supervised co-training based method is proposed. After preprocessing, the bag of words based method and the pattern learning based method select different subset of features in samples and are incorporated into co-training. In the training stage, each method can utilize a small set of initial labeled samples and a large set of unlabeled samples to learn and the results of the other method to enlarge labeled sample set. Tested in the AIMED corpus, this method achieved F1 value of 63.9%.The comparative experimental results showed that the method outperforms supervised methods and can utilize unlabeled samples efficiently to be adaptive to the real extraction tasks.