|
自动化学报 2009
A Multi-level Disambiguation Framework for Gene Name Normalization
|
Abstract:
The flexible nomenclature of gene name results in severe semantic ambiguity, which is an obstacle for deep biomed-ical text mining. Gene name normalization (GN) is an effective way to resolve this problem. In this work, a multi-level disam-biguation framework was proposed to solve gene name normal-ization problem. Aiming at different ambiguity situations during the procedure of GN, three different strategies were included in the framework. They were dictionary-based gene name detec-tion, machine-learning-based candidate seleetion, and semantic-based disambiguation. Experimental results showed that the proposed method could achieve 0.746 F-measure on the BioCre-AtlvE2006 GN task test data set.