Acta Biophysica Sinica (生物物理学报), 2005
Identification of 5'UTR Splice Sites in Human Genes Based on Support Vector Machines
Abstract:
Identifying splice sites in the non-coding regions of genes is one of the most challenging aspects of gene structure recognition, particularly for splice sites embedded in human 5' untranslated regions (5'UTRs). Unlike conventional splice site identification, there is no transition from coding to non-coding sequence within 5'UTRs, so conventional splice site prediction methods perform poorly there. In this paper, support vector machines (SVMs) were used to identify 5'UTR splice sites. To increase recognition accuracy, a matrix similarity measure was used as the criterion for parameter selection. This allowed appropriate parameters to be obtained quickly and simply, thereby improving identification performance. Experimental results showed that 5'UTR splice sites can be identified well by SVMs with parameters selected in this way.
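The abstract does not state which matrix similarity measure or kernel was used. Purely as an illustration of the general idea, the sketch below assumes kernel-target alignment (the normalized Frobenius inner product between the kernel matrix and the ideal label matrix) as the similarity measure, an RBF kernel, and a one-hot encoding of fixed-length sequence windows. The function names, toy sequences, and candidate values are all hypothetical and are not taken from the paper.

```python
import math

def one_hot(seq):
    """Encode a DNA window as a flat one-hot vector over (A, C, G, T)."""
    table = {"A": (1, 0, 0, 0), "C": (0, 1, 0, 0),
             "G": (0, 0, 1, 0), "T": (0, 0, 0, 1)}
    return [bit for base in seq for bit in table[base]]

def rbf_kernel_matrix(X, gamma):
    """Return K with K[i][j] = exp(-gamma * ||x_i - x_j||^2)."""
    def sqdist(a, b):
        return sum((p - q) ** 2 for p, q in zip(a, b))
    return [[math.exp(-gamma * sqdist(a, b)) for b in X] for a in X]

def alignment(K, y):
    """Kernel-target alignment: <K, yy^T>_F / (||K||_F * ||yy^T||_F).

    Assumes labels in {-1, +1}, so ||yy^T||_F equals len(y).
    """
    n = len(y)
    inner = sum(K[i][j] * y[i] * y[j] for i in range(n) for j in range(n))
    k_norm = math.sqrt(sum(v * v for row in K for v in row))
    return inner / (k_norm * n)

def select_gamma(X, y, candidates):
    """Pick the candidate gamma whose kernel matrix best matches the labels."""
    scores = {g: alignment(rbf_kernel_matrix(X, g), y) for g in candidates}
    best = max(scores, key=scores.get)
    return best, scores

# Toy placeholder windows (NOT real data): +1 = splice site, -1 = decoy.
windows = ["AGGTAAGT", "CAGGTGAG", "ACCTTGCA", "TTGCATCC"]
labels = [1, 1, -1, -1]
X = [one_hot(w) for w in windows]
best_gamma, scores = select_gamma(X, labels, [0.01, 0.1, 1.0])
```

Under these assumptions, the appeal of such a criterion is that each candidate parameter is scored from the kernel matrix alone, without training and cross-validating an SVM per candidate; the selected value would then be passed to an SVM trainer of the reader's choice.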