%0 Journal Article
%T Test Data Sets and Evaluation of Gene Prediction Programs on the Rice Genome
%A Heng Li
%A Jin-Song Liu
%A Zhao Xu
%A Jiao Jin
%A Lin Fang
%A Lei Gao
%A Yu-Dong Li
%A Zi-Xing Xing
%A Shao-Gen Gao
%A Tao Liu
%A Hai-Hong Li
%A Yan Li
%A Li-Jun Fang
%A Hui-Min Xie
%A Wei-Mou Zheng
%A Bai-Lin Hao
%A
Heng Li
%A Lei Gao
%A Lin Fang
%A Tao Liu
%A Hai-Hong Li
%A Yan Li
%A Li-Jun Fang
%A Hui-Min Xie
%A Wei-Mou Zheng
%A Jin-Song Liu
%A Zhao XuJiao Jin
%A Yu-Dong Li
%A Zi-Xing Xing
%A Shao-Gen GaoBai-Lin Hao
%J 计算机科学技术学报
%D 2005
%I
%X With several rice genome projects approaching completion gene prediction/finding by computer algorithms has become an urgent task. Two test sets were constructed by mapping the newly published 28,469 full-length KOME rice cDNA to the RGP BAC clone sequences of Oryza sativa ssp. japonica: a single-gene set of 550 sequences and a multi-gene set of 62 sequences with 271 genes. These data sets were used to evaluate five ab initio gene prediction programs: RiceHMM, GlimmerR, GeneMark, FGENSH and BGF. The predictions were compared on nucleotide, exon and whole gene structure levels using commonly accepted measures and several new measures. The test results show a progress in performance in chronological order. At the same time complementarity of the programs hints on the possibility of further improvement and on the feasibility of reaching better performance by combining several gene-finders.
%K gene prediction
%K rice genome
%K test sets
%K accuracy measures
%K hidden Markov models
%K dynamic programming
程序设计
%K 计算机
%K SGP-2
%K SLAN
%U http://www.alljournals.cn/get_abstract_url.aspx?pcid=5B3AB970F71A803DEACDC0559115BFCF0A068CD97DD29835&cid=8240383F08CE46C8B05036380D75B607&jid=F57FEF5FAEE544283F43708D560ABF1B&aid=76369C04BB7864545B0A52D46FD16966&yid=2DD7160C83D0ACED&vid=A04140E723CB732E&iid=E158A972A605785F&sid=BD7D27247C63490C&eid=FB36B1C076A263FA&journal_id=1000-9000&journal_name=计算机科学技术学报&referenced_num=3&reference_num=35