%0 Journal Article %T Exploration of multivariate analysis in microbial coding sequence modeling %A Tahir Mehmood %A Jon Bohlin %A Anja Bråthen Kristoffersen %A Solve Sæbø %A Jonas Warringer %A Lars Snipen %J BMC Bioinformatics %D 2012 %I BioMed Central %R 10.1186/1471-2105-13-97 %X The multivariate CPPLS approach classified coding sequence substantially better than the commonly used IMM on the same set of sequences. We also found that the use of CPPLS with codon representation gave significantly better classification results than both IMM with protein (p < 0.001) and with DNA (p < 0.001). Further, although the mean performance was similar, the variation of CPPLS performance on codon representation was significantly smaller than for IMM (p < 0.001). The performance of coding sequence modeling can be substantially improved by using an algorithm based on the multivariate CPPLS method applied to codon or DNA frequencies. %U http://www.biomedcentral.com/1471-2105/13/97/abstract