|
Quantitative Biology 2005
New symmetry in nucleotide sequencesAbstract: Information valuable words are the strings with the significant deviation of real frequency from the expected one. The expected frequency is determined through the maximum entropy principle of the reconstructed (extended) frequency dictionary of strings composed from the shorter words. The information valuable words are found to be the complementary palindromes: they are read equally in opposite directions, if nucleotides are changed for the complementary ones (A <--> T; C <--> G) in one of them. Some properties of such symmetric words are discussed.
|