|
计算机科学 2007
DnaReSM: A Multi-Supports-based DNA Repetitive Sequences Mining Algorithm
|
Abstract:
Research on DNA sequence analysis is one of important subjects in Bioinforrnatics. There exist a great numher of repeats in DNA sequences. Presently, most of them have largely unknown functions, but they have played important roles in genetics. Mining DNA repeats is a promising task. The methods of bottom-up pattern generation in mining sequential pattern, which produce considerable short patterns, will reduce efficiency. Furthermore, present algorithms based on the definition of single support are difficult to find DNA repeats, therefore a novel method which combined bottom-up with top-down named DnaReSM based on multiple supports framework is presented to mine DNA repetitive sequences. Our experimental results demonstrate that DnaReSM is efficient.