|
生物物理学报 2006
Discovery and applications of the association rules in protein sequences
|
Abstract:
In recent years,many complicated motif-discovering algorithms have been developed to find the motifs in protein sequences,but their results and running processes are hard to be interpreted.Association rules discovery is a simple method,but it has a theoretical background of probability.Applying the method,thousands of motifs have been found,which can be used in protein sequence analysis such as preserved site discovery and secondary structure prediction.A secondary structure rule library is constructed using the found association rules,and a simple secondary structure prediction algorithm is developed.It shows that nearly 81 percent of secondary structure can be implied by at least one rule.