We revisit the protein secondary structure prediction problem using linear and backpropagation neural network architectures commonly applied in the literature. In this context, neural network mappings are constructed between protein training set sequences and their assigned structure classes in order to analyze the class membership of test data and associated measures of significance. We present numerical results demonstrating that classifier performance measures can vary significantly depending upon the classifier architecture and the structure class encoding technique. Furthermore, an analytic formulation is introduced in order to substantiate the observed numerical data. Finally, we analyze and discuss the ability of the neural network to accurately model fundamental attributes of protein secondary structure. 1. Introduction The protein secondary structure prediction problem can be phrased as a supervised pattern recognition problem [1–5] for which training data is readily available from reliable databases such as the Protein Data Bank (PDB) or CB513 [6]. Based upon training examples, subsequences derived from primary sequences are encoded based upon a discrete set of classes. For instance, three class encodings are commonly applied in the literature in order to numerically represent the secondary structure set (alpha helix , beta sheet , coil ) [7–11]. By applying a pattern recognition approach, subsequences of unknown classification can then be tested to determine the structure class to which they belong. Phrased in this way, backpropagation neural networks [7, 12–14], and variations on the neural network theme [8, 10, 11, 15–18] have been applied to the secondary structure prediction problem with varied success. Furthermore, many tools currently applying hybrid methodologies such as PredictProtein [19, 20], JPRED [8, 17, 21], SCRATCH [22, 23] and PSIPRED [24, 25] rely on the neural network paradigm as part of their prediction scheme. One of the main reasons for applying the neural network approach in the first place is that they tend be good universal approximators [26–30] and, theoretically, have the potential to create secondary structure models. In other words, after a given network architecture has been chosen and presented with a robust set of examples, the optimal parameters associated with the trained network, in principle, define an explicit function that can map a given protein sequence to its associated secondary structure. If the structure predicted by the network function is generally correct and consistent for an arbitrary input
References
[1]
C. M. Bishop, Pattern Recognition and Machine Learning, Springer, 2007.
[2]
M. T. Hagan, H. B. Demuth, and M. H. Beale, Neural Network Design, PWS Publishing, 1996.
[3]
M. N. Nguyen and J. C. Rajapakse, “Multi-class support vector machines for protein secondary structure prediction,” Genome Informatics, vol. 14, pp. 218–227, 2003.
[4]
H. J. Hu, Y. Pan, R. Harrison, and P. C. Tai, “Improved protein secondary structure prediction using support vector machine with a new encoding scheme and an advanced tertiary classifier,” IEEE Transactions on Nanobioscience, vol. 3, no. 4, pp. 265–271, 2004.
[5]
W. Zhong, G. Altun, X. Tian, R. Harrison, P. C. Tai, and Y. Pan, “Parallel protein secondary structure prediction schemes using Pthread and OpenMP over hyper-threading technology,” Journal of Supercomputing, vol. 41, no. 1, pp. 1–16, 2007.
[6]
J. A. Cuff and G. J. Barton, “Application of enhanced multiple sequence alignment profiles to improve protein secondary structure prediction,” Proteins, vol. 40, pp. 502–511, 2000.
[7]
J. M. Chandonia and M. Karplus, “Neural networks for secondary structure and structural class predictions,” Protein Science, vol. 4, no. 2, pp. 275–285, 1995.
[8]
J. A. Cuff and G. J. Barton, “Evaluation and improvement of multiple sequence methods for protein secondary structure prediction,” Proteins, vol. 34, pp. 508–519, 1999.
[9]
G. E. Crooks and S. E. Brenner, “Protein secondary structure: entropy, correlations and prediction,” Bioinformatics, vol. 20, no. 10, pp. 1603–1611, 2004.
[10]
L. H. Wang, J. Liu, and H. B. Zhou, “A comparison of two machine learning methods for protein secondary structure prediction,” in Proceedings of 2004 International Conference on Machine Learning and Cybernetics, pp. 2730–2735, chn, August 2004.
[11]
G. Z. Zhang, D. S. Huang, Y. P. Zhu, and Y. X. Li, “Improving protein secondary structure prediction by using the residue conformational classes,” Pattern Recognition Letters, vol. 26, no. 15, pp. 2346–2352, 2005.
[12]
N. Qian and T. J. Sejnowski, “Predicting the secondary structure of globular proteins using neural network models,” Journal of Molecular Biology, vol. 202, no. 4, pp. 865–884, 1988.
[13]
L. Howard Holley and M. Karplus, “Protein secondary structure prediction with a neural network,” Proceedings of the National Academy of Sciences of the United States of America, vol. 86, no. 1, pp. 152–156, 1989.
[14]
J. M. Chandonia and M. Karplus, “The importance of larger data sets for protein secondary structure prediction with neural networks,” Protein Science, vol. 5, no. 4, pp. 768–774, 1996.
[15]
B. Rost and C. Sander, “Improved prediction of protein secondary structure by use of sequence profiles and neural networks,” Proceedings of the National Academy of Sciences of the United States of America, vol. 90, no. 16, pp. 7558–7562, 1993.
[16]
B. Rost and C. Sander, “Prediction of protein secondary structure at better than 70% accuracy,” Journal of Molecular Biology, vol. 232, no. 2, pp. 584–599, 1993.
[17]
J. A. Cuff, M. E. Clamp, A. S. Siddiqui, M. Finlay, and G. J. Barton, “JPred: a consensus secondary structure prediction server,” Bioinformatics, vol. 14, no. 10, pp. 892–893, 1998.
[18]
S. Hua and Z. Sun, “A novel method of protein secondary structure prediction with high segment overlap measure: support vector machine approach,” Journal of Molecular Biology, vol. 308, no. 2, pp. 397–407, 2001.
[19]
B. Rost, “PHD: predicting one-dimensional protein structure by profile-based neural networks,” Methods in Enzymology, vol. 266, pp. 525–539, 1996.
[20]
B. Rost, G. Yachdav, and J. Liu, “The PredictProtein server,” Nucleic Acids Research, vol. 32, pp. W321–W326, 2004.
[21]
C. Cole, J. D. Barber, and G. J. Barton, “The Jpred 3 secondary structure prediction server,” Nucleic Acids Research, vol. 36, pp. W197–W201, 2008.
[22]
G. Pollastri, D. Przybylski, B. Rost, and P. Baldi, “Improving the prediction of protein secondary structure in three and eight classes using recurrent neural networks and profiles,” Proteins, vol. 47, no. 2, pp. 228–235, 2002.
[23]
J. Cheng, A. Z. Randall, M. J. Sweredoski, and P. Baldi, “SCRATCH: a protein structure and structural feature prediction server,” Nucleic Acids Research, vol. 33, no. 2, pp. W72–W76, 2005.
[24]
D. T. Jones, “Protein secondary structure prediction based on position-specific scoring matrices,” Journal of Molecular Biology, vol. 292, no. 2, pp. 195–202, 1999.
[25]
K. Bryson, L. J. McGuffin, R. L. Marsden, J. J. Ward, J. S. Sodhi, and D. T. Jones, “Protein structure prediction servers at University College London,” Nucleic Acids Research, vol. 33, no. 2, pp. W36–W38, 2005.
[26]
G. Cybenko, “Approximation by superpositions of a sigmoidal function,” Mathematics of Control, Signals, and Systems, vol. 2, no. 4, pp. 303–314, 1989.
[27]
J. Moody and C. J. Darken, “Fast learning in networks of locally tuned processing units,” Neural Computation, vol. 1, pp. 281–294, 1989.
[28]
T. Poggio and F. Girosi, “Networks for approximation and learning,” Proceedings of the IEEE, vol. 78, no. 9, pp. 1481–1497, 1990.
[29]
D. F. Specht, “Probabilistic neural networks,” Neural Networks, vol. 3, no. 1, pp. 109–118, 1990.
[30]
P. András, “Orthogonal RBF neural network approximation,” Neural Processing Letters, vol. 9, no. 2, pp. 141–151, 1999.
[31]
P. Baldi and S. Brunak, Bioinformatics: The Machine Learning Approach, MIT Press, 1998.
[32]
E. Sakk, D. J. Schneider, C. R. Myers, and S. W. Cartinhour, “On the selection of target vectors for a class of supervised pattern recognizers,” IEEE Transactions on Neural Networks, vol. 20, no. 5, pp. 745–757, 2009.
[33]
G. H. Golub and C. F. Van Loan, Matrix Computations, Johns Hopkins University Press, 1989.