|
酶蛋白质中8类二级结构的识别
|
Abstract:
酶是一种具有催化功能的蛋白质,研究酶蛋白质中的二级结构对研究酶的结构及功能有重要作用。本文从酶蛋白质序列出发,以位点氨基酸和20种氨基酸n-gap 2肽组分为参数,首次将矩阵打分的方法用于酶蛋白质中8类二级结构的识别,预测总精度Q8最高达到61.4%。
Enzymes are a kind of protein that has catalytic function. The study of secondary structures in en-zymes plays an important role in the structure and function of enzymes. Based on enzyme protein sequence information, amino acids of sites and n-gap dipeptide composition of twenty amino acids were selected as parameters. Scoring matrix method was first applied to the identification of 8-state secondary structure in enzymes protein. The prediction accuracy of Q8 reached 61.4%.
[1] | Chandonia, J.M. and Karplus, M. (1996) The Importance of Larger Datasets for Protein Secondary Structure Prediction with Neural Network. Protein Science, 5, 768-774. https://doi.org/10.1002/pro.5560050422 |
[2] | Anders, K. and Lareo, R. (1994) Hidden Markov Models in Computational Biology Applications to Protein Modeling. Journal of Molecular Biology, 235, 1501-1531. https://doi.org/10.1006/jmbi.1994.1104 |
[3] | Asai, K., Hayamizu, S. and Hands, K. (1993) Prediction of Protein Secondary Structure by the Hidden Markov Model. Computer Applications in the Biosciences, 9, 141-146. https://doi.org/10.1093/bioinformatics/9.2.141 |
[4] | Goldman, N., Thorne, J.L. and Jones, D.T. (1996) Using Evolutionary Trees in Protein Secondary Structure Prediction and Other Comparative Sequence Analyses. Journal of Molecular Biology, 263, 196-208.
https://doi.org/10.1006/jmbi.1996.0569 |
[5] | Rost, B. and Sander, C. (1994) Combining Evolutionary Information and Neural Networks to Predict Protein Secondary Structure. Proteins, 19, 55-72. https://doi.org/10.1002/prot.340190108 |
[6] | Dor, O. and Zhou, Y.Q. (2007) Achieving 80% Ten-Fold Cross-Validated Accuracy for Secondary Structure Prediction by Large-Scale Training. Protein, 66, 838-845. https://doi.org/10.1002/prot.21298 |
[7] | Pollastri, G. and Mclysaght, A. (2005) Porter: A New, Accurate Server for Protein Secondary Structure Prediction. Bioinformatics, 21, 2. https://doi.org/10.1093/bioinformatics/bti203 |
[8] | Pollastri, G., Przybylski, D., Rost, B. and Baldi, P. (2002) Improving the Prediction of Protein Secondary Structure in Three and Eight Classes Using Recurrent Neural Networks and Profiles. Proteins-Structure Function and Genetics, 47, 228-235. https://doi.org/10.1002/prot.10082 |
[9] | Wang, Z.Y., Zhao, F., Peng, J. and Xu, J.B. (2011) Protein 8-Class Secondary Structure Prediction Using Conditional Neural Fields. Proteomics, 11, 3786-3792. https://doi.org/10.1002/pmic.201100196 |
[10] | Cong, P.S., Li, D.P., Wang, Z.H., Tang, S.N. and Li, T.H. (2013) SPSSM8: An Accurate Approach for Predicting Eight-State Secondary Structures of Proteins. Biochimie, 95, 2460-2464. https://doi.org/10.1016/j.biochi.2013.09.007 |
[11] | Yaseen, A. and Li, Y.H. (2014) Template-Based C8-SCORPION: A Protein 8-State Secondary Structure Prediction Method Using Structural Information and Context-Based Features. BMC Bioinformatics, 15, 204-218.
https://doi.org/10.1186/1471-2105-15-S8-S3 |
[12] | 王志强, 董彩华, 王延枝. 温度和底物对大豆液泡膜H+-ATPase二级结构的影响[J]. 生物物理学报, 2000, 16(3): 489-493. |
[13] | 王玮, 葛毅强, 陈颖, 徐幸莲, 周光宏. 超高压对高铁肌红蛋白还原酶活性及二级结构的影响[J]. 中国食品学报, 2015, 15(10): 134-140. |
[14] | Webb, E.C. (1992) Enzyme Nomenclature. Academic Press, San Diego. |
[15] | Kabsch, W. and Sander, C. (1983) Dictionary of Protein Secondary Structure: Pattern Recognition of Hydrogen Bonded and Geometrical Features. Biopolymers, 22, 2577-2637. https://doi.org/10.1002/bip.360221211 |
[16] | Cartharius, K., Frech, K., Grote, K., et al. (2005) Mat Inspector and Beyond: Promoter Analysis Based on Transcription Factor Binding Sites. Bioinformatics, 21, 2933-2942. https://doi.org/10.1093/bioinformatics/bti473 |
[17] | Quandt, K., Frech, K., Karas, H., et al. (1995) MatIand and Mat Inspector: New Fast and Versatile Tools for Deteciono Consensus Matches in Nucleotide Sequence Data. Nucleic Acids Research, 23, 4878-4884.
https://doi.org/10.1093/nar/23.23.4878 |
[18] | Kel, A.E., GoBling, E., Reuter, I., et al. (2003) MATCHTM: A Tool for Searching Transcription Factor Binding Sites in DNA Sequences. Nucleic Acids Research, 31, 3576-3579. https://doi.org/10.1093/nar/gkg585 |
[19] | Wasserman, W.W. and Sandelin, A. (2004) Applied Bioinformatics for the Identification of Regulatory Elements. Nature Reviews Genetics, 5, 276-287. https://doi.org/10.1038/nrg1315 |
[20] | Zhong, L. and Johnson, W.C. (1992) Environment Affects Amino Acid Preference for Secondary Structure. Proceedings of the National Academy of Sciences of the United States of America, 89, 4462-4465.
https://doi.org/10.1073/pnas.89.10.4462 |
[21] | Lakizadeh, A. and Marashi, S.A. (2009) Addition of Contact Number Information Can Improve Protein Secondary Structure Prediction by Neural Networks. EXCLI Journal, 8, 66-73. |
[22] | Macdonald, J.R. and Johnson, W.C. (2001) Environmental Features Are Important in Determining Protein Secondary Structure. Protein Science, 10, 1172-1177. https://doi.org/10.1110/ps.420101 |
[23] | Costantini, S., Colonna, G. and Facchiano, A.M. (2006) Amino Acid Propensities for Secondary Structures Are Influenced by the Protein Structural Class. Biochemical and Biophysical Research Communication, 342, 441-451.
https://doi.org/10.1016/j.bbrc.2006.01.159 |
[24] | Marash, S.A., Behrouzi, R. and Pezehk, H. (2007) Adaptation of Proteins to Different Environments: A Comparison of Proteome Structural Properties in Bacillus subtilis and Escherichia coli. Journal of Theoretical Biology, 244, 127-132.
https://doi.org/10.1016/j.jtbi.2006.07.021 |