PLOS ONE  2012 

β-sheet Topology Prediction with High Precision and Recall for β and Mixed α/β Proteins

DOI: 10.1371/journal.pone.0032461


Abstract:

The prediction of the correct β-sheet topology for pure β and mixed α/β proteins is a critical intermediate step toward three-dimensional protein structure prediction. The predicted β-sheet topology provides distance constraints between sequentially separated residues, which reduces the three-dimensional search space for a protein structure prediction algorithm. Here, we present a novel mixed integer linear optimization based framework for the prediction of β-sheet topology in β and mixed α/β proteins. The objective is to maximize the total strand-to-strand contact potential of the protein. A large number of physical constraints are applied to provide biologically meaningful topology results. The formulation permits the creation of a rank-ordered list of preferred β-sheet arrangements. Finally, the generated topologies are re-ranked using a fully atomistic approach involving torsion angle dynamics and clustering. For a large, non-redundant data set of 2102 β and mixed α/β proteins with at least 3 strands taken from the PDB, the proposed approach provides the top 5 solutions with average precision and recall greater than 78%. Consistent results are obtained in the β-sheet topology prediction for blind targets provided during the CASP8 and CASP9 experiments, as well as for actual and predicted secondary structures. The β-sheet topology prediction algorithm, BeST, is available to the scientific community at http://selene.princeton.edu/BeST/.
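To make the objective and the evaluation metrics concrete, the following is a minimal Python sketch, not the BeST implementation. It replaces the mixed integer linear optimization with exhaustive enumeration, which is only feasible for a handful of strands. The pairing scores in `potential`, the single constraint that each strand has between one and `max_partners` partners, and the function names `rank_topologies` and `precision_recall` are all assumptions made for illustration. The abstract does not specify the actual contact potential or the physical constraints, and the atomistic re-ranking step is not modeled.

```python
from itertools import combinations


def rank_topologies(n_strands, potential, max_partners=2, top_k=5):
    """Return the top_k strand-pairing sets ranked by total contact potential.

    potential maps a strand pair (i, j), i < j, to a hypothetical score.
    Assumed constraint: every strand has at least one and at most
    max_partners pairing partners.
    """
    pairs = list(combinations(range(n_strands), 2))
    scored = []
    # Brute-force stand-in for the optimization: try every subset of pairs.
    for size in range(1, len(pairs) + 1):
        for subset in combinations(pairs, size):
            degree = [0] * n_strands
            for i, j in subset:
                degree[i] += 1
                degree[j] += 1
            if all(1 <= d <= max_partners for d in degree):
                score = sum(potential.get(p, 0.0) for p in subset)
                scored.append((score, subset))
    # Highest total potential first gives the rank-ordered list.
    scored.sort(key=lambda item: item[0], reverse=True)
    return scored[:top_k]


def precision_recall(predicted, actual):
    """Precision and recall of predicted strand pairs against the true pairs."""
    predicted, actual = set(predicted), set(actual)
    true_positives = len(predicted & actual)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Hypothetical scores for a 4-strand protein (illustrative values only).
potential = {
    (0, 1): 2.0, (1, 2): 1.5, (2, 3): 2.5,
    (0, 2): -1.0, (0, 3): -0.5, (1, 3): -0.5,
}
ranked = rank_topologies(4, potential)
best_score, best_pairs = ranked[0]

# Compare the top-ranked prediction with a made-up "true" topology.
true_pairs = {(0, 1), (1, 2), (0, 3)}
precision, recall = precision_recall(best_pairs, true_pairs)
print(best_pairs, best_score, precision, recall)
```

Here precision is the fraction of predicted strand pairs that appear in the true topology, and recall is the fraction of true pairs that were predicted. The enumeration grows exponentially with the number of strand pairs, so it is practical only for small toy cases like this one.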

