%0 Journal Article
%T Statistical Analysis of Terminal Extensions of Protein ¦Ā-Strand Pairs
%A Ning Zhang
%A Shan Gao
%A Lei Zhang
%A Jishou Ruan
%A Tao Zhang
%J Advances in Bioinformatics
%D 2013
%I Hindawi Publishing Corporation
%R 10.1155/2013/909436
%X The long-range interactions, required to the accurate predictions of tertiary structures of ¦Ā-sheet-containing proteins, are still difficult to simulate. To remedy this problem and to facilitate ¦Ā-sheet structure predictions, many efforts have been made by computational methods. However, known efforts on ¦Ā-sheets mainly focus on interresidue contacts or amino acid partners. In this study, to go one step further, we studied ¦Ā-sheets on the strand level, in which a statistical analysis was made on the terminal extensions of paired ¦Ā-strands. In most cases, the two paired ¦Ā-strands have different lengths, and terminal extensions exist. The terminal extensions are the extended part of the paired strands besides the common paired part. However, we found that the best pairing required a terminal alignment, and ¦Ā-strands tend to pair to make bigger common parts. As a result, 96.97%£æ of ¦Ā-strand pairs have a ratio of 25% of the paired common part to the whole length. Also 94.26% and 95.98%£æ of ¦Ā-strand pairs have a ratio of 40% of the paired common part to the length of the two ¦Ā-strands, respectively. Interstrand register predictions by searching interacting ¦Ā-strands from several alternative offsets should comply with this rule to reduce the computational searching space to improve the performances of algorithms. 1. Introduction The issue of protein structure prediction is still extremely challenging in bioinformatics [1, 2]. Usually, structural information for protein sequences with no detectable homology to a protein of known structure could be obtained by predicting the arrangement of their secondary structural elements [3]. As we know, the two predominant protein secondary structures are ¦Į-helices and ¦Ā-sheets. However, a combination of the early suitable ¦Į-helical model systems and sustained researches have resulted in a detailed understanding of ¦Į-helix, while comparatively little is known about ¦Ā-sheet [4]. Tertiary structures of ¦Ā-sheet-containing proteins are especially difficult to simulate [3, 5]. Unlike ¦Į-helices, ¦Ā-sheets are more complex resulting from a combination of two or more disjoint peptide segments, called ¦Ā-strands. Therefore, the ¦Ā-sheet topology is very useful for elucidating protein folding pathways [6, 7] for predicting tertiary structures [3, 8ØC11], and even for designing new proteins [12ØC14]. As fundamental components, ¦Ā-sheets are plentifully contained in protein domains. In a ¦Ā-sheet, multiple ¦Ā-strands held together linked by hydrogen bonds and can be classified into parallel and antiparallel direction styles. Adjacent
%U http://www.hindawi.com/journals/abi/2013/909436/