Statistical analysis and molecular dynamics simulations of ambivalent α -helices
Department of Chemistry, University of Delhi, Delhi - 110007, India
BMC Bioinformatics 2010, 11:519 doi:10.1186/1471-2105-11-519Published: 18 October 2010
Analysis of known protein structures reveals that identical sequence fragments in proteins can adopt different secondary structure conformations. The extent of this conformational diversity is influenced by various factors like the intrinsic sequence propensity, sequence context and other environmental factors such as pH, site directed mutations or alteration of the binding ligands. Understanding the mechanism by which the environment affects the structural ambivalence of these peptides has potential implications for protein design and reliable local structure prediction algorithms. Identification of the structurally ambivalent sequence fragments and determining the rules which dictate their conformational preferences play an important role in understanding the conformational changes observed in misfolding diseases. However, a systematic classification of their intrinsic sequence patterns or a statistical analysis of their properties and sequence context in relation to the origin of their structural diversity have largely remained unexplored.
In this work, the conformational variability of α-helices is studied by mapping sequences from the non-redundant database to identical sequences across all classes of the SCOP (Structural Classification of Proteins) database. Some helices retain their conformations when mapped in the SCOP database while others exhibit a complete/partial switch to non-helical conformations. The results clearly depict the differences in the propensities of amino acids for the variable and conserved helices. Sequences flanking these ambivalent sequence fragments have anisotropic propensities at the N- and C-termini. This structural variability is depicted by molecular dynamics simulations in explicit solvent, which show that the short conserved helices retain their conformations while their longer counterparts fray into two or more shorter helices. Variable helices in the non-redundant database exhibit a trend of retaining helical conformations while their corresponding non-helical conformations in SCOP database show large deviations from their respective initial structures by adopting partial or full helical conformations. Partially ambivalent helices are also found to retain their respective conformations.
All sequence fragments which show structural diversity in different proteins of the non-redundant database are investigated. The final conformation of these ambivalent sequences are dictated by a fine tuning of their intrinsic sequence propensity and the anisotropic amino acid propensity of the flanking sequences. This analysis may unravel the connection between diverse secondary structures, which conserve the overall structural fold of the protein thus determining its function.