<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1757-5036-2-2</ui>
   <ji>1757-5036</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>An effective all-atom potential for proteins</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Irb&#228;ck</snm>
               <fnm>Anders</fnm>
               <insr iid="I1"/>
               <email>anders@thep.lu.se</email>
            </au>
            <au id="A2">
               <snm>Mitternacht</snm>
               <fnm>Simon</fnm>
               <insr iid="I1"/>
               <email>simon@thep.lu.se</email>
            </au>
            <au id="A3">
               <snm>Mohanty</snm>
               <fnm>Sandipan</fnm>
               <insr iid="I2"/>
               <email>s.mohanty@fz-juelich.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Computational Biology &amp; Biological Physics, Department of Theoretical Physics, Lund University, S&#246;lvegatan 14A, SE-223 62 Lund, Sweden</p>
            </ins>
            <ins id="I2">
               <p>J&#252;lich Supercomputing Centre, Institute for Advanced Simulation, Forschungszentrum J&#252;lich, D-52425 J&#252;lich, Germany</p>
            </ins>
         </insg>
         <source>PMC Biophysics</source>
         <issn>1757-5036</issn>
         <pubdate>2009</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>2</fpage>
         <url>http://www.physmathcentral.com/1757-5036/2/2</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1757-5036-2-2</pubid>
               <pubid idtype="pmpid">19356242</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>08</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>08</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Irb&#228;ck et al</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <p>We describe and test an implicit solvent all-atom potential for simulations of protein folding and aggregation. The potential is developed through studies of structural and thermodynamic properties of 17 peptides with diverse secondary structure. Results obtained using the final form of the potential are presented for all these peptides. The same model, with unchanged parameters, is furthermore applied to a heterodimeric coiled-coil system, a mixed <it>&#945;</it>/<it>&#946; </it>protein and a three-helix-bundle protein, with very good results. The computational efficiency of the potential makes it possible to investigate the free-energy landscape of these 49&#8211;67-residue systems with high statistical accuracy, using only modest computational resources by today's standards.</p>
            <p><b>PACS Codes</b>: 87.14.E-, 87.15.A-, 87.15.Cc</p>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>1 Introduction</p>
         </st>
         <p>A molecular understanding of living systems requires modeling of the dynamics and interactions of proteins. The relevant dynamics of a protein may amount to small fluctuations about its native structure, or reorientations of its ordered parts relative to each other. In either case, a tiny fraction of the conformational space is explored. For flexible proteins, perhaps with large intrinsically disordered parts <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>, the situation is different. When studying such proteins or conformational conversion processes like folding or amyloid aggregation, the competition between different minima on the free-energy landscape inevitably comes into focus. Studying these systems by computer simulation is a challenge, because proper sampling of all relevant free-energy minima must be ensured. This goal is very hard to achieve if explicit solvent molecules are included in the simulations. The use of coarse-grained models can alleviate this problem, but makes important geometric properties like secondary structure formation more difficult to describe.</p>
         <p>Here we present an implicit solvent all-atom protein model especially aimed at problems requiring exploration of the global free-energy landscape. It is based on a computationally convenient effective potential, with parameters determined through full-scale thermodynamic simulations of a set of experimentally well characterized peptides. Central to the approach is the use of a single set of model parameters, independent of the protein studied. This constraint is a simple but efficient way to avoid unphysical biases, for example, toward either <it>&#945;</it>-helical or <it>&#946;</it>-sheet structure <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Imposing this constraint is also a way to enable systematic refinement of the potential.</p>
         <p>An earlier version <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> of this potential has proven useful, for example, for studies of aggregation <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp> and mechanical unfolding <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. Also, using a slightly modified form of the potential <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, the folding mechanisms of a 49-residue protein, Top7-CFr, were investigated <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. Here we revise this potential, through studies of an enlarged set of 17 peptides (see Table <tblr tid="T1">1</tblr> and Fig. <figr fid="F1">1</figr>). We show that the model, in its final form, folds these different sequences to structures similar to their experimental structures, using a single set of potential parameters. The description of each peptide is kept brief, to be able to discuss all systems and thereby address the issue of transferability in a direct manner. The main purpose of this study is model development rather than detailed characterization of individual systems.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Amino acid sequences</p>
            </caption>
            <tblbdy cols="3">
               <r>
                  <c ca="left">
                     <p>System</p>
                  </c>
                  <c ca="left">
                     <p>PDB code</p>
                  </c>
                  <c ca="left">
                     <p>Sequence</p>
                  </c>
               </r>
               <r>
                  <c cspan="3">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Trp-cage</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1L2Y">1L2Y</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>NLYIQ WLKDG GPSSG RPPPS</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>E6apn1</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1RIJ">1RIJ</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>Ac-ALQEL LGQWL KDGGP SSGRP PPS-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>C</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Ac-KETAA AKFER AHA-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>EK</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Ac-YAEAA KAAEA AKAF-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>F<sub>s</sub></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Suc-AAAAA AAARA AAARA AAARA A-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>GCN4tp</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="2OVN">2OVN</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>NYHLE NEVAR LKKLV GE</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>HPLC-6</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1WFA">1WFA</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>DTASD AAAAA ALTAA NAKAA AELTA ANAAA AAAAT AR-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Chignolin</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1UAO">1UAO</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>GYDPE TGTWG</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MBH12</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1J4M">1J4M</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>RGKWT YNGIT YEGR</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>GB1p</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>GEWTY DDATK TFTVT E</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>GB1m2</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>GEWTY NPATG KFTVT E</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>GB1m3</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>KKWTY NPATG KFTVQ E</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>trpzip1</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1LE0">1LE0</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>SWTWE GNKWT WK-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>trpzip2</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1LE1">1LE1</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>SWTWE NGKWT WK-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>betanova</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>RGWSV QNGKY TNNGK TTEGR</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>LLM</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>RGWSL QNGKY TLNGK TMEGR</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>beta3s</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>TWIQN GSTKW YQNGS TKIYT</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>AB zipper</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1U2U">1U2U</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>Ac-EVAQL EKEVA QLEAE NYQLE QEVAQ LEHEG-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>Ac-EVQAL KKRVQ ALKAR NYALK QKVQA LRHKG-NH<sub>2</sub></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Top7-CFR</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="2GJH">2GJH</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>ERVRI SITAR TKKEA EKFAA ILIKV FAELG YNDIN VTWDG DTVTV EGQL</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>GS-<it>&#945;</it><sub>3 </sub>W</p>
                  </c>
                  <c ca="left">
                     <p>
                        <ext-link ext-link-type="pdb" ext-link-id="1LQ7">1LQ7</ext-link>
                     </p>
                  </c>
                  <c ca="left">
                     <p>GSRVK ALEEK VKALE EKVKA LGGGG RIEEL KKKWE ELKKK IEELG GGGEV KKVEE EVKKL EEEIK KL</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Suc stands for succinylic acid.</p>
            </tblfn>
         </tbl>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematic illustration of native geometries studied</p>
            </caption>
            <text>
               <p><b>Schematic illustration of native geometries studied</b>. (a) the Trp-cage, (b) an <it>&#945;</it>-helix, (c) a <it>&#946;</it>-hairpin, (d) a three-stranded <it>&#946;</it>-sheet, (e) an <it>&#945;</it>-helix dimer (1U2U), (f) a three-helix bundle (1LQ7), and (g) a mixed <it>&#945;</it>/<it>&#946; </it>protein (2GJH).</p>
            </text>
            <graphic file="1757-5036-2-2-1"/>
         </fig>
         <p>Whether or not this potential, calibrated using data on peptides with typically ~20 residues, will be useful for larger systems is not obvious. Therefore, we also apply our potential, with unchanged parameters, to three larger systems with different geometries. These systems are the mixed <it>&#945;</it>/<it>&#946; </it>protein Top7-CFr, a three-helix-bundle protein with 67 residues, and a heterodimeric leucine zipper composed of two 30-residue chains.</p>
         <p>Protein folding simulations are by necessity based on potentials whose terms are interdependent and dependent on the choice of geometric representation. Therefore, we choose to calibrate our potential directly against folding properties of whole chains. To make this feasible, we deliberately omit many details included in force fields like Amber, CHARMM and OPLS (for a review, see <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). With this approach, we might lose details of a given free-energy minimum, but, by construction, we optimize the balance between competing minima.</p>
         <p>Two potentials somewhat similar in form to ours are the <it>&#956;</it>-potential of the Shakhnovich group <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and the PFF potential of the Wenzel group <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. These groups also consider properties of entire chains for calibration, but use folded PDB structures or sets of decoys rather than full-scale thermodynamic simulations. Our admittedly time-consuming procedure implies that our model is trained on completely general structures, which might be an advantage when studying the dynamics of folding. Another potential with similarities to ours is that developed by the Dokholyan group for discrete molecular dynamics simulations <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>2 Methods</p>
         </st>
         <p>Our model belongs to the class of implicit solvent all-atom models with torsional degrees of freedom. All geometrical parameters, like bond lengths and bond angles, are as described earlier <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>The interaction potential is composed of four major terms:</p>
         <p>
            <display-formula id="M1">
               <graphic file="1757-5036-2-2-i1.gif"/>
            </display-formula>
         </p>
         <p>The first term, <it>E</it><sub>loc</sub>, contains local interactions between atoms separated by only a few covalent bonds. The other three terms are non-local in character: <it>E</it><sub>ev </sub>represents excluded-volume effects, <it>E</it><sub>hb </sub>is a hydrogen-bond potential, and <it>E</it><sub>sc </sub>contains residue-specific interactions between pairs of sidechains. Next we describe the precise form of these four terms. Energy parameters are given in a unit called eu. The factor for conversion from eu to kcal/mol will be determined in the next section, by calibration against the experimental melting temperature for one of the peptides studied, the Trp-cage.</p>
         <sec>
            <st>
               <p>2.1 Local potential</p>
            </st>
            <p>The local potential <inline-formula><graphic file="1757-5036-2-2-i2.gif"/></inline-formula> can be divided into two backbone terms, <inline-formula><graphic file="1757-5036-2-2-i3.gif"/></inline-formula> and <inline-formula><graphic file="1757-5036-2-2-i4.gif"/></inline-formula>, and one sidechain term, <inline-formula><graphic file="1757-5036-2-2-i5.gif"/></inline-formula>. In describing the potential, the concept of a peptide unit is useful. A peptide unit consists of the backbone C<it>'</it>O group of one residue and the backbone NH group of the next residue.</p>
            <p indent="3">&#8226; The potential <inline-formula><graphic file="1757-5036-2-2-i3.gif"/></inline-formula> represents interactions between partial charges of neighboring peptide units along the chain. It is given by</p>
            <p>
               <display-formula id="M2">
                  <graphic file="1757-5036-2-2-i6.gif"/>
               </display-formula>
            </p>
            <p indent="3">where the outer sum runs over all pairs of nearest-neighbor peptide units and each of the two inner sums runs over atoms in one peptide unit (if the N side of the peptide unit is proline the sum runs over only C<it>' </it>and O). The partial charge <it>q</it><sub><it>i </it></sub>is taken as &#177; 0.42 for C<it>' </it>and O atoms and &#177; 0.20 for H and N atoms. The parameter <inline-formula><graphic file="1757-5036-2-2-i7.gif"/></inline-formula> is set to 6 eu, corresponding to a dielectric constant of <it>&#1013;</it><sub>r </sub>&#8776; 41. Two peptide units that are not nearest neighbors along the chain interact through hydrogen bonding (see below) rather than through the potential <inline-formula><graphic file="1757-5036-2-2-i3.gif"/></inline-formula>.</p>
            <p indent="3">&#8226; The term <inline-formula><graphic file="1757-5036-2-2-i4.gif"/></inline-formula> provides an additional OO and HH repulsion for neighboring peptide units, unless the residue flanked by the two peptide units is a glycine. This repulsion is added to make doubling of hydrogen bonds less likely. Glycine has markedly different backbone energetics compared to other residues. The lack of C<sub><it>&#946; </it></sub>atom makes glycine more flexible. However, the observed distribution of Ramachandran <it>&#981;</it>, <it>&#968; </it>angles for glycine in PDB structures <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> is not as broad as simple steric considerations would suggest. <inline-formula><graphic file="1757-5036-2-2-i4.gif"/></inline-formula> provides an energy penalty for glycine <it>&#968; </it>values around &#177; 120<sup>&#176;</sup>, which are sterically allowed but relatively rare in PDB structures.</p>
            <p indent="3">The full expression for <inline-formula><graphic file="1757-5036-2-2-i4.gif"/></inline-formula> is</p>
            <p>
               <display-formula id="M3">
                  <graphic file="1757-5036-2-2-i8.gif"/>
               </display-formula>
            </p>
            <p indent="3">where <inline-formula><graphic file="1757-5036-2-2-i9.gif"/></inline-formula> = 1.2 eu, <inline-formula><graphic file="1757-5036-2-2-i10.gif"/></inline-formula> = -0.15 eu, <it>I </it>is a residue index, and</p>
            <p>
               <display-formula id="M4">
                  <graphic file="1757-5036-2-2-i11.gif"/>
               </display-formula>
            </p>
            <p>
               <display-formula id="M5">
                  <graphic file="1757-5036-2-2-i12.gif"/>
               </display-formula>
            </p>
            <p>
               <display-formula id="M6">
                  <graphic file="1757-5036-2-2-i13.gif"/>
               </display-formula>
            </p>
            <p indent="3">The function <it>f</it>(<it>u</it><sub><it>I</it></sub>) is positive if the H<sub><it>I </it></sub>H<sub><it>I</it>+1 </sub>distance, <it>d</it>(H<sub><it>I</it></sub>, H<sub><it>I</it>+1</sub>), is smaller than both of the H<sub><it>I </it></sub>N<sub><it>I</it>+1 </sub>and N<sub><it>I </it></sub>H<sub><it>I</it>+1 </sub>distances, and zero otherwise. This term thus provides an energy penalty when H<sub><it>I </it></sub>and H<sub><it>I</it>+1 </sub>are exposed to each other (it is omitted if residue <it>I </it>or <it>I </it>+ 1 is a proline). Similarly, <it>f</it>(<it>v</it><sub><it>I</it></sub>) is positive when O<sub><it>I </it></sub>and O<sub><it>I</it>+1 </sub>are exposed to each other.</p>
            <p indent="3">&#8226; <inline-formula><graphic file="1757-5036-2-2-i5.gif"/></inline-formula> is an explicit torsion angle potential for sidechain angles, <it>&#967;</it><sub><it>i</it></sub>. Many sidechain angles display distributions resembling what one would expect based on simple steric considerations. The use of the torsion potential is particularly relevant for <it>&#967;</it><sub>2 </sub>in asparagine and aspartic acid and <it>&#967;</it><sub>3 </sub>in glutamine and glutamic acid. The torsion potential is defined as</p>
            <p>
               <display-formula id="M7">
                  <graphic file="1757-5036-2-2-i14.gif"/>
               </display-formula>
            </p>
            <p indent="3">where <inline-formula><graphic file="1757-5036-2-2-i15.gif"/></inline-formula> and <it>n</it><sub><it>i </it></sub>are constants. Each sidechain angle <it>&#967;</it><sub><it>i </it></sub>belongs to one of four classes associated with different values of <inline-formula><graphic file="1757-5036-2-2-i15.gif"/></inline-formula> and <it>n</it><sub><it>i </it></sub>(see Table <tblr tid="T2">2</tblr>).</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Classification of sidechain angles, <it>&#967;</it><sub><it>i</it></sub></p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Residue</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>&#967;</it>
                           <sub>1</sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>&#967;</it>
                           <sub>2</sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>&#967;</it>
                           <sub>3</sub>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>&#967;</it>
                           <sub>4</sub>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ser, Cys, Thr, Val</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ile, Leu</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Asp, Asn</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>IV</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>His, Phe, Tyr, Trp</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>III</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Met</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>II</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Glu, Gln</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>IV</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lys</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arg</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>I</p>
                     </c>
                     <c ca="center">
                        <p>III</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The parameters of the torsion angle potential <inline-formula><graphic file="1757-5036-2-2-i5.gif"/></inline-formula> are (<inline-formula><graphic file="1757-5036-2-2-i15.gif"/></inline-formula>, <it>n</it><sub><it>i</it></sub>) = (0.6 eu, 3) for class I, (<inline-formula><graphic file="1757-5036-2-2-i15.gif"/></inline-formula>, <it>n</it><sub><it>i</it></sub>) = (0.3 eu, 3) for class II, (<inline-formula><graphic file="1757-5036-2-2-i15.gif"/></inline-formula>, <it>n</it><sub><it>i</it></sub>) = (0.4 eu, 2) for class III, and (<inline-formula><graphic file="1757-5036-2-2-i15.gif"/></inline-formula>, <it>n</it><sub><it>i</it></sub>) = (-0.4 eu, 2) for class IV.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>2.2 Excluded volume</p>
            </st>
            <p>Excluded-volume effects are modeled using the potential</p>
            <p>
               <display-formula id="M8">
                  <graphic file="1757-5036-2-2-i16.gif"/>
               </display-formula>
            </p>
            <p>where the summation is over all pairs of atoms with a non-constant separation, <it>&#954;</it><sub>ev </sub>= 0.10 eu, and <it>&#963;</it><sub><it>i </it></sub>= 1.77, 1.75, 1.53, 1.42 and 1.00 &#197; for S, C, N, O and H atoms, respectively. The parameter <it>&#955;</it><sub><it>ij </it></sub>is unity for pairs connected by three covalent bonds and <it>&#955;</it><sub><it>ij </it></sub>= 0.75 for all other pairs. To speed up the calculations, <it>E</it><sub>ev </sub>is evaluated using a cutoff of 4.3 <it>&#955;</it><sub><it>ij </it></sub>&#197;.</p>
         </sec>
         <sec>
            <st>
               <p>2.3 Hydrogen bonding</p>
            </st>
            <p>Our potential contains an explicit hydrogen-bond term, <it>E</it><sub>hb</sub>. All hydrogen bonds in the model are between NH and CO groups. They connect either two backbone groups or a charged sidechain (aspartic acid, glutamic acid, lysine, arginine) with a backbone group. Two neighboring peptide units, which interact through the local potential (see above), are not allowed to hydrogen bond with each other.</p>
            <p>The form of the hydrogen-bond potential is</p>
            <p>
               <display-formula id="M9">
                  <graphic file="1757-5036-2-2-i17.gif"/>
               </display-formula>
            </p>
            <p>where <inline-formula><graphic file="1757-5036-2-2-i18.gif"/></inline-formula> = 3.0 eu and <inline-formula><graphic file="1757-5036-2-2-i19.gif"/></inline-formula> = 2.3 eu set the strengths of backbone-backbone and sidechain-backbone bonds, respectively, <it>r</it><sub><it>ij </it></sub>is the HO distance, <it>&#945;</it><sub><it>ij </it></sub>is the NHO angle, and <it>&#946;</it><sub><it>ij </it></sub>is the HOC angle. The functions <it>u</it>(<it>r</it>) and <it>v</it>(<it>&#945;</it>, <it>&#946;</it>) are given by</p>
            <p>
               <display-formula id="M10">
                  <graphic file="1757-5036-2-2-i20.gif"/>
               </display-formula>
            </p>
            <p>
               <display-formula id="M11">
                  <graphic file="1757-5036-2-2-i21.gif"/>
               </display-formula>
            </p>
            <p>where <it>&#963;</it><sub>hb </sub>= 2.0 &#197;. A 4.5 &#197; cutoff is used for <it>u</it>(<it>r</it>).</p>
         </sec>
         <sec>
            <st>
               <p>2.4 Sidechain potential</p>
            </st>
            <p>Our sidechain potential is composed of two terms, <it>E</it><sub>sc </sub>= <it>E</it><sub>hp </sub>+ <it>E</it><sub>ch</sub>. The <it>E</it><sub>ch </sub>term represents interactions among sidechain charges. The first and more important term, <it>E</it><sub>hp</sub>, is meant to capture the effects of all other relevant interactions, especially effective hydrophobic attraction. For convenience, <it>E</it><sub>hp </sub>and <it>E</it><sub>ch </sub>have a similar form,</p>
            <p>
               <display-formula id="M12">
                  <graphic file="1757-5036-2-2-i22.gif"/>
               </display-formula>
            </p>
            <p>Here the sums run over residue pairs <it>IJ</it>, <inline-formula><graphic file="1757-5036-2-2-i23.gif"/></inline-formula> and <inline-formula><graphic file="1757-5036-2-2-i24.gif"/></inline-formula> are contact measures that take values between 0 and 1, and <inline-formula><graphic file="1757-5036-2-2-i25.gif"/></inline-formula> and <inline-formula><graphic file="1757-5036-2-2-i26.gif"/></inline-formula> are energy parameters.</p>
            <p>It is assumed that ten of the twenty natural amino acids contribute to <it>E</it><sub>hp</sub>, see Table <tblr tid="T3">3</tblr>. Included among these ten are lysine and arginine, which are charged but have large hydrophobic parts. To reduce the number of parameters, the hydrophobic contact energies are taken to be additive, <inline-formula><graphic file="1757-5036-2-2-i25.gif"/></inline-formula> = <it>m</it><sub><it>I </it></sub>+ <it>m</it><sub><it>J </it></sub>. It is known that the statistically derived Miyazawa-Jernigan contact matrix <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> can be approximately decomposed this way <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The <it>m</it><sub><it>I </it></sub>parameters can be found in Table <tblr tid="T3">3</tblr>. <inline-formula><graphic file="1757-5036-2-2-i25.gif"/></inline-formula> is set to 0 if residues <it>I </it>and <it>J </it>are nearest neighbors along the chain, and is reduced by a factor 2 for next-nearest neighbors.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>The parameter <it>m</it><sub><it>I </it></sub>of the hydrophobicity potential <it>E</it><sub>hp</sub></p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Residue</p>
                     </c>
                     <c ca="center">
                        <p><it>m</it><sub><it>I </it></sub>(eu)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arg</p>
                     </c>
                     <c ca="center">
                        <p>0.3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Met, Lys</p>
                     </c>
                     <c ca="center">
                        <p>0.4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Val</p>
                     </c>
                     <c ca="center">
                        <p>0.6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ile, Leu, Pro</p>
                     </c>
                     <c ca="center">
                        <p>0.8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tyr</p>
                     </c>
                     <c ca="center">
                        <p>1.1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Phe, Trp</p>
                     </c>
                     <c ca="center">
                        <p>1.6</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The residues taken as charged are aspartic acid, glutamic acid, lysine and arginine. The charge-charge contact energy is &#8211; <inline-formula><graphic file="1757-5036-2-2-i26.gif"/></inline-formula> = 1.5<it>s</it><sub><it>I </it></sub><it>s</it><sub><it>J </it></sub>eu, where <it>s</it><sub><it>I </it></sub>and <it>s</it><sub><it>J </it></sub>are the signs of the charges (&#177; 1).</p>
            <p>The contact measure <inline-formula><graphic file="1757-5036-2-2-i23.gif"/></inline-formula> is calculated using a predetermined set of atoms for each amino acid, denoted by <inline-formula><graphic file="1757-5036-2-2-i27.gif"/></inline-formula> (see Table <tblr tid="T4">4</tblr>). Let <it>n</it><sub><it>I </it></sub>be the number of atoms in <inline-formula><graphic file="1757-5036-2-2-i27.gif"/></inline-formula> and let</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Atoms used in the calculation of the contact measure <inline-formula><graphic file="1757-5036-2-2-i23.gif"/></inline-formula></p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Residue</p>
                     </c>
                     <c ca="left">
                        <p>Set of atoms (<it>A</it><sub><it>I</it></sub>)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pro</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it></sub>, C<sub><it>&#948;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Tyr</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#947;</it></sub>, C<sub><it>&#948;</it>1</sub>, C<sub><it>&#948;</it>2</sub>, C<sub><it>&#1013;</it>1</sub>, C<sub><it>&#1013;</it>2</sub>, C<sub><it>&#950;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Val</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it>1</sub>, C<sub><it>&#947;</it>2</sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ile</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it>1</sub>, C<sub><it>&#947;</it>2</sub>, C<sub><it>&#948;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Leu</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it></sub>, C<sub><it>&#948;</it>1</sub>, C<sub><it>&#948;</it>2</sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Met</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it></sub>, S<sub><it>&#948;</it></sub>, C<sub><it>&#1013;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Phe</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#947;</it></sub>, C<sub><it>&#948;</it>1</sub>, C<sub><it>&#948;</it>2</sub>, C<sub><it>&#1013;</it>1</sub>, C<sub><it>&#1013;</it>2</sub>, C<sub><it>&#950;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trp</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#947;</it></sub>, C<sub><it>&#948;</it>1</sub>, C<sub><it>&#948;</it>2</sub>, C<sub><it>&#1013;</it>3</sub>, C<sub><it>&#950;</it>3</sub>, C<sub><it>&#951;</it>2</sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arg</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lys</p>
                     </c>
                     <c ca="left">
                        <p>C<sub><it>&#946;</it></sub>, C<sub><it>&#947;</it></sub>, C<sub><it>&#948;</it></sub></p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>
               <display-formula id="M13">
                  <graphic file="1757-5036-2-2-i28.gif"/>
               </display-formula>
            </p>
            <p>where <it>g</it>(<it>x</it>) is unity for <it>x </it>&lt; (3.7 &#197;)<sup>2</sup>, vanishes for <it>x </it>> (4.5 &#197;)<sup>2</sup>, and varies linearly for intermediate <it>x</it>. The contact measure can then be written as</p>
            <p>
               <display-formula id="M14">
                  <graphic file="1757-5036-2-2-i29.gif"/>
               </display-formula>
            </p>
            <p>where <it>&#947;</it><sub><it>IJ </it></sub>is either 1 or 0.75. For <it>&#947;</it><sub><it>IJ </it></sub>= 1, <inline-formula><graphic file="1757-5036-2-2-i23.gif"/></inline-formula> is, roughly speaking, the fraction of atoms in <inline-formula><graphic file="1757-5036-2-2-i27.gif"/></inline-formula> and <inline-formula><graphic file="1757-5036-2-2-i30.gif"/></inline-formula> that are in contact with some atom from the other of the two sets. A reduction to <it>&#947;</it><sub><it>IJ </it></sub>= 0.75 makes it easier to achieve a full contact (<inline-formula><graphic file="1757-5036-2-2-i23.gif"/></inline-formula> = 1). The value <it>&#947;</it><sub><it>IJ </it></sub>= 0.75 is used for interactions within the group proline, phenylalanine, tyrosine and tryptophan, to make face-to-face stacking of these sidechains less likely. It is also used within the group isoleucine, leucine and valine, because a full contact is otherwise hard to achieve for these pairs. In all other cases, <it>&#947;</it><sub><it>IJ </it></sub>is unity.</p>
            <p>The definition of <inline-formula><graphic file="1757-5036-2-2-i24.gif"/></inline-formula> is similar. The <it>&#947;</it><sub><it>IJ </it></sub>parameter is unity for charge-charge interactions, and the sets of atoms used, <inline-formula><graphic file="1757-5036-2-2-i31.gif"/></inline-formula>, can be found in Table <tblr tid="T5">5</tblr>.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Atoms used in the calculation of the contact measure <inline-formula><graphic file="1757-5036-2-2-i24.gif"/></inline-formula></p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Residue</p>
                     </c>
                     <c ca="left">
                        <p>Set of atoms (<it>A</it><sub><it>I</it></sub>)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Arg</p>
                     </c>
                     <c ca="left">
                        <p>N<sub><it>&#1013;</it></sub>, C<sub><it>&#950;</it></sub>, N<sub><it>&#951;</it>1</sub>, N<sub><it>&#951;</it>2</sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Lys</p>
                     </c>
                     <c ca="left">
                        <p><sup>1</sup>H<sub><it>&#950;</it></sub>, <sup>2</sup>H<sub><it>&#950;</it></sub>, <sup>3</sup>H<sub><it>&#950;</it></sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Asp</p>
                     </c>
                     <c ca="left">
                        <p>O<sub><it>&#948;</it>1</sub>, O<sub><it>&#948;</it>2</sub></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Glu</p>
                     </c>
                     <c ca="left">
                        <p>O<sub><it>&#1013;</it>1</sub>, O<sub><it>&#1013;</it>2</sub></p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>2.5 Chain ends</p>
            </st>
            <p>Some of the sequences we study have extra groups attached at one or both ends of the chain. The groups occurring are N-terminal acetyl and succinylic acid, and C-terminal NH<sub>2</sub>. When such a unit is present, the model assumes polar NH and CO groups beyond the last C<sub><it>&#945; </it></sub>atom to hydrogen bond like backbone NH/CO groups but with the strength reduced by a factor 2 (multiplicatively). The charged group of succinylic acid interacts like a charged sidechain.</p>
            <p>In the absence of end groups, the model assumes the N and C termini to be positively and negatively charged, respectively, and to interact like charged sidechains.</p>
         </sec>
         <sec>
            <st>
               <p>2.6 Monte Carlo details</p>
            </st>
            <p>We investigate the folding thermodynamics of this model by Monte Carlo (MC) methods. The simulations are done using either simulated tempering (ST) <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp> or parallel tempering/replica exchange (PT) <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, both with temperature as a dynamical variable. For small systems we use ST, with seven geometrically distributed temperatures in the range 279 K&#8211;367 K. For each system, ten independent ST runs are performed. For our largest systems we use PT with a set of sixteen temperatures, spanning the same interval. Using fourfold multiplexing <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, one run comprising 64 parallel trajectories is performed for each system. The PT temperature distribution is determined by an optimization procedure <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The length of our different simulations can be found in Table <tblr tid="T6">6</tblr>.</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Algorithm used and total number of elementary MC steps for all systems studied</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>System</p>
                     </c>
                     <c ca="left">
                        <p>Method</p>
                     </c>
                     <c ca="left">
                        <p>MC steps</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trp-cage, E6apn1</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 1.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>C, EK, F<sub>s</sub>, GCN4tp</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 1.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HPLC-6</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 3.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chignolin</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 0.5 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MBH12</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 1.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GB1p</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 2.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GB1m2, GB1m3</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 1.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Trpzip1, trpzip2</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 1.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>betanova, LLM</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 1.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>beta3s</p>
                     </c>
                     <c ca="left">
                        <p>ST</p>
                     </c>
                     <c ca="left">
                        <p>10 &#215; 2.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AB zipper</p>
                     </c>
                     <c ca="left">
                        <p>PT</p>
                     </c>
                     <c ca="left">
                        <p>64 &#215; 3.0 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Top7-CFR</p>
                     </c>
                     <c ca="left">
                        <p>PT</p>
                     </c>
                     <c ca="left">
                        <p>64 &#215; 2.4 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GS-<it>&#945;</it><sub>3 </sub>W</p>
                     </c>
                     <c ca="left">
                        <p>PT</p>
                     </c>
                     <c ca="left">
                        <p>64 &#215; 3.5 &#215; 10<sup>9</sup></p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Three different conformational updates are used in the simulations: single variable updates of sidechain and backbone angles, respectively, and Biased Gaussian Steps (BGS) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The BGS move is semi-local and updates up to eight consecutive backbone degrees of freedom in a manner that keeps the ends of the segment approximately fixed. The ratio of sidechain to backbone updates is the same at all temperatures, whereas the relative frequency of the two backbone updates depends on the temperature. At high temperatures the single variable update is the only backbone update used, and at low temperatures only BGS is used. At intermediate temperatures both updates are used.</p>
            <p>The AB zipper, a two-chain system, is studied using a periodic box of size (158 &#197;)<sup>3</sup>. In addition to the conformational updates described above, the simulations of this system used rigid body translations and rotations of individual chains.</p>
            <p>Our simulations are performed using the open source C++-package PROFASI <abbrgrp><abbr bid="B28">28</abbr></abbrgrp><url>http://cbbp.thep.lu.se/activities/profasi/</url>. Future public releases of PROFASI will include an implementation of the force field described here. While this force field has been implemented in PROFASI in an optimized manner, this optimization does not involve a parallel evaluation of the potential on many processors. Therefore, in our simulations the number of processors used is the same as the number of MC trajectories generated. For a typical small peptide, a trajectory of the length as given in Table <tblr tid="T6">6</tblr> takes ~18 hours to generate on an AMD Opteron processor with ~2.0 GHz clock rate. For the largest system studied, GS-<it>&#945;</it><sub>3 </sub>W, the simulations, with a proportionately larger number of MC updates, take ~10 days to complete.</p>
         </sec>
         <sec>
            <st>
               <p>2.7 Analysis</p>
            </st>
            <p>In our simulations, we monitor a variety of different properties. Three important observables are as follows.</p>
            <p indent="1">1. <it>&#945;</it>-helix content, <it>h</it>. A residue is defined as helical if its Ramachandran angle pair is in the region -90&#176; &lt;<it>&#981; </it>&lt; -30&#176;, -77&#176; &lt;<it>&#968; </it>&lt; -17&#176;. Following <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, a stretch of <it>n </it>> 2 helical residues is said to form a helical segment of length <it>n </it>- 2. For an end residue that is not followed by an extra end group, the (<it>&#981;</it>, <it>&#968;</it>) pair is poorly defined. Thus, for a chain with <it>N </it>residues, the maximum length of a helical segment is <it>N </it>- 4, <it>N </it>- 3 or <it>N </it>- 2, depending on whether there are zero, one or two end groups. The <it>&#945;</it>-helix content <it>h </it>is defined as the total length of all helical segments divided by this maximum length.</p>
            <p indent="1">2. Root-mean-square deviation from a folded reference structure, bRMSD/RMSD/pRMSD. bRMSD is calculated over backbone atoms, whereas RMSD is calculated over all heavy atoms. All residues except the two end residues are included in the calculation, unless otherwise stated. For the case of the dimeric AB zipper, the periodic box used for the simulations has to be taken into account. The two chains in the simulation might superficially appear to be far away when they are in fact close, because of periodicity. For this case we evaluate backbone RMSD over atoms taken from both chains in the dimer, and minimize this value with respect to periodic translations. We denote this as pRMSD.</p>
            <p indent="1">3. Nativeness measure based on hydrogen bonds, <it>q</it><sub>hb</sub>. This observable has the value 1 if at most two native backbone-backbone hydrogen bonds are missing, and is 0 otherwise. A hydrogen bond is considered formed if its energy is less than -1.03 eu.</p>
            <p>In many cases, it turns out that the temperature dependence of our results can be approximately described in terms of the simple two-state model</p>
            <p>
               <display-formula id="M15">
                  <graphic file="1757-5036-2-2-i32.gif"/>
               </display-formula>
            </p>
            <p>where <it>X</it>(<it>T</it>) is the quantity studied, <it>X</it><sub>1 </sub>and <it>X</it><sub>2 </sub>are the values of <it>X </it>in the two states, and <it>K</it>(<it>T</it>) is the effective equilibrium constant (<it>R </it>is the gas constant). In this first-order form, <it>K</it>(<it>T</it>) contains two parameters: the melting temperature <it>T</it><sub>m </sub>and the energy difference &#916;<it>E</it>. The parameters <it>T</it><sub>m</sub>, &#916;<it>E</it>, <it>X</it><sub>1 </sub>and <it>X</it><sub>2 </sub>are determined by fitting to data.</p>
            <p>Thermal averages and their statistical errors are calculated by using the jackknife method <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, after discarding the first 20% of each MC trajectory for thermalization.</p>
            <p>Figures of 3D structures were prepared using PyMOL <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>3 Results</p>
         </st>
         <p>We study a total of 20 peptide/protein systems, listed in Table <tblr tid="T1">1</tblr> (amino acid sequences can be found in this table). Among these, there are 17 smaller systems with 10&#8211;37 residues and 3 larger ones with &#8805; 49 residues. Many of the smaller systems have been simulated by other groups, in some cases with explicit water (for a review, see <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>). Two of the three larger systems, as far as we know, have not been studied using other force fields. A study of the 67-residue three-helix-bundle protein GS-<it>&#945;</it><sub>3 </sub>W using the ECEPP/3 force field was recently reported <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. The simulations presented here use the same geometric representation and find about a hundred times the number of independent folding events, while consuming much smaller computing resources.</p>
         <sec>
            <st>
               <p>3.1 Trp-cage and E6apn1</p>
            </st>
            <p>The Trp-cage is a designed 20-residue miniprotein with a compact helical structure <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Its NMR-derived native structure (see Fig. <figr fid="F1">1</figr>) contains an <it>&#945;</it>-helix and a single turn of 3<sub>10</sub>-helix <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The E6apn1 peptide was designed using the Trp-cage motif as a scaffold, to inhibit the E6 protein of papillomavirus <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. E6apn1 is three residues larger than the Trp-cage but has a similar structure, except that the <it>&#945;</it>-helix is slightly longer <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
            <p>As indicated earlier, we use melting data for the Trp-cage to set the energy scale of the model. For this peptide, several experiments found a similar melting temperature, <it>T</it><sub>m </sub>~315 K <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp>. In our model, the heat capacity of the Trp-cage displays a maximum at <it>RT </it>= 0.4722 &#177; 0.0008 eu. Our energy unit eu is converted to kcal/mol by setting this temperature equal to the experimental melting temperature (315 K). Having done that, there is no free parameter left in the model. Other systems are thus studied without tuning any model parameter. For E6apn1, the experimental melting temperature is <it>T</it><sub>m </sub>~305 K <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
            <p>Fig. <figr fid="F2">2a</figr> shows the helix content <it>h </it>against temperature for the Trp-cage and E6apn1, as obtained from our simulations. In both cases, the <it>T </it>dependence is well described by the simple two-state model of Eq. 15. The fitted melting temperatures are <it>T</it><sub>m </sub>= 309.6 &#177; 0.7 K and <it>T</it><sub>m </sub>= 304.0 &#177; 0.5 K for the Trp-cage and E6apn1, respectively. This <it>T</it><sub>m </sub>value for the Trp-cage is slightly lower than that we obtain from heat capacity data, 315 K. A fit to our data for the hydrophobicity energy <it>E</it><sub>hp </sub>(not shown) gives instead a slightly larger <it>T</it><sub>m</sub>, 321.1 &#177; 0.8 K. This probe dependence of <it>T</it><sub>m </sub>implies an uncertainty in the determination of the energy scale. By using the Trp-cage, this uncertainty is kept small (~2%). For many other peptides, the spread in <it>T</it><sub>m </sub>is much larger (see below).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The Trp-cage and E6apn1</p>
               </caption>
               <text>
                  <p><b>The Trp-cage and E6apn1</b>. (a) Helix content <it>h </it>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 309.6 &#177; 0.7 K and &#916;<it>E </it>= 11.3 &#177; 0.3 kcal/mol for the Trp-cage; <it>T</it><sub>m </sub>= 304.0 &#177; 0.5 K and &#916;<it>E </it>= 14.2 &#177; 0.3 kcal/mol for E6apn1). (b) Free energy <it>F </it>calculated as a function of bRMSD at two different temperatures, 279 K (solid lines) and 306 K (dashed lines). The double lines indicate the statistical errors.</p>
               </text>
               <graphic file="1757-5036-2-2-2"/>
            </fig>
            <p>Fig. <figr fid="F2">2b</figr> shows the free energy calculated as a function of bRMSD for the Trp-cage and E6apn1 at two different temperatures. The first temperature, 279 K, is well below <it>T</it><sub>m</sub>. Here native-like conformations dominate and the global free-energy minima are at 2.4 &#197; and 2.0 &#197; for the Trp-cage and E6apn1, respectively. At the second temperature, 306 K, the minima are shifted to higher bRMSD. Note that these free-energy profiles, taken near <it>T</it><sub>m</sub>, show no sign of a double-well structure. Hence, these peptides do not show a genuine two-state behavior in our simulations, even though the melting curves (Fig. <figr fid="F2">2a</figr>) are well described by a two-state model, as are many experimentally observed melting curves.</p>
         </sec>
         <sec>
            <st>
               <p>3.2 The <it>&#945;</it>-helices C, EK, F<sub>s</sub>, GCN4tp and HPLC-6</p>
            </st>
            <p>Our next five sequences form <it>&#945;</it>-helices. Among these, there are large differences in helix stability, according to CD studies. The least stable are the C <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> and EK <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> peptides, which are only partially stable at <it>T </it>~273 K. The original C peptide is a 13-residue fragment of ribonuclease A, but the C peptide here is an analogue with two alanine substitutions and a slightly increased helix stability <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. The EK peptide is a designed alanine-based peptide with 14 residues.</p>
            <p>Our third <it>&#945;</it>-helix peptide is the 21-residue F<sub>s </sub><abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, which is also alanine-based. F<sub>s </sub>is more stable than C and EK <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>, with estimated <it>T</it><sub>m </sub>values of 308 K <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and 303 K <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> from CD studies and 334 K from an IR study <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Even more stable is HPLC-6, a winter flounder antifreeze peptide with 37 residues. CD data suggest that the helix content of HPLC-6 remains non-negligible, ~0.10, at temperatures as high as ~343 K <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Our fifth helix-forming sequence, which we call GCN4tp, has 17 residues and is taken from a study of GCN4 coiled-coil formation <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Its melting behavior has not been studied, as far as we know, but its structure was characterized by NMR <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
            <p>These five peptides are indeed <it>&#945;</it>-helical in our model. At 279 K, the calculated helix content <it>h </it>is 0.28 for the C peptide, 0.47 for the EK peptide, and > 0.60 for the other three peptides. Fig. <figr fid="F3">3</figr> shows the temperature dependence of <it>h</it>. By fitting Eq. 15 to the data for the three stable sequences, we find melting temperatures of 298.9 &#177; 0.1 K, 309.2 &#177; 0.3 K and 323.3 &#177; 1.2 K for GNC4tp, F<sub>s </sub>and HPLC-6, respectively.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>The C, EK, F<sub>s</sub>, GCN4tp and HPLC-6 peptides</p>
               </caption>
               <text>
                  <p><b>The C, EK, F<sub>s</sub>, GCN4tp and HPLC-6 peptides</b>. Helix content <it>h </it>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 276.3 &#177; 2.4 K and &#916;<it>E </it>= 11.7 &#177; 0.4 kcal/mol for C; <it>T</it><sub>m </sub>= 293.9 &#177; 0.4 K and &#916;<it>E </it>= 12.6 &#177; 0.2 kcal/mol for EK; <it>T</it><sub>m </sub>= 309.2 &#177; 0.3 K and &#916;<it>E </it>= 18.7 &#177; 0.4 kcal/mol for F<sub>s</sub>; <it>T</it><sub>m </sub>= 298.9 &#177; 0.1 K and &#916;<it>E </it>= 14.1 &#177; 0.1 kcal/mol for GCN4tp; <it>T</it><sub>m </sub>= 323.3 &#177; 1.2 K and &#916;<it>E </it>= 23.6 &#177; 2.2 kcal/mol for HPLC-6).</p>
               </text>
               <graphic file="1757-5036-2-2-3"/>
            </fig>
            <p>For the four peptides whose melting behavior has been studied experimentally, these results are in good agreement with experimental data. In particular, we find that HPLC-6 indeed is more stable than F<sub>s </sub>in the model, which in turn is more stable than both C and EK. The model thus captures the stability order among these peptides.</p>
         </sec>
         <sec>
            <st>
               <p>3.3 The <it>&#946;</it>-hairpins chignolin and MBH12</p>
            </st>
            <p>We now turn to <it>&#946;</it>-sheet peptides and begin with the <it>&#946;</it>-hairpins chignolin <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> and MBH12 <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> with 10 and 14 residues, respectively. Both are designed and have been characterized by NMR. For chignolin, <it>T</it><sub>m </sub>values in the range 311&#8211;315 K were reported <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, based on CD and NMR. We are not aware of any melting data for MBH12.</p>
            <p>Fig. <figr fid="F4">4</figr> shows the temperature dependence of the hydrophobicity energy <it>E</it><sub>hp </sub>and the nativeness parameter <it>q</it><sub>hb </sub>for these peptides. By fitting to <it>E</it><sub>hp </sub>data, we obtain <it>T</it><sub>m </sub>= 311.0 &#177; 0.5 K and <it>T</it><sub>m </sub>= 315.4 &#177; 1.3 K for chignolin and MBH12, respectively. Using <it>q</it><sub>hb </sub>data instead, we find <it>T</it><sub>m </sub>= 305.4 &#177; 0.5 K for chignolin and <it>T</it><sub>m </sub>= 309.2 &#177; 0.7 K for MBH12. These <it>T</it><sub>m </sub>values show a significant but relatively weak probe dependence. The values for chignolin can be compared with experimental data, and the agreement is good.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Chignolin and MBH12</p>
               </caption>
               <text>
                  <p><b>Chignolin and MBH12</b>. (a) Hydrophobicity energy <it>E</it><sub>hp </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 311.0 &#177; 0.5 K and &#916;<it>E </it>= 9.6 &#177; 0.2 kcal/mol for chignolin; <it>T</it><sub>m </sub>= 315.4 &#177; 1.3 K and &#916;<it>E </it>= 9.9 &#177; 0.9 kcal/mol for MBH12). (b) Nativeness <it>q</it><sub>hb </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 305.4 &#177; 0.5 K and &#916;<it>E </it>= 10.4 &#177; 0.1 kcal/mol for chignolin; <it>T</it><sub>m </sub>= 309.2 &#177; 0.7 K and &#916;<it>E </it>= 13.5 &#177; 0.2 kcal/mol for MBH12).</p>
               </text>
               <graphic file="1757-5036-2-2-4"/>
            </fig>
            <p>Because these peptides have only four native hydrogen bonds each, one may question our definition of <it>q</it><sub>hb </sub>(see Methods), which takes a conformation as native-like (<it>q</it><sub>hb </sub>= 1) even if two hydrogen bonds are missing. Therefore, we repeated the analysis using the stricter criterion that native-like conformations (<it>q</it><sub>hb </sub>= 1) may lack at most one hydrogen bond. The resulting decrease in native population, as measured by the average <it>q</it><sub>hb</sub>, was ~0.1 or smaller at all temperatures. Even with this stricter definition, we find native populations well above 0.5 at low temperatures for both peptides.</p>
         </sec>
         <sec>
            <st>
               <p>3.4 The <it>&#946;</it>-hairpins GB1p, GB1m2 and GB1m3</p>
            </st>
            <p>GB1p is the second <it>&#946;</it>-hairpin of the B1 domain of protein G (residues 41&#8211;56). Its folded population has been estimated by CD/NMR to be 0.42 at 278 K <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> and ~0.30 at 298 K <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, whereas a Trp fluorescence study found a <it>T</it><sub>m </sub>of 297 K <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, corresponding to a somewhat higher folded population. GB1m2 and GB1m3 are two mutants of GB1p with significantly enhanced stability <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. At 298 K, the folded population was found to be 0.74 &#177; 0.05 for GB1m2 and 0.86 &#177; 0.03 for GB1m3, based on CD and NMR measurements <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. It was further estimated that <it>T</it><sub>m </sub>= 320 &#177; 2 K for GB1m2 and <it>T</it><sub>m </sub>= 333 &#177; 2 K for GB1m3 <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
            <p>All these three peptides are believed to adopt a structure similar to that GB1p has as part of the protein G B1 domain (PDB code <ext-link ext-link-type="pdb" ext-link-id="1GB1">1GB1</ext-link>). This part of the full protein contains seven backbone-backbone hydrogen bonds. These hydrogen bonds are the ones we consider when evaluating <it>q</it><sub>hb </sub>for these peptides.</p>
            <p>Fig. <figr fid="F5">5</figr> shows the observables <it>E</it><sub>hp </sub>and <it>q</it><sub>hb </sub>against temperature for these peptides. Fits to the data give <it>E</it><sub>hp</sub>-based <it>T</it><sub>m </sub>values of 301.7 &#177; 3.3 K, 324.4 &#177; 1.1 K and 331.4 &#177; 0.7 K for GB1p, GB1m2 and GB1m3, respectively, and <it>q</it><sub>hb</sub>-based <it>T</it><sub>m </sub>values of 307.5 &#177; 0.5 K and 313.9 &#177; 1.4 K for GB1m2 and GB1m3, respectively. The <it>q</it><sub>hb </sub>data do not permit a reliable fit for the less stable GB1p. At 298 K, we find <it>q</it><sub>hb</sub>-based folded populations of 0.20, 0.64 and 0.74 for GB1p, GB1m2 and GB1m3, respectively, which can be compared with the above-mentioned experimental results (0.30, 0.74 and 0.86).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>GB1p, GB1m2 and GB1m3</p>
               </caption>
               <text>
                  <p><b>GB1p, GB1m2 and GB1m3</b>. (a) Hydrophobicity energy <it>E</it><sub>hp </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 301.7 &#177; 3.3 K and &#916;<it>E </it>= 11.3 &#177; 1.1 kcal/mol for GB1p; <it>T</it><sub>m </sub>= 324.4 &#177; 1.4 K and &#916;<it>E </it>= 13.2 &#177; 1.0 kcal/mol for GB1m2; <it>T</it><sub>m </sub>= 331.4 &#177; 0.7 K and &#916;<it>E </it>= 14.8 &#177; 0.5 kcal/mol for GB1m3). (b) Nativeness <it>q</it><sub>hb </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 307.5 &#177; 0.5 K and &#916;<it>E </it>= 20.7 &#177; 0.5 kcal/mol for GB1m2; <it>T</it><sub>m </sub>= 313.9 &#177; 1.4 K and &#916;<it>E </it>= 21.4 &#177; 1.1 kcal/mol for GB1m3).</p>
               </text>
               <graphic file="1757-5036-2-2-5"/>
            </fig>
            <p>These results show that, in the model, the apparent folded populations of these peptides depend quite strongly on the observable studied. Our <it>E</it><sub>hp</sub>-based results agree quite well with experimental data, especially for GB1m2 and GB1m3, whereas our <it>q</it><sub>hb </sub>results consistently give lower folded populations for all peptides. The stability order is the same independent of which of the two observables we study, namely GB1p &lt; GB1m2 &lt; GB1m3, which is the experimentally observed order.</p>
            <p>The stability difference between GB1m2 and GB1m3 is mainly due to charge-charge interactions. In our previous model <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, these interactions were ignored, and both peptides had similar stabilities. The present model splits this degeneracy. Moreover, the magnitude of the splitting, which sensitively depends on the strength of the charge-charge interactions, is consistent with experimental data.</p>
         </sec>
         <sec>
            <st>
               <p>3.5 The <it>&#946;</it>-hairpins trpzip1 and trpzip2</p>
            </st>
            <p>The 12-residue trpzip1 and trpzip2 are designed <it>&#946;</it>-hairpins, each containing two tryptophans per <it>&#946;</it>-strand <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. The only difference between the two sequences is a transposition of an aspargine and a glycine in the hairpin turn. CD measurements suggest that trpzip1 and trpzip2 are remarkably stable for their size, with <it>T</it><sub>m </sub>values of 323 K and 345 K, respectively <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. A complementary trpzip2 study, using both experimental and computational methods, found <it>T</it><sub>m </sub>values to be strongly probe-dependent <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
            <p>Fig. <figr fid="F6">6</figr> shows our melting curves for these peptides, based on the observables <it>E</it><sub>hp </sub>and <it>q</it><sub>hb</sub>. The <it>E</it><sub>hp</sub>-based <it>T</it><sub>m </sub>values are 319.7 &#177; 0.2 K and 327.1 &#177; 0.8 K for trpzip1 and trpzip2, respectively. Using <it>q</it><sub>hb </sub>data instead, we find <it>T</it><sub>m </sub>= 303.2 &#177; 1.1 K for trpzip1 and <it>T</it><sub>m </sub>= 305.0 &#177; 1.1 K for trpzip2.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Trpzip1 and trpzip2</p>
               </caption>
               <text>
                  <p><b>Trpzip1 and trpzip2</b>. (a) Hydrophobicity energy <it>E</it><sub>hp </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 319.7 &#177; 0.2 K and &#916;<it>E </it>= 7.9 &#177; 0.1 kcal/mol for trpzip1; <it>T</it><sub>m </sub>= 327.1 &#177; 0.8 K and &#916;<it>E </it>= 8.3 &#177; 0.4 kcal/mol for trpzip2). (b) Nativeness <it>q</it><sub>hb </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 303.2 &#177; 1.8 K and &#916;<it>E </it>= 14.1 &#177; 0.5 kcal/mol for trpzip1; <it>T</it><sub>m </sub>= 305.0 &#177; 1.1 K and &#916;<it>E </it>= 12.6 &#177; 0.3 kcal/mol for trpzip2).</p>
               </text>
               <graphic file="1757-5036-2-2-6"/>
            </fig>
            <p>Like for the other <it>&#946;</it>-hairpins discussed earlier, our <it>q</it><sub>hb</sub>-based folded populations are low compared to estimates based on CD data, whereas those based on <it>E</it><sub>hp </sub>are much closer to experimental data. For trpzip2, the agreement is not perfect but acceptable, given that <it>T</it><sub>m </sub>has been found to be strongly probe-dependent for this peptide <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>3.6 Three-stranded <it>&#946;</it>-sheets: betanova, LLM and beta3s</p>
            </st>
            <p>Betanova <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, the betanova triple mutant LLM <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> and beta3s <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> are designed 20-residue peptides forming three-stranded <it>&#946;</it>-sheets. All the three peptides are marginally stable. NMR studies suggest that the folded population at 283 K is 0.09 for betanova <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, 0.36 for LLM <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, and 0.13&#8211;0.31 for beta3s <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>.</p>
            <p>Fig. <figr fid="F7">7</figr> shows our <it>E</it><sub>hp </sub>and <it>q</it><sub>hb </sub>data for these peptides. From the <it>q</it><sub>hb </sub>data, <it>T</it><sub>m </sub>values cannot be extracted, because the stability of the peptides is too low. At 283 K, the <it>q</it><sub>hb</sub>-based folded populations are 0.08, 0.47, 0.28 for betanova, LLM and beta3s, respectively, in good agreement with the experimental results. Fits to <it>E</it><sub>hp </sub>data can be performed. The obtained <it>T</it><sub>m </sub>values are 318.8 &#177; 2.5 K, 305.6 &#177; 1.7 K and 295.7 &#177; 3.1 K for betanova, LLM and beta3s, respectively.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Betanova, LLM and beta3s</p>
               </caption>
               <text>
                  <p><b>Betanova, LLM and beta3s</b>. (a) Hydrophobicity energy <it>E</it><sub>hp </sub>against temperature. The lines are two-state fits (<it>T</it><sub>m </sub>= 318.8 &#177; 2.5 K and &#916;<it>E </it>= 13.3 &#177; 2.1 kcal/mol for betanova; <it>T</it><sub>m </sub>= 305.6 &#177; 1.7 K and &#916;<it>E </it>= 13.4 &#177; 1.0 kcal/mol for LLM; <it>T</it><sub>m </sub>= 295.7 &#177; 3.1 K and &#916;<it>E </it>= 9.7 &#177; 0.5 kcal/mol for beta3s). (b) Nativeness <it>q</it><sub>hb </sub>against temperature. Two-state fits were not possible.</p>
               </text>
               <graphic file="1757-5036-2-2-7"/>
            </fig>
            <p>These <it>E</it><sub>hp</sub>-based <it>T</it><sub>m </sub>values are high compared to the experimentally determined folded populations, especially for betanova. Note that betanova has a very low hydrophobicity. The correlation between <it>E</it><sub>hp </sub>and folding status is therefore likely to be weak for this peptide.</p>
            <p>In contrast to the <it>E</it><sub>hp</sub>-based folded populations, those based on <it>q</it><sub>hb </sub>agree quite well with experimental data. In this respect, the situation is the opposite to what we found for the <it>&#946;</it>-hairpins studied above. A possible reason for this difference is discussed below.</p>
         </sec>
         <sec>
            <st>
               <p>3.7 AB zipper</p>
            </st>
            <p>The AB zipper is a designed heterodimeric leucine zipper, composed of an acidic A chain and a basic B chain, each with 30 residues <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. The dimer structure has been characterized by NMR, and a melting temperature of ~340 K was estimated by CD measurements (at neutral pH) <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
            <p>The lowest energy state seen in our simulations is a conformation in which pRMSD calculated over backbone atoms of all residues in both chains is ~2.7 &#197;. In this structure, the bRMSD (all residues) of the individual chains A and B to their counterparts in the PDB structure are ~2.5 &#197; and ~2.4 &#197;, respectively. Unlike for the other systems described in this article, the boundary conditions have a non-trivial role for this dimeric system. A proper discussion of periodicity, concentration and temperature dependence of this system is beyond the scope of this article. In Fig. <figr fid="F8">8a</figr>, we show the energy landscape, i.e., the mean energy as a function of two order parameters for this system. The X-axis shows the measure pRMSD described earlier. The Y-axis represents the sum of the backbone RMSD of the individual chains. pRMSD can be very large even if the sum of bRMSDs is small: the two chains can be folded without making the proper interchain contacts. Indeed, the figure shows that the major energy gradients are along the Y-axis, showing that it is energetically favorable for both chains to fold to their respective helical states. The correct dimeric native state is energetically more favorable by ~20 kcal/mol compared to two folded helices without proper interchain contacts. This is seen more clearly in Fig. <figr fid="F8">8b</figr>, where we plot the average energy as a function of pRMSD for states with two folded chains. We also simulated the two chains A and B of the dimer in isolation. Both chains folded to their native helical conformations. The melting temperatures estimated based on helix content for chains A and B are 314 K and 313 K, respectively. As indicated above, for the dimer, thermodynamic parameters like <it>T</it><sub>m </sub>cannot be directly estimated from the present simulations.</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>The heterodimeric AB zipper</p>
               </caption>
               <text>
                  <p><b>The heterodimeric AB zipper</b>. (a) Mean energy as a function of pRMSD over both chains and the sum of individual bRMSDs. The direction of the energy gradients implies that a system with two folded monomers is energetically favorable compared to unfolded monomers. The proper dimeric form is the area closest to the origin, and has a lower energy. (b) Mean energy of all states in which both chains have bRMSD &lt; 5 &#197;, shown as a function of the dimer RMSD measure pRMSD.</p>
               </text>
               <graphic file="1757-5036-2-2-8"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>3.8 Top7-CFr</p>
            </st>
            <p>Top7-CFr, the C-terminal fragment of the designed 93-residue <it>&#945;</it>/<it>&#946;</it>-protein Top7 <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, is the most complex of all molecules studied here. It has both <it>&#945;</it>-helix and <it>&#946;</it>-strand secondary structure elements, and highly non-local hydrogen bonds between the N- and C-terminal strands. CFr is known to form extremely stable homodimers, which retain their secondary structure till very high temperatures like 371 K and high concentrations of denaturants <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>.</p>
            <p>In <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>, an earlier version of our model was used to study the folding of CFr. The simulations pointed to an unexpected folding mechanism. The N-terminal strand initially folds as a non-native continuation of the adjoining <it>&#945;</it>-helix. After the other secondary structure elements form and diffuse to an approximately correct tertiary organization, the non-native extension of the helix unfolds and frees the N-terminal residues. These residues then attach to an existing <it>&#946;</it>-hairpin to complete the three-stranded <it>&#946;</it>-sheet of the native structure. Premature fastening of the chain ends in <it>&#946;</it>-sheet contacts puts the molecule in a deep local energy minimum, in which the folding and proper arrangement of the other secondary structure elements is hampered by large steric barriers. The above "caching" mechanism, spontaneously emerging in the simulations, accelerates folding by helping the molecule avoid such local minima.</p>
            <p>The folding properties of CFr, including the above mentioned caching mechanism, are preserved under the current modifications of the interaction potential. The centre of the native free-energy minimum shifts from bRMSD (all residues) of 1.7 &#197; as reported in <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> to about 2.2 &#197;. This state remains the minimum energy state, although the new energy function changes the energy ordering of the other low energy states. The runs made for this study (see Table <tblr tid="T6">6</tblr>) found 22 independent folding events. The free-energy landscape observed in the simulations is rather complex with a plethora of deep local minima sharing one or more secondary structure elements with the native structure. They differ in the registry and ordering of strands and the length of the helix. Longer runs are required for the MC simulations to correctly weight these different minima. Temperature dependence of the properties of CFr can therefore not be reliably obtained from these runs.</p>
            <p>We note that the simulations ran on twice as many processors but were only about one sixth the length of those used for <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, in which 15 independent folding events were found. The improved efficiency is partly due to the changes in the energy function presented here, and partly due to the optimization of the parallel tempering described in <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>3.9 GS-<it>&#945;</it><sub>3 </sub>W</p>
            </st>
            <p>GS-<it>&#945;</it><sub>3 </sub>W is a designed three-helix-bundle protein with 67 residues <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, whose structure was characterized by NMR <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. The stability was estimated to be 4.6 kcal/mol in aqueous solution at 298 K, based on CD data <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>.</p>
            <p>It turns out that this protein is very easy to fold with our model. Our results are based on extensive sampling of the conformation space with 64 &#215; 3.5 &#215; 10<sup>9 </sup>Monte Carlo updates, resulting in about 800 independent folding events to the native state. For this estimate, structures with bRMSD (all residues) under 5 &#197; were taken to be in the native minimum (see Fig. <figr fid="F9">9</figr> for justification). Two visits to the native state were considered statistically independent (i) if they occurred in independent Markov chains, or (ii) if the two visits to the native state were separated by at least one visit to the highest temperature in the simulation. For the entire run, we spent about 10 days of computing time on 64 AMD Opteron processors running at 2.0 GHz.</p>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>The three-helix-bundle protein GS-<it>&#945;</it><sub>3 </sub>W</p>
               </caption>
               <text>
                  <p><b>The three-helix-bundle protein GS-<it>&#945;</it><sub>3 </sub>W</b>. (a) Variation of histogram of bRMSD with temperature. At high temperatures, there is a broad distribution of bRMSD with values > 10 &#197;. At lower temperatures there are three clearly separated clusters. Representative structures from these clusters are also shown (color) aligned with the native structure (gray). (b) Temperature dependence of specific heat, <it>C</it><sub><it>v</it></sub>, and the ratio <it>h</it><sub><it>r </it></sub>of the observed helix content and the helix content of the native structure.</p>
               </text>
               <graphic file="1757-5036-2-2-9"/>
            </fig>
            <p>In Fig. <figr fid="F9">9a</figr>, we show how the probabilities for structures with different bRMSD vary with temperature in the simulations. Clearly, the protein makes a transition from a rather continuous distribution of bRMSD at high temperatures to a distribution dominated by three well separated clusters. Analysis of the structures at the lower temperatures shows that all three free-energy minima consist almost exclusively of structures with all three helices of GS-<it>&#945;</it><sub>3 </sub>W formed. The plot of the ratio of the observed helix content and the helix content of the native state, shown in Fig. <figr fid="F9">9b</figr>, further supports this idea. The average value of this ratio approaches 1 as the temperature decreases below 300 K. The specific heat curve, also shown in Fig. <figr fid="F9">9b</figr>, indicates that the formation of these structures correlates with the steepest change in energy.</p>
            <p>The cluster with a center at bRMSD ~3 &#197; dominates at the lowest temperatures. The structures contributing to the cluster with ~8&#8211;9 &#197; bRMSD superficially look like well folded three-helix bundles. But as illustrated in the figure, the arrangement of the helices is topologically distinct from the native arrangement. The cluster seen at larger bRMSD values is broader and consists of a host of structures in which two of the helices make a helical hairpin, but the third helix is not bound to it. The unbound helix could be at either side of the chain.</p>
            <p>According to our model therefore, the population at the lowest temperatures consists of ~80% genuinely native structures, ~10% three-helix bundles with wrong topology, and ~10% other structures with as much helix content as the native state. In order to experimentally determine the true folded population of the protein, the experimental probe must be able to distinguish the native fold from the other helix rich structures described here.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>4 Discussion</p>
         </st>
         <p>The model presented here is intrinsically fast compared to many other all-atom models, because all interactions are short range. By exploiting this property and using efficient MC techniques, it is possible to achieve a high sampling efficiency. We could, for example, generate more than 800 independent folding events for the 67-residue GS-<it>&#945;</it><sub>3 </sub>W. The speed of the simulations thus permits statistically accurate studies of the global free-energy landscape of peptides and small proteins.</p>
         <p>In developing this potential, a set of 17 peptides with 10&#8211;37 residues was studied. The peptides were added to this set one at a time. To fold a new sequence sometimes required fine-tuning of the potential, sometimes not. A change was accepted only after testing the new potential on all previous sequences in the set. In its final form, the model folds all 17 sequences to structures similar to their experimental structures, for one and the same choice of potential parameters.</p>
         <p>Also important is the stability of the peptides. A small polypeptide chain is unlikely to be a clear two-state folder, and therefore its apparent folded population will generally depend on the observable studied. For <it>&#946;</it>-sheet peptides, we used the hydrophobicity energy <it>E</it><sub>hp </sub>and the hydrogen bond-based nativeness measure <it>q</it><sub>hb </sub>to monitor the melting behavior. The extracted <it>T</it><sub>m </sub>values indeed showed a clear probe dependence; the <it>E</it><sub>hp</sub>-based value was always larger than that based on <it>q</it><sub>hb</sub>. For the <it>&#946;</it>-hairpins studied, we found a good overall agreement between our <it>E</it><sub>hp</sub>-based results and experimental data. For the three-stranded <it>&#946;</it>-sheets, instead, the <it>q</it><sub>hb </sub>results agreed best with experimental data. The reason for this difference is unclear. One contributing factor could be that interactions between aromatic residues play a more important role for the <it>&#946;</it>-hairpins studied here than for the three-stranded <it>&#946;</it>-sheets. These interactions may influence spectroscopic signals and are part of <it>E</it><sub>hp</sub>. Probe-dependent <it>T</it><sub>m </sub>values have also been obtained experimentally, for example, for trpzip2 <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>.</p>
         <p>The probe dependence makes the comparison with experimental data less straightforward. Nevertheless, the results presented clearly show that the model captures many experimentally observed stability differences. In particular, among related peptides, the calculated order of increasing thermal stability generally agrees with the experimental order, independent of which of our observables we use.</p>
         <p>It is encouraging that the model is able to fold these 17 sequences. However, there is no existing model that will fold all peptides, and our model is no exception. Two sequences that we unsuccessfully tried to fold are the <it>&#946;</it>-hairpins trpzip4 and U<sub>16</sub>, both with 16 residues. Trpzip4 is a triple mutant of GB1p with four tryptophans <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. For trpzip4, our minimum energy state actually corresponded to the NMR-derived native state <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, but the population of this state remained low at the lowest temperature studied (~14% at 279 K, as opposed to an estimated <it>T</it><sub>m </sub>of 343 K in experiments <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>). U<sub>16 </sub>is derived from the N-terminal <it>&#946;</it>-hairpin of ubiquitin <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. It has a shortened turn and has been found to form a <it>&#946;</it>-hairpin with non-native registry <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. In our simulations, this state was only weakly populated (~8% at 279 K, as opposed to an estimated ~80% at 288 K <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>). Instead, the main free-energy minima corresponded to the two <it>&#946;</it>-hairpin states with the registry of native ubiquitin, one with native hydrogen bonds and the other with the complementary set of hydrogen bonds.</p>
         <p>Our calibration of the potential relies on experimental data with non-negligible uncertainties, on a limited number of peptides. It is not evident that this potential will be useful for larger polypeptide chains. Therefore, as a proof-of-principle test, we also studied three larger systems, with very good results. Our simulations showed that, without having to adjust any parameter, the model folds these sequences to structures consistent with experimental data. Having verified this, it would be interesting to use the model to investigate the mechanisms by which these systems self-assemble, but such an analysis is beyond the scope of this article. The main purpose of our present study of these systems was to demonstrate the viability of our calibration approach.</p>
         <p>The potential can be further constrained by confronting it with more accurate experimental data and data on new sequences. The challenge in this process is to ensure backward compatibility &#8211; new constraints should be met without sacrificing properties already achieved.</p>
      </sec>
      <sec>
         <st>
            <p>5 Conclusion</p>
         </st>
         <p>We have described and tested an implicit solvent all-atom model for protein simulations. The model is computationally fast and yet able to capture structural and thermodynamic properties of a diverse set of sequences. Its computational efficiency greatly facilitates the study of folding and aggregation problems that require exploration of the full free-energy landscape. A program package, called PROFASI <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, for single- and multi-chain simulations with this model is freely available to academic users.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Stefan Wallin for suggestions on the manuscript. This work was in part supported by the Swedish Research Council. The simulations of the larger systems were performed at the John von Neumann Institute for Computing (NIC), Research Centre J&#252;lich, Germany.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Uversky</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2002</pubdate>
            <volume>11</volume>
            <fpage>739</fpage>
            <lpage>756</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2373528</pubid>
                  <pubid idtype="pmpid" link="fulltext">11910019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Dyson</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>PE</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>197</fpage>
            <lpage>208</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15738986</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <aug>
               <au>
                  <snm>Yoda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sugita</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Okamoto</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Chem Phys</source>
            <pubdate>2004</pubdate>
            <volume>307</volume>
            <fpage>269</fpage>
            <lpage>283</lpage>
         </bibl>
         <bibl id="B4">
            <aug>
               <au>
                  <snm>Shell</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Ritterson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dill</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>J Phys Chem</source>
            <pubdate>2008</pubdate>
            <volume>B 112</volume>
            <fpage>6878</fpage>
            <lpage>6886</lpage>
         </bibl>
         <bibl id="B5">
            <aug>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Samuelsson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sjunnesson</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Wallin</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Biophys J</source>
            <pubdate>2003</pubdate>
            <volume>85</volume>
            <fpage>1466</fpage>
            <lpage>1473</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1303323</pubid>
                  <pubid idtype="pmpid" link="fulltext">12944264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <aug>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Biophys J</source>
            <pubdate>2005</pubdate>
            <volume>88</volume>
            <fpage>1560</fpage>
            <lpage>1569</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1305213</pubid>
                  <pubid idtype="pmpid" link="fulltext">15613623</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <aug>
               <au>
                  <snm>Cheon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luheshi</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Dobson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Vendruscolo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Favrin</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>e173</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">17941703</pubid>
                  <pubid idtype="pmcid">1976335</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <aug>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mitternacht</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2008</pubdate>
            <volume>71</volume>
            <fpage>207</fpage>
            <lpage>214</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17932914</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Huo</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2008</pubdate>
            <volume>4</volume>
            <fpage>e1000238</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2583953</pubid>
                  <pubid idtype="pmpid" link="fulltext">19057640</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <aug>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mitternacht</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>13427</fpage>
            <lpage>13432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1224613</pubid>
                  <pubid idtype="pmpid" link="fulltext">16174739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <aug>
               <au>
                  <snm>Mitternacht</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Luccioli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Torcini</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Imparato</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Biophys J</source>
            <pubdate>2009</pubdate>
            <volume>96</volume>
            <fpage>429</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubid idtype="pmpid">19167294</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <aug>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hansmann</snm>
                  <fnm>UHE</fnm>
               </au>
            </aug>
            <source>Biophys J</source>
            <pubdate>2006</pubdate>
            <volume>91</volume>
            <fpage>3573</fpage>
            <lpage>3578</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1630465</pubid>
                  <pubid idtype="pmpid" link="fulltext">16950845</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <aug>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Meinke</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Zimmermann</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Hansmann</snm>
                  <fnm>UHE</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2008</pubdate>
            <volume>105</volume>
            <fpage>8004</fpage>
            <lpage>8007</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18408166</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <aug>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hansmann</snm>
                  <fnm>UHE</fnm>
               </au>
            </aug>
            <source>J Phys Chem</source>
            <pubdate>2008</pubdate>
            <volume>B 112</volume>
            <fpage>15134</fpage>
            <lpage>15139</lpage>
         </bibl>
         <bibl id="B15">
            <aug>
               <au>
                  <snm>Ponder</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Case</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Adv Protein Chem</source>
            <pubdate>2003</pubdate>
            <volume>66</volume>
            <fpage>27</fpage>
            <lpage>85</lpage>
            <xrefbib>
               <pubid idtype="pmpid">14631816</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <aug>
               <au>
                  <snm>Hubner</snm>
                  <fnm>IA</fnm>
               </au>
               <au>
                  <snm>Deeds</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Shakhnovich</snm>
                  <fnm>EI</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>18914</fpage>
            <lpage>18919</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1323145</pubid>
                  <pubid idtype="pmpid" link="fulltext">16365306</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <aug>
               <au>
                  <snm>Herges</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wenzel</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Phys Rev Lett</source>
            <pubdate>2005</pubdate>
            <volume>94</volume>
            <fpage>018101</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15698135</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <aug>
               <au>
                  <snm>Ding</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Tsao</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nie</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dokholyan</snm>
                  <fnm>NV</fnm>
               </au>
            </aug>
            <source>Structure</source>
            <pubdate>2008</pubdate>
            <volume>16</volume>
            <fpage>1010</fpage>
            <lpage>1018</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18611374</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <aug>
               <au>
                  <snm>Hovm&#246;ller</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ohlsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Acta Cryst</source>
            <pubdate>2002</pubdate>
            <volume>D 58</volume>
            <fpage>768</fpage>
            <lpage>776</lpage>
         </bibl>
         <bibl id="B20">
            <aug>
               <au>
                  <snm>Miyazawa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jernigan</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1996</pubdate>
            <volume>256</volume>
            <fpage>623</fpage>
            <lpage>644</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8604144</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wingreen</snm>
                  <fnm>NS</fnm>
               </au>
            </aug>
            <source>Phys Rev Lett</source>
            <pubdate>1997</pubdate>
            <volume>79</volume>
            <fpage>765</fpage>
            <lpage>768</lpage>
         </bibl>
         <bibl id="B22">
            <aug>
               <au>
                  <snm>Lyubartsev</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Martsinovski</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Shevkunov</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Vorontsov-Velyaminov</snm>
                  <fnm>PN</fnm>
               </au>
            </aug>
            <source>J Chem Phys</source>
            <pubdate>1992</pubdate>
            <volume>96</volume>
            <fpage>1776</fpage>
            <lpage>1783</lpage>
         </bibl>
         <bibl id="B23">
            <aug>
               <au>
                  <snm>Marinari</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Parisi</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Europhys Lett</source>
            <pubdate>1992</pubdate>
            <volume>19</volume>
            <fpage>451</fpage>
            <lpage>458</lpage>
         </bibl>
         <bibl id="B24">
            <aug>
               <au>
                  <snm>Swendsen</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Phys Rev Lett</source>
            <pubdate>1986</pubdate>
            <volume>57</volume>
            <fpage>2607</fpage>
            <lpage>2609</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10033814</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <aug>
               <au>
                  <snm>Hukushima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nemoto</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>J Phys Soc (Jap)</source>
            <pubdate>1996</pubdate>
            <volume>65</volume>
            <fpage>1604</fpage>
            <lpage>1608</lpage>
         </bibl>
         <bibl id="B26">
            <aug>
               <au>
                  <snm>Meinke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nadler</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Manuscript in preparation</source>
            <pubdate>2009</pubdate>
         </bibl>
         <bibl id="B27">
            <aug>
               <au>
                  <snm>Favrin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sjunnesson</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>J Chem Phys</source>
            <pubdate>2001</pubdate>
            <volume>114</volume>
            <fpage>8154</fpage>
            <lpage>8158</lpage>
         </bibl>
         <bibl id="B28">
            <aug>
               <au>
                  <snm>Irb&#228;ck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mohanty</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Comput Chem</source>
            <pubdate>2006</pubdate>
            <volume>27</volume>
            <fpage>1548</fpage>
            <lpage>1555</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16847934</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <aug>
               <au>
                  <snm>Garc&#237;a</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Sanbonmatsu</snm>
                  <fnm>KY</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>2782</fpage>
            <lpage>2787</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122425</pubid>
                  <pubid idtype="pmpid" link="fulltext">11867710</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>RG</fnm>
               </au>
            </aug>
            <source>Biometrika</source>
            <pubdate>1974</pubdate>
            <volume>61</volume>
            <fpage>1</fpage>
            <lpage>15</lpage>
         </bibl>
         <bibl id="B31">
            <aug>
               <au>
                  <snm>DeLano</snm>
                  <fnm>WL</fnm>
               </au>
            </aug>
            <publisher>San Carlos, CA: DeLano Scientific</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B32">
            <aug>
               <au>
                  <snm>Gnanakaran</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nymeyer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Portman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sanbonmatsu</snm>
                  <fnm>KY</fnm>
               </au>
               <au>
                  <snm>Garc&#237;a</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>168</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12727509</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <aug>
               <au>
                  <snm>Meinke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hansmann</snm>
                  <fnm>UHE</fnm>
               </au>
            </aug>
            <pubdate>2009</pubdate>
            <inpress/>
         </bibl>
         <bibl id="B34">
            <aug>
               <au>
                  <snm>Neidigh</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Fesinmeyer</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>NH</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <fpage>425</fpage>
            <lpage>430</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11979279</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Androphy</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baleja</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2004</pubdate>
            <volume>43</volume>
            <fpage>7421</fpage>
            <lpage>7431</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15182185</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <aug>
               <au>
                  <snm>Qiu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pabit</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Roitberg</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Hagen</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>2002</pubdate>
            <volume>124</volume>
            <fpage>12952</fpage>
            <lpage>12953</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12405814</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <aug>
               <au>
                  <snm>Streicher</snm>
                  <fnm>WW</fnm>
               </au>
               <au>
                  <snm>Makhatadze</snm>
                  <fnm>GI</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2007</pubdate>
            <volume>46</volume>
            <fpage>2876</fpage>
            <lpage>2880</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17295518</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <aug>
               <au>
                  <snm>Bierzynski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1982</pubdate>
            <volume>79</volume>
            <fpage>2470</fpage>
            <lpage>2474</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">346220</pubid>
                  <pubid idtype="pmpid">6283528</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <aug>
               <au>
                  <snm>Scholtz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barrick</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>York</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1995</pubdate>
            <volume>92</volume>
            <fpage>185</fpage>
            <lpage>189</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">42842</pubid>
                  <pubid idtype="pmpid">7816813</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <aug>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>York</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Baldwin</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1987</pubdate>
            <volume>326</volume>
            <fpage>563</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3561498</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <aug>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>PS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1992</pubdate>
            <volume>257</volume>
            <fpage>947</fpage>
            <lpage>951</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1502559</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <aug>
               <au>
                  <snm>Lockhart</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>PS</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1993</pubdate>
            <volume>260</volume>
            <fpage>198</fpage>
            <lpage>202</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8469972</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Eaton</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Hofrichter</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1997</pubdate>
            <volume>36</volume>
            <fpage>9200</fpage>
            <lpage>9210</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9230053</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <aug>
               <au>
                  <snm>Williams</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Causgrove</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>Gilmanshin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Callender</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Woodruff</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1996</pubdate>
            <volume>35</volume>
            <fpage>691</fpage>
            <lpage>697</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8547249</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <aug>
               <au>
                  <snm>Chakrabartty</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ananthanarayanan</snm>
                  <fnm>VS</fnm>
               </au>
               <au>
                  <snm>Hew</snm>
                  <fnm>CL</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1989</pubdate>
            <volume>264</volume>
            <fpage>11307</fpage>
            <lpage>11312</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2738067</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <aug>
               <au>
                  <snm>Steinmetz</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Jelesarov</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Matousek</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Honnappa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jahnke</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Missimer</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Frank</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Alexandrescu</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Kammerer</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>7062</fpage>
            <lpage>7067</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1855353</pubid>
                  <pubid idtype="pmpid" link="fulltext">17438295</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <aug>
               <au>
                  <snm>Honda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yamasaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sawada</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Morii</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Structure</source>
            <pubdate>2004</pubdate>
            <volume>12</volume>
            <fpage>1507</fpage>
            <lpage>1518</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15296744</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <aug>
               <au>
                  <snm>Pastor</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>L&#243;pez de la Paz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lacroix</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Serrano</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>P&#233;rez-Pay&#225;</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>614</fpage>
            <lpage>619</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117354</pubid>
                  <pubid idtype="pmpid" link="fulltext">11782528</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <aug>
               <au>
                  <snm>Blanco</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Rivas</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Serrano</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>1994</pubdate>
            <volume>1</volume>
            <fpage>584</fpage>
            <lpage>590</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7634098</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <aug>
               <au>
                  <snm>Fesinmeyer</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Hudson</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>NH</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>2004</pubdate>
            <volume>126</volume>
            <fpage>7238</fpage>
            <lpage>7243</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15186161</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <aug>
               <au>
                  <snm>Mu&#241;oz</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Hofrichter</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Eaton</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>390</volume>
            <fpage>196</fpage>
            <lpage>199</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9367160</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <aug>
               <au>
                  <snm>Cochran</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Skelton</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Starovasnik</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>5578</fpage>
            <lpage>5583</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33255</pubid>
                  <pubid idtype="pmpid" link="fulltext">11331745</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>WY</fnm>
               </au>
               <au>
                  <snm>Pitera</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Swope</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Gruebele</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>336</volume>
            <fpage>241</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14741219</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <aug>
               <au>
                  <snm>Kortemme</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ram&#237;rez-Alvarado</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Serrano</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>281</volume>
            <fpage>253</fpage>
            <lpage>256</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9657719</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <aug>
               <au>
                  <snm>L&#243;pez de la Paz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lacroix</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ram&#237;rez-Alvarado</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Serrano</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>312</volume>
            <fpage>229</fpage>
            <lpage>246</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11545599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <aug>
               <au>
                  <snm>de Alba</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Santorio</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rico</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jimenez</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>1999</pubdate>
            <volume>8</volume>
            <fpage>854</fpage>
            <lpage>865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2144301</pubid>
                  <pubid idtype="pmpid" link="fulltext">10211831</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <aug>
               <au>
                  <snm>Marti</snm>
                  <fnm>DN</fnm>
               </au>
               <au>
                  <snm>Bosshard</snm>
                  <fnm>HR</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2004</pubdate>
            <volume>43</volume>
            <fpage>12436</fpage>
            <lpage>12447</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15449933</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <aug>
               <au>
                  <snm>Kuhlman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dantas</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ireton</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Varani</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Stoddard</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>1364</fpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14631033</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <aug>
               <au>
                  <snm>Dantas</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Watters</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Lunde</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Eletr</snm>
                  <fnm>ZM</fnm>
               </au>
               <au>
                  <snm>Isern</snm>
                  <fnm>NG</fnm>
               </au>
               <au>
                  <snm>Roseman</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lipfert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doniach</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tompa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kuhlman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stoddard</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Varani</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>362</volume>
            <fpage>1004</fpage>
            <lpage>1024</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16949611</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <aug>
               <au>
                  <snm>Johansson</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Gibney</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Skalicky</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Wand</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Dutton</snm>
                  <fnm>PL</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>1998</pubdate>
            <volume>120</volume>
            <fpage>3881</fpage>
            <lpage>3886</lpage>
         </bibl>
         <bibl id="B61">
            <aug>
               <au>
                  <snm>Dai</snm>
                  <fnm>QH</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fuentes</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Blomberg</snm>
                  <fnm>MRA</fnm>
               </au>
               <au>
                  <snm>Dutton</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Wand</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>2002</pubdate>
            <volume>124</volume>
            <fpage>10952</fpage>
            <lpage>10953</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12224922</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <aug>
               <au>
                  <snm>Jourdan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>Eur Biophys J</source>
            <pubdate>2000</pubdate>
            <volume>267</volume>
            <fpage>3539</fpage>
            <lpage>3548</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>

