<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1472-6807-8-23</ui>
   <ji>1472-6807</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Exploring allosteric coupling in the &#945;-subunit of Heterotrimeric G proteins using evolutionary and ensemble-based approaches</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Sayar</snm>
               <fnm>Kemal</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>kemalsayar@yahoo.com</email>
            </au>
            <au id="A2">
               <snm>U&#287;ur</snm>
               <fnm>&#214;zlem</fnm>
               <insr iid="I1"/>
               <email>mugur@dialup.ankara.edu.tr</email>
            </au>
            <au id="A3">
               <snm>Liu</snm>
               <fnm>Tong</fnm>
               <insr iid="I3"/>
               <email>toliu@ucsd.edu</email>
            </au>
            <au id="A4">
               <snm>Hilser</snm>
               <mi>J</mi>
               <fnm>Vincent</fnm>
               <insr iid="I3"/>
               <email>vjhilser@utmb.edu</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Onaran</snm>
               <fnm>Ongun</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>onaran@dialup.ankara.edu.tr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Ankara University Faculty of Medicine, Department of Pharmacology and Clinical Pharmacology, S&#305;hhiye 06100, Ankara, Turkey</p>
            </ins>
            <ins id="I2">
               <p>Ankara University Faculty of Medicine, and Molecular Biology and Technology Research and Development Unit, S&#305;hhiye 06100, Ankara, Turkey</p>
            </ins>
            <ins id="I3">
               <p>Department of Biochemistry and Molecular Biology, and Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston, TX, 77555-1068 USA</p>
            </ins>
         </insg>
         <source>BMC Structural Biology</source>
         <issn>1472-6807</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>23</fpage>
         <url>http://www.biomedcentral.com/1472-6807/8/23</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18454845</pubid>
               <pubid idtype="doi">10.1186/1472-6807-8-23</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>04</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>02</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>02</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Sayar et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Allosteric coupling, which can be defined as propagation of a perturbation at one region of the protein molecule (such as ligand binding) to distant sites in the same molecule, constitutes the most general mechanism of regulation of protein function. However, unlike molecular details of ligand binding, structural elements involved in allosteric effects are difficult to diagnose. Here, we identified allosteric linkages in the &#945;-subunits of heterotrimeric G proteins, which were evolved to transmit membrane receptor signals by allosteric mechanisms, by using two different approaches that utilize fundamentally different and independent information.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We analyzed: 1) correlated mutations in the family of G protein &#945;-subunits, and 2) cooperativity of the native state ensemble of the G&#945;i1 or transducin. The combination of these approaches not only recovered already-known details such as the switch regions that change conformation upon nucleotide exchange, and those regions that are involved in receptor, effector or G&#946;&#947; interactions (indicating that the predictions of the analyses can be viewed with a measure of confidence), but also predicted new sites that are potentially involved in allosteric communication in the G&#945; protein. A summary of the new sites found in the present analysis, which were not apparent in crystallographic data, is given along with known functional and structural information. Implications of the results are discussed.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>A set of residues and/or structural elements that are potentially involved in allosteric communication in G&#945; is presented. This information can be used as a guide to structural, spectroscopic, mutational, and theoretical studies on the allosteric network in G&#945; proteins, which will provide a better understanding of G protein-mediated signal transduction.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>G proteins and G protein-coupled receptors (GPCR) constitute a large family of signaling proteins that transmit extracellular signals to the intracellular milieu where the signals are integrated and transformed to a variety of biological responses. The receptor is activated by the binding of agonists, which are neurotransmitters, hormones, autacoids, odorants, taste or drug molecules present in the extracellular environment. The receptor then activates its cognate heterotrimeric G protein, which in turn transmits the signals to intracellular effectors, such as second-messenger generating enzymes or ion channels. The GDP-bound G&#945; subunit complexed with tightly bound G&#946;&#947; (i.e. the heterotrimer) is generally considered as the inactive state of the G protein. The agonist-activated receptor catalyses the release of bound-GDP from G&#945; and leads to the binding of GTP to the nucleotide binding site of G&#945; (nucleotide exchange). The conformational change in G&#945; that occurs upon nucleotide exchange results in dissociation of GTP-G&#945; from the G&#946;&#947; subunit. Dissociated subunits interact with downstream effectors and modulate their activity. Hydrolysis of bound-GTP to GDP by the intrinsic GTPase of G&#945; and subsequent association of GDP-G&#945; with G&#946;&#947; restore the inactive heterotrimer (see <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> for review). Accordingly, the strength of the signal transmitted to the cellular effectors is determined by the detailed kinetics and energetics of this activation-inactivation cycle. Besides the receptor, signaling proteins such as RGS (regulator of G protein signaling) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and AGS (activator of G protein signaling) <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> proteins are also capable of regulating or modulating G protein activation in a nontrivial manner by interfering with the GTPase activity and/or nucleotide exchange rates of G&#945;, or by modifying binding of the partner proteins, namely G&#945;, G&#946;&#947; and receptor, to each other.</p>
         <p>High resolution crystal structures of different G&#945; subunits <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp> have revealed that G&#945; consists of two main folding domains: A Ras-like GTPase domain, which is conserved in the superfamily of GTPases, with extended amino and carboxyl terminal helices, and a helical domain folded into an orthogonal bundle of six &#945; helices which is unique to heterotrimeric G proteins. A deep guanine nucleotide binding pocket resides between the two domains that are connected with two linkers. Three flexible regions in the GTPase domain that change conformation upon nucleotide exchange have been identified and designated as switch I, II, and III (see Figure <figr fid="F1">1</figr> for a summary). Most of the binding sites for signaling partners of G&#945; have been found or anticipated to be on the GTPase domain, although some of them, such as the one for RGS14 <abbrgrp><abbr bid="B9">9</abbr></abbrgrp> or RGS9 <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, extend to the helical domain as well (see Figure <figr fid="F1">1</figr>). The interdomain interface is thought to be involved in guanine nucleotide exchange and heterotrimer dissociation (i.e. G protein activation) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Main structural elements of G&#945; subunit and its interaction sites with its partners</p>
            </caption>
            <text>
               <p><b>Main structural elements of G&#945; subunit and its interaction sites with its partners</b>. Ribbon representation of transducin (G&#945;t) is shown in the centre as a prototypical example of G&#945; subunits. Ras-like GTPase and helical domains are shown as pale yellow and white, respectively. Three switch regions, indicated as S-I to S-III are colored green. Bound-nucleotide (GTP&#947;S) is shown as cyan sticks in the centre of the molecule. The nomenclature for secondary structure is also indicated on the picture. The long N-terminal helix (26 residues) is missing in the structure. Pictures on the sides are surface representation of the molecule with the same orientation as the ribbon representation. Contact sites with G&#946;&#947; [15], RGS9 and phospodiesterase (PDE&#947;) [10], and sites involved in receptor recognition and interaction [1, 41, 42] are shown with different colors as indicated in the picture. All the pictures were rendered by using PyMol (DeLano Scientific LLC, South San Francisco, California, USA) with the deposited coordinates in Protein Data Bank (PDB code <ext-link ext-link-type="pdb" ext-link-id="1TAG">1TAG</ext-link>-chainA).</p>
            </text>
            <graphic file="1472-6807-8-23-1"/>
         </fig>
         <p>From a molecular point of view, regulation of G protein-mediated signal transduction relies on a complex interplay among different functional and structural domains of G&#945;; a perturbation at one functional domain modifies the structural, energetic and eventually functional properties of the others. Although some of these interactions are mediated by direct contacts, such as the one that takes place between RGS or AGS and nucleotide binding sites <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, most are mediated allosterically. High resolution crystal structures of G&#945;i <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and G&#945;t <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, as well as the heterotrimers <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, in their GDP- or GTP&#947;S- (a nonhydrolysable analogue of GTP) bound forms have provided remarkable insight into the understanding of guanine nucleotide-dependent conformational changes in G&#945; and the mechanism of GTPase. However, the realization that conformational fluctuations may play an important role in mediating allosteric coupling <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp> casts doubt on whether a complete picture of allosteric communication can emerge from static endpoint pictures provided by crystallographic studies. Indeed, despite the wealth of available data provided by biochemical, mutational, and structural studies, many questions regarding the structural elements involved in allosteric communication remain to be answered. This is due to the fact that systematic screening of allosteric linkage, for example by using double-mutant-cycle strategies <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, is generally an intractable (if not impossible) experimental task, even for relatively small proteins.</p>
         <p>In order to overcome these difficulties, at least in part, and to provide a structural map of a possible network of allosteric linkage in G&#945;, we adopted a strategy that utilizes two different computational approaches, which are based on independent and diverse assumptions and principles. We evaluated; 1) evolutionary data to draw information about statistical coupling between residues, and 2) a statistical thermodynamic model (i.e. COREX algorithm) of the native state ensemble of G&#945; to identify linkages in local folding free energies in G&#945;i and G&#945;t. We used currently available experimental information and the convergence of the results of the two independent analyses as a basis of judgment for the present results.</p>
         <p>We show that the two approaches together reveal already known facts, such as the linkage between guanine nucleotide, receptor and effector binding sites, indicating that the predictions of the analyses can be viewed with a measure of confidence. Therefore, we present a set of residues and/or structural elements that may have potential functional importance for the allosteric communication in G&#945;, and that can be used as a guide to structural, spectroscopic, mutational, and theoretical studies on the allosteric network in G&#945; proteins, as well as to the interpretation of a vast amount of available experimental data.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Dependence of amino acid distributions in multiple sequence alignment</p>
            </st>
            <p>Correlated mutation analyses in multiple sequence alignments have long been considered as useful tools to identify sites involved in protein-protein contacts <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, or to obtain distance information for structural prediction and fold recognition <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. However, Horowitz et al. <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> have observed that coordinated changes in amino acids at two different positions (i.e. compensatory mutations) in an evolutionarily sampled sequence could predict non-additivity in the corresponding double mutant cycle, which is indicative for allosteric coupling. Lockless and Ranganathan <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> later expanded on this idea and suggested that covariance information in a multiple sequence alignment (MSA) could also be used to identify long-range energetic couplings (i.e. allosteric linkage) in proteins. The reasoning was as follows: if proteins have evolved in such a way that a perturbation at one site can be allosterically transmitted to a distant site, then an evolutionary perturbation imparted by a random mutation at one of these sites should affect the evolution of the other. This should result in a dependence of amino acid distributions at the relevant sites, which could be uncovered by covariance analyses in a well sampled MSA. Although this is equivalent to saying that distant sites in a protein may have been co-evolved under common functional or structural constraints, the subtle interpretation that such associations might indicate allosteric linkage between the sites has been proved to be useful <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. However, the free energy-like measure that has originally been suggested by Lockless and Ranganathan for this statistical coupling has potentially serious disadvantages: It is based on a virtual perturbation experiment that yields an asymmetric measure of coupling, and it is extremely biased by the conservation of the sites. The bias is due to the properties of binomial probabilities that play a central role in the measure (see supplemental information in the Additional file <supplr sid="S1">1</supplr> for detailed discussion). In general, the bias is so extensive that the measure gauges conservation rather than statistical coupling. Other disadvantages have also been discussed by others <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. In the present analysis, we therefore used a modified measure of dependence as explained below.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>supporting information. A simulation test for the comparison of the measure of statistical coupling in MSA that was used in the present study with the one that has been originally suggested by Lockless and Ranganathan (ref 23) is provided as supporting information.</p>
               </text>
               <file name="1472-6807-8-23-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>We assessed the dependence of amino acid distributions by means of &#967;<sup>2 </sup>value associated with relevant contingency tables. The tables were constructed as usual. The magnitude of &#967;<sup>2 </sup>gauges the degree of departure from independence assumption. However, the absolute magnitude of &#967;<sup>2 </sup>depends on the actual amino acid content of the sites, which can obviously be different for every pair of sites in a protein sequence, and therefore requires normalization in order to be comparable across different pairs of sites. We used the following normalization:</p>
            <p>
               <display-formula>
                  <m:math name="1472-6807-8-23-i1" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow/>
                                             </m:mtd>
                                          </m:mtr>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mi>N</m:mi>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                       <m:msubsup>
                                          <m:mi>&#967;</m:mi>
                                          <m:mrow>
                                             <m:mi>i</m:mi>
                                             <m:mi>j</m:mi>
                                          </m:mrow>
                                          <m:mn>2</m:mn>
                                       </m:msubsup>
                                       <m:mo>=</m:mo>
                                       <m:mn>2</m:mn>
                                       <m:mfrac>
                                          <m:mrow>
                                             <m:msubsup>
                                                <m:mi>&#967;</m:mi>
                                                <m:mrow>
                                                   <m:mi>i</m:mi>
                                                   <m:mi>j</m:mi>
                                                </m:mrow>
                                                <m:mn>2</m:mn>
                                             </m:msubsup>
                                          </m:mrow>
                                          <m:mrow>
                                             <m:mrow>
                                                <m:mo>(</m:mo>
                                                <m:mrow>
                                                   <m:msubsup>
                                                      <m:mi>&#967;</m:mi>
                                                      <m:mrow>
                                                         <m:mi>i</m:mi>
                                                         <m:mi>i</m:mi>
                                                      </m:mrow>
                                                      <m:mn>2</m:mn>
                                                   </m:msubsup>
                                                   <m:mo>+</m:mo>
                                                   <m:msubsup>
                                                      <m:mi>&#967;</m:mi>
                                                      <m:mrow>
                                                         <m:mi>j</m:mi>
                                                         <m:mi>j</m:mi>
                                                      </m:mrow>
                                                      <m:mn>2</m:mn>
                                                   </m:msubsup>
                                                </m:mrow>
                                                <m:mo>)</m:mo>
                                             </m:mrow>
                                          </m:mrow>
                                       </m:mfrac>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mrow>
                                          <m:mo>{</m:mo>
                                          <m:mrow>
                                             <m:mtable columnalign="left">
                                                <m:mtr columnalign="left">
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mi>i</m:mi>
                                                         <m:mo>=</m:mo>
                                                         <m:mi>j</m:mi>
                                                         <m:mo>&#8658;</m:mo>
                                                      </m:mrow>
                                                   </m:mtd>
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mtable>
                                                            <m:mtr>
                                                               <m:mtd>
                                                                  <m:mrow/>
                                                               </m:mtd>
                                                            </m:mtr>
                                                            <m:mtr>
                                                               <m:mtd>
                                                                  <m:mi>N</m:mi>
                                                               </m:mtd>
                                                            </m:mtr>
                                                         </m:mtable>
                                                         <m:msubsup>
                                                            <m:mi>&#967;</m:mi>
                                                            <m:mrow>
                                                               <m:mi>i</m:mi>
                                                               <m:mi>j</m:mi>
                                                            </m:mrow>
                                                            <m:mn>2</m:mn>
                                                         </m:msubsup>
                                                         <m:mo>=</m:mo>
                                                         <m:mn>1</m:mn>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                                <m:mtr columnalign="left">
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mi>i</m:mi>
                                                         <m:mo>&#8800;</m:mo>
                                                         <m:mi>j</m:mi>
                                                         <m:mo>&#8658;</m:mo>
                                                      </m:mrow>
                                                   </m:mtd>
                                                   <m:mtd columnalign="left">
                                                      <m:mrow>
                                                         <m:mn>0</m:mn>
                                                         <m:mo>&#8804;</m:mo>
                                                         <m:mtable>
                                                            <m:mtr>
                                                               <m:mtd>
                                                                  <m:mrow/>
                                                               </m:mtd>
                                                            </m:mtr>
                                                            <m:mtr>
                                                               <m:mtd>
                                                                  <m:mi>N</m:mi>
                                                               </m:mtd>
                                                            </m:mtr>
                                                         </m:mtable>
                                                         <m:msubsup>
                                                            <m:mi>&#967;</m:mi>
                                                            <m:mrow>
                                                               <m:mi>i</m:mi>
                                                               <m:mi>j</m:mi>
                                                            </m:mrow>
                                                            <m:mn>2</m:mn>
                                                         </m:msubsup>
                                                         <m:mo>&lt;</m:mo>
                                                         <m:mn>1</m:mn>
                                                      </m:mrow>
                                                   </m:mtd>
                                                </m:mtr>
                                             </m:mtable>
                                          </m:mrow>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaauaabeqaceaaaeaaaeaacqWGobGtaaGaeq4Xdm2aa0baaSqaaiabdMgaPjabdQgaQbqaaiabikdaYaaakiabg2da9iabikdaYKqbaoaalaaabaGaeq4Xdm2aa0baaeaacqWGPbqAcqWGQbGAaeaacqaIYaGmaaaabaWaaeWaaeaacqaHhpWydaqhaaqaaiabdMgaPjabdMgaPbqaaiabikdaYaaacqGHRaWkcqaHhpWydaqhaaqaaiabdQgaQjabdQgaQbqaaiabikdaYaaaaiaawIcacaGLPaaaaaaakeaadaGabaqaauaabaqaciaaaeaacqWGPbqAcqGH9aqpcqWGQbGAcqGHshI3aeaafaqabeGabaaabaaabaGaemOta4eaaiabeE8aJnaaDaaaleaacqWGPbqAcqWGQbGAaeaacqaIYaGmaaGccqGH9aqpcqaIXaqmaeaacqWGPbqAcqGHGjsUcqWGQbGAcqGHshI3aeaacqaIWaamcqGHKjYOfaqabeGabaaabaaabaGaemOta4eaaiabeE8aJnaaDaaaleaacqWGPbqAcqWGQbGAaeaacqaIYaGmaaGccqGH8aapcqaIXaqmaaaacaGL7baaaaaaaa@6ACF@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>The normalizing factor is the average "self-coupling" of each site, which gives the upper limit of &#967;<sup>2 </sup>for that site. For all possible pair of sites in the MSA, <inline-formula><m:math name="1472-6807-8-23-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msubsup><m:mi>&#967;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaqhaaWcbaGaemyAaKMaemOAaOgabaGaeGOmaidaaaaa@3295@</m:annotation></m:semantics></m:math></inline-formula> forms a correlation-like symmetric matrix with diagonal elements equal to 1. As a further refinement we made the following correction: we calculated average <inline-formula><m:math name="1472-6807-8-23-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msubsup><m:mi>&#967;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaqhaaWcbaGaemyAaKMaemOAaOgabaGaeGOmaidaaaaa@3295@</m:annotation></m:semantics></m:math></inline-formula> for each pair of sites by randomly permuting the amino acids at each site in the pair (this procedure affects only the joint distribution without changing the marginals). We considered this value as the "background" <inline-formula><m:math name="1472-6807-8-23-i2" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msubsup><m:mi>&#967;</m:mi><m:mrow><m:mi>i</m:mi><m:mi>j</m:mi></m:mrow><m:mn>2</m:mn></m:msubsup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaqhaaWcbaGaemyAaKMaemOAaOgabaGaeGOmaidaaaaa@3295@</m:annotation></m:semantics></m:math></inline-formula> for the sites <it>i </it>and <it>j</it>. For all possible pairs in the sequence, these values form a matrix of the same size as <inline-formula><m:math name="1472-6807-8-23-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaahaaWcbeqaaiabikdaYaaaaaa@2FDD@</m:annotation></m:semantics></m:math></inline-formula>. We used the difference between <inline-formula><m:math name="1472-6807-8-23-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaahaaWcbeqaaiabikdaYaaaaaa@2FDD@</m:annotation></m:semantics></m:math></inline-formula> and the background matrix as a final measure of dependence (or statistical coupling), which can be interpreted as the extra <inline-formula><m:math name="1472-6807-8-23-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaahaaWcbeqaaiabikdaYaaaaaa@2FDD@</m:annotation></m:semantics></m:math></inline-formula> due to the actual arrangement of amino acids in the MSA columns. Note that &#967;<sup>2 </sup>in this procedure was not used in its usual statistical sense, but simply as a measure of normalized "distance" between two joint distributions. Obviously, physicochemical properties of amino acids were not considered in this analysis. A comparable strategy has been undertaken by Kass and Horovitz <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Finally, possible contributions of evolutionary noise to the covariance information <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> were disregarded in the present analysis, since the results of COREX analysis (as explained below) were used as a source of independent information to judge the relevence of covariance data. The entire procedure is available upon request as a Matlab script.</p>
            <p>For the present analysis, we used a non-redundant multiple sequence alignment consisting of 233 samples of &#945;-subunits of heterotrimeric G protein sequences, obtained from the pfam database (please see Availability &amp; requirements section below), from which redundancies were eliminated manually (the entire data set is available upon request). We excluded those sites for which the available number of amino acid samples was less than 90% of the size of the MSA. Highly conserved sites were also disregarded in the analysis since the covariance with or between such sites is statistically ambiguous as they possess little or no variance.</p>
            <p>In order to identify the network of alike sites with respect to their coupling strength and pattern, we did a two-way hierarchical cluster analysis with the corrected <inline-formula><m:math name="1472-6807-8-23-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaahaaWcbeqaaiabikdaYaaaaaa@2FDD@</m:annotation></m:semantics></m:math></inline-formula> matrix after eliminating the sites that did not fulfill the sampling criterion mentioned above. We used complete linkage function and city-block distance metrics (sum of absolute differences) as the simplest and most natural choice <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. We tested the efficiency of the entire procedure by using simulated multiple sequence alignments, in which different types of couplings with different strengths were imposed. Simulated patterns were perfectly recovered by the above procedure, whereas the original measure of Lockless and Ranganathan failed [see Additional file <supplr sid="S1">1</supplr>].</p>
         </sec>
         <sec>
            <st>
               <p>Simulation of statistical ensemble</p>
            </st>
            <p>In thermodynamic equilibrium, the folded state of a protein comprises an ensemble of energetically close conformational states <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. Each state in this ensemble emerges with a certain probability depending on its free energy. Equivalently, this ensemble can be viewed as the "state repertoire" of a single molecule that is continuously explored by the molecule with different mean dwell-times, the latter being eventually related to the state probabilities when the molecule is observed in the time scale of thermodynamic averaging. In any case, the native state of a protein for a given set of conditions can be specified by a probability distribution defined over this conformational space <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. It can be theoretically shown that modification of this probability distribution by different perturbations made at distant sites on a protein results in energetic coupling of the perturbations when they are made simultaneously (e.g. coupling between two ligand binding processes at two distant binding sites) <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, which constitutes the microscopic basis of two equivalent linkage theories that have been formulated differently by Wyman <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> and Weber <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. Thus, allosteric linkage between distant sites on a protein molecule can be analyzed by using such probability distributions <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B35">35</abbr></abbrgrp>, provided that the equilibrium probabilities associated with specific conformers in the native ensemble are available.</p>
            <p>In a series of studies, Hilser, Freire and their co-workers have developed an effective strategy to model the native ensemble of folded states of proteins <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. The procedure, known as the COREX algorithm, can be summarized as follows. The native ensemble of the protein is modeled as a collection of partially folded states of the protein: Each residue is considered as either folded (native-like as in the high-resolution structure) or unfolded (devoid of structure), and folding blocks of consecutive (usually 6 to 12) residues, in which residues are collectively folded or unfolded, are defined. The native ensemble is obtained by combinatorial folding and unfolding of these blocks. Each microstate in the ensemble thus generated is then assigned a Gibbs free energy by means of an empirically parameterized energy function <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. This function, which is based on solvent accessible surface area and conformational entropy, has been previously calibrated and validated for globular non-membrane proteins <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Free energy distribution over the states (&#916;<it>G</it><sub><it>i</it></sub>) gives the probability distribution of interest:</p>
            <p>
               <display-formula>
                  <m:math name="1472-6807-8-23-i4" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:mtable>
                              <m:mtr>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>K</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>=</m:mo>
                                       <m:msup>
                                          <m:mi>e</m:mi>
                                          <m:mrow>
                                             <m:mo>&#8722;</m:mo>
                                             <m:mi>&#916;</m:mi>
                                             <m:msub>
                                                <m:mi>G</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                             <m:mo>/</m:mo>
                                             <m:mi>R</m:mi>
                                             <m:mi>T</m:mi>
                                          </m:mrow>
                                       </m:msup>
                                    </m:mrow>
                                 </m:mtd>
                                 <m:mtd>
                                    <m:mrow>
                                       <m:mtable>
                                          <m:mtr>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:mi>Q</m:mi>
                                                   <m:mo>=</m:mo>
                                                   <m:mstyle displaystyle="true">
                                                      <m:munderover>
                                                         <m:mo>&#8721;</m:mo>
                                                         <m:mrow>
                                                            <m:mi>i</m:mi>
                                                            <m:mo>=</m:mo>
                                                            <m:mn>1</m:mn>
                                                         </m:mrow>
                                                         <m:mi>n</m:mi>
                                                      </m:munderover>
                                                      <m:mrow>
                                                         <m:msub>
                                                            <m:mi>K</m:mi>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                      </m:mrow>
                                                   </m:mstyle>
                                                </m:mrow>
                                             </m:mtd>
                                             <m:mtd>
                                                <m:mrow>
                                                   <m:msub>
                                                      <m:mi>p</m:mi>
                                                      <m:mi>i</m:mi>
                                                   </m:msub>
                                                   <m:mo>=</m:mo>
                                                   <m:mfrac>
                                                      <m:mrow>
                                                         <m:msub>
                                                            <m:mi>K</m:mi>
                                                            <m:mi>i</m:mi>
                                                         </m:msub>
                                                      </m:mrow>
                                                      <m:mi>Q</m:mi>
                                                   </m:mfrac>
                                                   <m:mo>.</m:mo>
                                                </m:mrow>
                                             </m:mtd>
                                          </m:mtr>
                                       </m:mtable>
                                    </m:mrow>
                                 </m:mtd>
                              </m:mtr>
                           </m:mtable>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabeGaaaqaaiabdUealnaaBaaaleaacqWGPbqAaeqaaOGaeyypa0Jaemyzau2aaWbaaSqabeaacqGHsislcqqHuoarcqWGhbWrdaWgaaadbaGaemyAaKgabeaaliabc+caViabdkfasjabdsfaubaaaOqaauaabeqabiaaaeaacqWGrbqucqGH9aqpdaaeWbqaaiabdUealnaaBaaaleaacqWGPbqAaeqaaaqaaiabdMgaPjabg2da9iabigdaXaqaaiabd6gaUbqdcqGHris5aaGcbaGaemiCaa3aaSbaaSqaaiabdMgaPbqabaGccqGH9aqpjuaGdaWcaaqaaiabdUealnaaBaaabaGaemyAaKgabeaaaeaacqWGrbquaaGccqGGUaGlaaaaaaaa@4EDE@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p><it>K</it><sub><it>i </it></sub>is the Boltzman weight of state <it>i</it>, <it>Q </it>is the partition function of the ensemble, and <it>p</it><sub><it>i </it></sub>is the probability of state <it>i</it>. The summation runs over the entire ensemble.</p>
            <p>The probability distribution <it>p </it>can be used to estimate a descriptor of the residue-specific equilibrium; the residue-specific stability constant (<it>k</it><sub><it>f</it></sub>). This quantity is the ratio of total probability of the states where residue <it>j </it>is in folded conformation to that of the states where residue <it>j </it>is in unfolded conformation:</p>
            <p>
               <display-formula>
                  <m:math name="1472-6807-8-23-i5" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>k</m:mi>
                              <m:mrow>
                                 <m:mi>f</m:mi>
                                 <m:mo>,</m:mo>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munder>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mtext>index&#160;of&#160;states&#160;where&#160;j&#160;is&#160;folded</m:mtext>
                                       </m:mrow>
                                    </m:munder>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                              <m:mrow>
                                 <m:mstyle displaystyle="true">
                                    <m:munder>
                                       <m:mo>&#8721;</m:mo>
                                       <m:mrow>
                                          <m:mi>i</m:mi>
                                          <m:mo>=</m:mo>
                                          <m:mtext>index&#160;of&#160;states&#160;where&#160;j&#160;is&#160;unfolded</m:mtext>
                                       </m:mrow>
                                    </m:munder>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>p</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                 </m:mstyle>
                              </m:mrow>
                           </m:mfrac>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaem4AaS2aaSbaaSqaaiabdAgaMjabcYcaSiabdQgaQbqabaGccqGH9aqpjuaGdaWcaaqaamaaqafabaGaemiCaa3aaSbaaeaacqWGPbqAaeqaaaqaaiabdMgaPjabg2da9iabbMgaPjabb6gaUjabbsgaKjabbwgaLjabbIha4jabbccaGiabb+gaVjabbAgaMjabbccaGiabbohaZjabbsha0jabbggaHjabbsha0jabbwgaLjabbohaZjabbccaGiabbEha3jabbIgaOjabbwgaLjabbkhaYjabbwgaLjabbccaGiabbQgaQjabbccaGiabbMgaPjabbohaZjabbccaGiabbAgaMjabb+gaVjabbYgaSjabbsgaKjabbwgaLjabbsgaKbqabiabggHiLdaabaWaaabuaeaacqWGWbaCdaWgaaqaaiabdMgaPbqabaaabaGaemyAaKMaeyypa0JaeeyAaKMaeeOBa4MaeeizaqMaeeyzauMaeeiEaGNaeeiiaaIaee4Ba8MaeeOzayMaeeiiaaIaee4CamNaeeiDaqNaeeyyaeMaeeiDaqNaeeyzauMaee4CamNaeeiiaaIaee4DaCNaeeiAaGMaeeyzauMaeeOCaiNaeeyzauMaeeiiaaIaeeOAaOMaeeiiaaIaeeyAaKMaee4CamNaeeiiaaIaeeyDauNaeeOBa4MaeeOzayMaee4Ba8MaeeiBaWMaeeizaqMaeeyzauMaeeizaqgabeGaeyyeIuoaaaaaaa@9772@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p>It follows that the local folding free energy of residue <it>j </it>is -<it>RT</it>ln(<it>k</it><sub><it>f</it>,<it>j</it></sub>). Although this quantity is designated as "local", it actually depends on all the other residues through the probability distribution, which is determined by the properties of all microstates in the ensemble to which every part of the protein contributes. Hence, in the context of a given ensemble, <it>k</it><sub><it>f </it></sub>is local in the sense that it is a true physical descriptor of the equilibrium state of a single residue. However, this does not mean that it is independent of the state of other residues in the protein. It has been shown by extensive analysis of various proteins that these residue-specific constants reproduce a number of experimental observables, suggesting that the actual ensemble is well-represented by the COREX ensemble <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>.</p>
            <p>Another interesting piece of information that can be drawn from the COREX analysis is the degree of dependence of local folding free energies among different sites of the protein, which can be interpreted as an additional sign of allosteric linkage within the molecule <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. This information can be obtained by evaluating folding correlations between different sites in a subset of the ensemble which contains the most probable states (e.g. with 0.99 probability):</p>
            <p>
               <display-formula>
                  <m:math name="1472-6807-8-23-i6" xmlns:m="http://www.w3.org/1998/Math/MathML">
                     <m:semantics>
                        <m:mrow>
                           <m:msub>
                              <m:mi>r</m:mi>
                              <m:mrow>
                                 <m:mi>i</m:mi>
                                 <m:mi>j</m:mi>
                              </m:mrow>
                           </m:msub>
                           <m:mo>=</m:mo>
                           <m:mfrac>
                              <m:mrow>
                                 <m:mi>cov</m:mi>
                                 <m:mo>&#8289;</m:mo>
                                 <m:mrow>
                                    <m:mo>(</m:mo>
                                    <m:mrow>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>i</m:mi>
                                       </m:msub>
                                       <m:mo>,</m:mo>
                                       <m:msub>
                                          <m:mi>F</m:mi>
                                          <m:mi>j</m:mi>
                                       </m:msub>
                                    </m:mrow>
                                    <m:mo>)</m:mo>
                                 </m:mrow>
                              </m:mrow>
                              <m:mrow>
                                 <m:msqrt>
                                    <m:mrow>
                                       <m:mi>var</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>F</m:mi>
                                                <m:mi>i</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                       <m:mi>var</m:mi>
                                       <m:mo>&#8289;</m:mo>
                                       <m:mrow>
                                          <m:mo>(</m:mo>
                                          <m:mrow>
                                             <m:msub>
                                                <m:mi>F</m:mi>
                                                <m:mi>j</m:mi>
                                             </m:msub>
                                          </m:mrow>
                                          <m:mo>)</m:mo>
                                       </m:mrow>
                                    </m:mrow>
                                 </m:msqrt>
                              </m:mrow>
                           </m:mfrac>
                           <m:mo>.</m:mo>
                        </m:mrow>
                        <m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xI8qiVKYPFjYdHaVhbbf9v8qqaqFr0xc9vqFj0dXdbba91qpepeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaGaemOCai3aaSbaaSqaaiabdMgaPjabdQgaQbqabaGccqGH9aqpjuaGdaWcaaqaaiGbcogaJjabc+gaVjabcAha2naabmaabaGaemOray0aaSbaaeaacqWGPbqAaeqaaiabcYcaSiabdAeagnaaBaaabaGaemOAaOgabeaaaiaawIcacaGLPaaaaeaadaGcaaqaaiGbcAha2jabcggaHjabckhaYnaabmaabaGaemOray0aaSbaaeaacqWGPbqAaeqaaaGaayjkaiaawMcaaiGbcAha2jabcggaHjabckhaYnaabmaabaGaemOray0aaSbaaeaacqWGQbGAaeqaaaGaayjkaiaawMcaaaqabaaaaOGaeiOla4caaa@4F65@</m:annotation>
                     </m:semantics>
                  </m:math>
               </display-formula>
            </p>
            <p><it>F</it><sub><it>i </it></sub>and <it>F</it><sub><it>j </it></sub>indicate the folding state (i.e. <it>F </it>= 0 or 1) of residue <it>i </it>and <it>j</it>, respectively, across the most-probable states. An alternative strategy to reveal such linkages is to calculate the propagation of thermodynamic mutations made at a specific residue to all other residues <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. This is achieved by artificially increasing (~1 kcal/mol) the free energy of all states in which the residue of interest is folded in the representative ensemble, and then recalculating residue-specific stability constants. Such a virtual perturbation results in re-sorting of the probabilities associated with the states of the ensemble, and thus changes the value of residue-specific stability constants. Changes in the stability constants indicate how the perturbation made in a single residue propagates throughout the molecule. In principle, this information is similar to what the simple correlation analysis provides. Nevertheless, we performed both analyses, but provided only the results of the former as they indeed gave similar results.</p>
            <p>In the present analysis we used high resolution structures of G<sub>&#945;i1 </sub>and G<sub>&#945;t</sub>, which have been determined in their GDP- or GTP&#947;S-bound forms (PDB ID: <ext-link ext-link-type="pdb" ext-link-id="1BOF">1BOF</ext-link>, <ext-link ext-link-type="pdb" ext-link-id="1GIA">1GIA</ext-link> for G<sub>&#945;i1 </sub>and <ext-link ext-link-type="pdb" ext-link-id="1TAG">1TAG</ext-link>(A and B) for G<sub>&#945;t</sub>). We used the web server at UTMB (please see Availability &amp; requirements section below) for Monte-Carlo sampling of COREX states (with their calculated free energies) of G<sub>&#945;t</sub>. We performed high state density (i.e., exhaustive enumeration) COREX calculations only for G<sub>&#945;i1</sub>. In both cases a window size of 8 residues per folding blocks was used. In high-resolution correlation calculations we evaluated the top 5000 states of G<sub>&#945;i1</sub>.</p>
         </sec>
         <sec>
            <st>
               <p>Test statistics</p>
            </st>
            <p>In order to test whether an overlap between the two subsets of amino acid sites selected from the entire sequence was a result of pure coincidence, we used the following statistics (for the context of the problem, see results). The statistics is based on the probability distribution defined over the number of elements <it>x </it>in the intersection of two subsets selected randomly from a parent set. The null hypothesis is that the two subsets (whose sizes are given) are selected randomly. Given the size <b>N </b>of the parent set and the sizes of selected subsets <b>K </b>and <b>M</b>, the size of intersection <it>x </it>uniquely defines the partition as <b>N </b>= <it>n+k+m+x</it>, where <it>k </it>= <b>K</b>-<it>x, m </it>= <b>M</b>-<it>x, n </it>= <b>N-(</b><it>k</it>+<it>m</it>+<it>x</it>). The number of possible such partitions for a particular <it>x </it>is <it>q</it><sub><it>x </it></sub>= <b>N</b>!/(<it>n</it>!<it>k</it>!<it>m</it>!<it>x</it>!), and for all allowed values of <it>x</it>, from 0 to min(<b>K</b>,<b>M</b>), provided that <b>K</b>+<b>M </b>&#8804; <b>N</b>, the total number of possible partitions is <b>&#937; </b>= <b>&#931;</b><it>q</it><sub><it>x</it></sub>, {<it>x </it>= 0...min(<b>K</b>,<b>M</b>)}. Given the null hypothesis, all events in <b>&#937; </b>are equally likely. Hence, the probability of selecting <b>K </b>and <b>M </b>randomly with a particular number of overlaps <it>x </it>is <it>p</it><sub><it>x </it></sub>= <it>q</it><sub><it>x</it></sub>/<b>&#937;</b>. The statistics for the actually observed number of overlaps <it>x</it><sub>0 </sub>then reads as <it>p</it>{<it>x </it>&#8805; <it>x</it><sub>0</sub>} = <b>&#931;</b><it>p</it><sub><it>x</it></sub>, {<it>x </it>= <it>x</it><sub>0</sub>...min(<b>K</b>,<b>M</b>)}.</p>
            <p>We used <it>p</it>{<it>x </it>&#8805; <it>x</it><sub>0</sub>} &lt; 0.05 as a one-tail rejection criterion for the null hypothesis that the observed overlap is coincidental (or equivalently, the observed overlap is not significantly different from the expected overlap under the null hypothesis). When the alternative hypothesis is the hypothesis of interest, the left-hand tail of the same distribution can obviously be used to test whether the observed lack of overlap is coincidental or not.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>In this section we briefly summarize the findings. The results of the statistical coupling and corex analyses are given separately, and then a combined picture is provided at the end. The entire matrix of statistical couplings for further analysis is available [see Additional file <supplr sid="S2">2</supplr>].</p>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <text>
               <p>Statistical coupling matrix. Entire statistical coupling matrix for the G protein family is given as supporting data (as an MSExcel file) for further analysis.</p>
            </text>
            <file name="1472-6807-8-23-S2.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <sec>
            <st>
               <p>Statistical couplings in G protein family</p>
            </st>
            <p>Statistical coupling analysis in MSA provides a symmetrical matrix (361 &#215; 361) of coupling values for all possible pairs of individual sites in G protein as described in the method section (actual number of couplings is 361 &#215; 360/2 due to the symmetry of the matrix). However, in order to acquire a general picture of statistical coupling, we first evaluated the couplings between secondary structure blocks by averaging the coupling of individual sites in a given secondary structure element. Overall, 35 such elements as helices, sheets, loops, linkers etc. are identifiable in the molecule (13 &#945;-helices, 6 &#946;-sheets and 16 connecting loops and linkers). Hence, this procedure yielded a 35 &#215; 35 symmetric matrix, entries of which represent average pairwise coupling between the secondary structure blocks (net number of couplings is 35 &#215; 34/2). A hierarchical cluster analysis of the coupling matrix identified four clusters of elements with similar patterns and extents of couplings. The coupling matrix, rearranged according to these clusters, is shown in Figure <figr fid="F2">2A</figr>. Clusters #2 and #3 in Figure <figr fid="F2">2A</figr> consist of mainly uncoupled elements, although a mutually coupled spot is apparent in cluster #2. This spot, which is relatively isolated from the rest of the molecule in terms of statistical coupling, mainly consists of very conserved sites, whose functional importance is relatively well known (i.e. &#945;2-helix, &#946;4-sheet, and &#946;4-&#945;3 linker sequence, which correspond to switches II and III, respectively). Therefore, we did not further discuss these regions in the context of the present analysis. On the other hand, clusters #1 and #4 are formed by relatively less conserved sites. The elements in #4 are strongly, and those in #1 are moderately intra-coupled. The two clusters are also coupled to each other (see Figure <figr fid="F2">2A</figr>). Secondary structure elements that form these two clusters are shown on the structure of G&#945;i in Figure <figr fid="F2">2A</figr>. They cover mostly those domains that are known or predicted to be involved in the interaction of G&#945; with its partners, namely G&#946;&#947;, nucleotide, receptor, effector, RGS and AGS proteins (switches II and III are not present in this picture for reasons explained in the Methods section). Note that the sites in cluster 1 and 4 are located in the opposite faces of the molecule, including a part of the nucleotide binding cleft (&#946;6-&#945;5, &#946;5-&#945;G). Therefore, the observation that the elements of cluster 1 are mutually coupled to those of cluster 4 implies a statistical coupling between the two faces of the molecule. Incidentally, a mechanical coupling between the two faces of the protein has also been revealed by molecular dynamics simulations <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. These results imply and confirm that, on average, all the functional domains of G&#945; are statistically (or allosterically) intercoupled. Although this may seem to be trivial at first sight, a global intercoupling of these domains implies a more complex and flexible picture of allosteric regulation than a sequential interaction scheme would suggest. For example, statistical coupling between receptor, effector and G&#946;&#947; binding sites raises the possibility of direct allosteric interactions between these sites which may by-pass the nucleotide exchange.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Color coded statistical coupling matrices and mapping of selected sites on the structure of G&#945;i</p>
               </caption>
               <text>
                  <p><b>Color coded statistical coupling matrices and mapping of selected sites on the structure of G&#945;i</b>. (A) Pair wise couplings of 35 secondary structure elements. Pixels indicate the values of statistical couplings (<inline-formula><m:math name="1472-6807-8-23-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaahaaWcbeqaaiabikdaYaaaaaa@2FDD@</m:annotation></m:semantics></m:math></inline-formula>) on a rainbow color scale (red: full coupling, dark blue: no coupling). The secondary structure elements are sorted according to the cluster analysis and cluster numbers are indicated on the sides of the matrix. (B) The elements in clusters 1 and 4 are shown on the structure of G&#945;i (colors of the the numbers match the colors of the clusters in panel A). These clusters include following elements: N-terminal helix, &#946;1, &#946;2, &#946;3, &#945;1, linker I, &#945;A, &#945;A-&#945;B, &#945;D-&#945;E, &#945;F, linker II (switch I), &#945;3, &#945;3-&#946;5 (in #1), &#946;2-&#946;3 hairpin tip, &#946;5-&#945;G, &#945;4, &#946;6, &#946;6-&#945;5, &#945;5 (in #4). (C) The original coupling matrix (361 &#215; 361) is sorted according to the cluster analysis. Pixels indicate the values of statistical couplings (<inline-formula><m:math name="1472-6807-8-23-i3" xmlns:m="http://www.w3.org/1998/Math/MathML"><m:semantics><m:mrow><m:mtable><m:mtr><m:mtd><m:mrow/></m:mtd></m:mtr><m:mtr><m:mtd><m:mi>N</m:mi></m:mtd></m:mtr></m:mtable><m:msup><m:mi>&#967;</m:mi><m:mn>2</m:mn></m:msup></m:mrow><m:annotation encoding="MathType-MTEF">
 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacPC6xNi=xH8viVGI8Gi=hEeeu0xXdbba9frFj0xb9qqpG0dXdb9aspeI8k8fiI+fsY=rqGqVepae9pg0db9vqaiVgFr0xfr=xfr=xc9adbaqaaeGaciGaaiaabeqaaeqabiWaaaGcbaqbaeqabiqaaaqaaaqaaiabd6eaobaacqaHhpWydaahaaWcbeqaaiabikdaYaaaaaa@2FDD@</m:annotation></m:semantics></m:math></inline-formula>) as in panel A. (D) Members of the two relevant clusters, indicated as c1 and c2 in panel C, are shown on the structure of G&#945;i as orange strips and spheres corresponding to their &#945;-carbons. These sites correspond to (with SwissProt numbering of G&#945;i1): V34, V72, S75, I78, S80, F95, N149, T177, K180, V185, H188, F191, K192, L194, H195, S206, C214, A235, M240, H244, L248, S252, C254, N256, T260, K271, T284, Y296, E298, A300, A301, Q304, Q306, L310, N311, K312, R313, E318, I319, T321, T327, F334, D337, A338, T340. Coordinates of GTP&#947;S-bound G&#945;i (PDB ID: <ext-link ext-link-type="pdb" ext-link-id="1GIA">1GIA</ext-link>) is used to render 3D-structures. RasTop program v2.0.3 (Philippe Valadon, 2003) is used for 3D visualization.</p>
               </text>
               <graphic file="1472-6807-8-23-2"/>
            </fig>
            <p>It is clear that secondary structure elements do not necessarily coincide with energetic or functional units in proteins <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Therefore, we further refined the picture given in Figure <figr fid="F2">2A</figr> by applying a cluster analysis to the coupling matrix of individual sites. The analysis identified 12 clusters of alike coupling patterns. Figure <figr fid="F2">2B</figr> shows the rearrangement of the original matrix according to these clusters. Out of twelve, two clusters designated as c1 and c2 in Figure <figr fid="F2">2C</figr> are of particular interest. These two clusters together consist of 50 moderately or poorly conserved sites (average conservation in the MSA is 40%, ranging from 19% to 86%). Moreover, these sites are slightly, albeit significantly, coupled to a considerable portion of the molecule. The sites that form cluster c1 and c2 are shown on the structure of G&#945;i in Figure <figr fid="F2">2B</figr>. The pattern of coupling between secondary structural elements shown in Figure <figr fid="F2">2A</figr> is well represented by the distribution of the sites that belong to c1 and c2 (compare Figure <figr fid="F2">2A</figr> and Figure <figr fid="F2">2C</figr>). Two other highly coupled clusters in the coupling matrix are marked with yellow arrows in Figure <figr fid="F2">2C</figr>. Sites in these clusters mainly correspond to the highly inter-coupled spot in cluster #2 shown in Figure <figr fid="F2">2A</figr> (i.e. mainly those sites that form switches II and III).</p>
            <p>In summary, cluster analysis of the coupling matrix revealed a set of individual sites that were all strongly coupled to each other and moderately coupled to the rest of the molecule in the statistical sense. The overall coupling between the two domains shown in Figure <figr fid="F2">2A</figr> is well represented by these sites. Therefore, we considered these sites as the core of allosteric interactions in G&#945; in the statistical sense. This core, which constitutes only about 15% of the entire molecule, is designated for brevity as the "allosteric core cluster".</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of statistical thermodynamic ensemble of G&#945;i and G&#945;t</p>
            </st>
            <p>We evaluated equilibrium ensembles of G&#945;i and G&#945;t in their GDP- or GTP&#947;S-bound forms that were modeled as partially folded states by COREX algorithm. Ensemble average of fractional folding of G&#945;i or G&#945;t showed that ~4% of the residues were unfolded in equilibrium. However, GDP-bound forms of both G&#945;i and G&#945;t were slightly less folded compared to their GTP&#947;S-bound forms (average fractional unfolding was 5% vs. 3% in Gi, 4% vs. 3% in Gt for GDP- and GTP&#947;S-bound forms, respectively). This was reflected by increased average conformational entropy and solvent accessible (polar and apolar) surface area in the GDP-bound form of the proteins, indicating that the GDP-bound form of G&#945; is slightly more "flexible" than the GTP&#947;S-bound form in the thermodynamic sense.</p>
            <p>Evaluation of folding states of individual residues in the ensemble (residue-specific stability constants) showed that the local folding free energy was not distributed uniformly over the residues of G&#945;i or G&#945;t (Figure <figr fid="F3">3</figr>). The pattern of local free energy distribution was roughly similar in G&#945;i and G&#945;t, as one might expect from the structural similarity of the two proteins. In Figure <figr fid="F3">3</figr>, it is worth noting that some of the low stability regions coincide with switches I, II and III (indicated in the picture), and with the &#945;B-&#945;C loop, which has also been designated as a switch region in G&#945;i (switch-IV) that assumed nucleotide-dependent conformations.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Distribution of local folding free energies over the residues of G&#945;i or G&#945;t</p>
               </caption>
               <text>
                  <p><b>Distribution of local folding free energies over the residues of G&#945;i or G&#945;t</b>. Free energies are given as ln(k<sub>f</sub>) in the GDP or GTP&#947;S bound forms of the proteins as indicated. Secondary structure elements are indicated as strips next to the x-axis (black for helices, grey for &#946;-sheets) and named according to the convention given in figure 1. Known switch regions are also shown as grey strips numbered form I to IV on top of each picture. The two sequences are aligned and Swiss Prot residue numbering for G&#945;i is used in both cases.</p>
               </text>
               <graphic file="1472-6807-8-23-3"/>
            </fig>
            <p>It is evident from Figure <figr fid="F3">3</figr> that the local stability of many residues changes upon "nucleotide exchange", both in G&#945;i and G&#945;t. The patterns of these nucleotide-dependent changes (&#916;&#916;G) in G&#945;i and G&#945;t are comparable. The present analysis can predict nucleotide-dependent changes in the local stability of the regions that have already been identified as switch regions in crystallographic studies (see Figure <figr fid="F3">3</figr> for G&#945;t). Additional nucleotide-dependent changes that were not apparent in the crystal structure were also provided by the present analysis. The distribution of these nucleotide-dependent changes over the entire protein can be interpreted as the allosteric propagation of a free energy perturbation induced by nucleotide exchange at the guanine nucleotide binding site of G&#945; throughout the protein. Figure <figr fid="F4">4</figr> shows the map of this propagation on the structure of G&#945;i: Perturbation upon nucleotide exchange propagates nonuniformly to distant sites in the protein, and it covers a considerable part of the molecule (especially the GTPase domain). A qualitatively similar pattern of propagation was also observable in G&#945;t (not shown). In addition to the switch regions, the following regions are also affected by nucleotide exchange (Figure <figr fid="F4">4</figr>): N-terminal of &#945;G, &#945;G-&#945;4 loop, almost the entire sequence of &#945;4-&#946;6-&#945;5, &#946;2-&#946;3 hairpin, and obviously a large part of the nucleotide binding site. Among these regions, &#945;G, &#945;G-&#945;4 loop, &#945;4, &#945;4-&#946;6 loop and &#945;5 have all been suggested to be involved in the formation of the binding interface between G&#945; and receptor <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Changes in local folding free energy upon "nucleotide exchange" on G&#945;i</p>
               </caption>
               <text>
                  <p><b>Changes in local folding free energy upon "nucleotide exchange" on G&#945;i</b>. Changes in free energies upon nucleotide exchange in G&#945;i are calculated from data given in figure 3 and mapped onto the structure of G&#945;<sub>i1</sub>-GTP&#947;S (PDB ID: <ext-link ext-link-type="pdb" ext-link-id="1GIA">1GIA</ext-link>). Changes are shown on the color-coded trace-tube representation of the molecule (blue: no change &#8211; red: maximum change ~6 kcal/mol). Trace-tube radius changes proportionally to the magnitude of free energy change to help visual perception. RasTop program v2.0.3 (Philippe Valadon, 2003) is used for 3D visualization.</p>
               </text>
               <graphic file="1472-6807-8-23-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Relationship between evolutionary and thermodynamic coupling</p>
            </st>
            <p>We evaluated the relationship between the residues designated by the MSA analysis as "allosteric core cluster" and the residues identified by the Corex analysis to be involved in the propagation of nucleotide-induced thermodynamic effect. We used an arbitrary cut-off of 1.5 kcal/mol (comparable with an H-bond) for the local folding free energy difference (&#916;&#916;G) between GDP- and GTP&#947;S-bound G&#945;i to construct the set of residues that were affected by nucleotide exchange. The overlap between the two sets was remarkable (ca. 65% of the allosteric core intersected with the selected set of residues for which &#916;&#916;G>1.5 kcal/mol). However, the set of residues that changed local stability (above the cut-off) upon nucleotide exchange was quite large (about 1/3 of the entire molecule), which raised the possibility of a coincidental overlap between the two sets. We tested this possibility by using the test statistics described in the method section, which yielded a low probability of coincidence (p = 0.0086 for N = 322, K = 144, M = 45 and <it>x </it>= 28). Therefore, we interpret the observed overlap as significant, and suggest that the residues in the "allosteric core cluster" are very likely to be mediating allosteric propagation of energetic perturbations in G&#945;.</p>
            <p>Additionally, we searched for folding correlations as another potential indicator of allosteric coupling in the high resolution statistical ensemble generated by the COREX algorithm for G&#945;i. For this analysis, 5000 partially folded states of GDP-G&#945;i or GTP&#947;S-G&#945;i that possessed the highest probability of occurrence in the ensemble, and which accounted for a cumulative probability of greater than 99% were used. In this analysis nucleotide-dependent couplings in the molecule were disregarded, as the propagation of nucleotide-induced perturbation was already considered in the above analysis. In order to exclude the effect of nucleotide, only the correlations that are common to GDP- or GTP&#947;S-bound forms of G&#945;i were selected. Also excluded were high correlations between those residues that have a high level of stability in the representative ensemble. These correlations are statistically ambiguous for reasons similar to the correlations of highly conserved (invariant) residues in the statistical coupling analysis of MSA (as mentioned above).</p>
            <p>In order to filter out the correlations that were not common in GDP-G&#945;i and GTP&#947;S-G&#945;i, we constructed a correlation matrix whose entries were obtained through element-by-element multiplication of the two correlation matrices calculated for GDP-G&#945;i or GTP&#947;S-G&#945;i (Figure <figr fid="F5">5</figr>). In this correlation matrix, an unambiguous correlation group that corresponds to residues forming &#945;3-&#946;5 and &#945;4-&#946;6 loops was identifiable (indicated with arrows in Figure <figr fid="F5">5</figr>). The same two loops were also represented in the "allosteric core cluster" by 8 residues, and the test statistics yielded a <it>p </it>value equal to 0.0018 (for N = 322; total, K = 45; allosteric core, M = 19; correlation group, <it>x </it>= 8; overlap) indicating that the intersection between the correlation group and the "allosteric core cluster" could not be explained by pure coincidence. This again confirms the convergence of the two independent analyses, as far as the allosteric coupling is concerned. Nucleotide-independent coupling between &#945;3-&#946;5 and &#945;4-&#946;6 loops may indicate a nucleotide-independent allosteric interaction between receptor and effector on G&#945;, as the two loops have been implicated to be involved in effector <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and receptor <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp> interactions respectively. Such a nucleotide-independent coupling between effector and receptor sites on G&#945; may explain the experimental observation that &#946;2-adrenoceptors are able to activate adenylate cyclase through Gs independently from nucleotide-exchange <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Using the statistical ensembles of GDP-G&#945;i and GTP&#947;S-G&#945;i, we also performed a mutual perturbation response analysis based on thermodynamic "mutation" in the ensemble <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. This analysis also identified the same coupling group as the one we found in the correlation analysis above (data not shown).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Common folding correlations in GDP- or GTP&#947;S-bound G&#945;i</p>
               </caption>
               <text>
                  <p><b>Common folding correlations in GDP- or GTP&#947;S-bound G&#945;i</b>. Correlation matrix is calculated from top 5000 partially folded COREX states of G&#945;i-GDP and G&#945;i-GTP&#947;S and is shown on a rainbow color scale (blue to red correspond to 0 to 1). The matrix is constructed as explained in the text and the sites are sorted from N to C terminal (left to right, top to bottom). Only the lower triangle of the symmetric matrix is shown. Yellow arrows indicate regions of very high local stability. An unambiguous correlation group is indicated with white arrows and mapped on the structure of Gi&#945;1 (inset). RasTop program v2.0.3 (Philippe Valadon, 2003) is used for 3D visualization.</p>
               </text>
               <graphic file="1472-6807-8-23-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In the present study, we analyzed 1) statistical couplings in the MSA of heterotrimeric G protein family, and 2) statistical ensemble properties of G&#945;i and G&#945;t by using COREX algorithm to assess allosterically linked sites in this signal transducer family, which was not apparent in crystallographic studies. We showed that the residues captured by the two methods overlapped significantly. Considering that the information processed by the two methods was essentially different and independent, convergent results of the two analyses can be interpreted as they pointing (to a certain extent) to a common aspect of the protein sites (i.e. their involvement in allosteric communication in the protein). This is particularly important for the interpretation of statistical coupling in MSA, which is less obvious than the information provided by COREX analysis, as the covariance in MSA does not necessarily indicate allosteric coupling <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. In this sense, the COREX algorithm seems to be a very useful tool to complement and confirm the covariance information used for this particular purpose.</p>
         <p>The results of the entire analysis, along with information available from previous analyses concerning G protein structure and function, are summarized in Figure <figr fid="F6">6</figr> using the aligned sequences of prototypical G proteins. It is evident from Figure <figr fid="F6">6</figr> that the residues that change stability upon nucleotide exchange (green), those that are included in the "allosteric core" (gray) and those that are located in the known functional domains of G&#945; (represented by different colors and symbols) show a considerable co-aggregation along the G&#945; sequence. Highly conserved sites (especially those that are involved in nucleotide binding and those that form switch II) do not appear in the allosteric core due to the nature of the statistical coupling analysis as discussed above. This does not mean that these sites are not involved in allosteric coupling. Conserved sites are excluded simply due to the fact that statistical coupling analysis was not appropriate to diagnose the involvement of conserved sites in allosteric linkage. Nevertheless, it is evident from COREX analysis that most sites in the switch regions (including switch-II) are thermodynamically sensitive to the identity of bound nucleotide, which implies a thermodynamic coupling between the nucleotide binding site and the switch regions (see Figures <figr fid="F3">3</figr> and <figr fid="F6">6</figr>). Together with the observation that the folding correlation between the &#945;3-&#946;5 and &#945;4-&#946;6 loops revealed by COREX analysis was also represented significantly in the allosteric core (not indicated in Figure <figr fid="F6">6</figr>), we interpret the general picture in Figure <figr fid="F6">6</figr> as evidence for the existence of an allosteric network that connects all (known) functional domains of G&#945; to each other in all possible ways (as the residues in the allosteric core are all mutually coupled). Sites in the allosteric core that have not been associated so far with an obvious function (such as those in the helical domain or in the &#946;2-&#946;3 hairpin tip), may then be considered as potential candidates for the mediation of intramolecular allosteric communication in the G protein family.</p>
         <fig id="F6">
            <title>
               <p>Figure 6</p>
            </title>
            <caption>
               <p>Sites in the allosteric core cluster and residues that change stability upon nucleotide exchange</p>
            </caption>
            <text>
               <p><b>Sites in the allosteric core cluster and residues that change stability upon nucleotide exchange</b>. Sites that are found in MSA and COREX analyses, along with the sites involved in known functions, are shown together on the aligned sequences of indicated G&#945; proteins. Coding is shown on the picture.</p>
            </text>
            <graphic file="1472-6807-8-23-6"/>
         </fig>
         <p>Two interesting observations made in the present analyses may warrant further discussion. The first one is that residues that are thermodynamically affected by nucleotide exchange and those that are represented in the allosteric core overlap much better in the GTPase domain than in the helical domain (see Figure <figr fid="F6">6</figr>). In the helical domain, COREX analysis revealed a nucleotide-induced local thermodynamic effect on the residues of switch IV and the &#945;E-&#945;F sequence, whereas the statistical coupling analysis hit residues along the &#945;A helix. A trivial explanation for the observed mismatch can be that residues in the &#945;A helix have nothing to do with allosteric linkage, as statistical coupling analysis always has the potential of such bias when it is not supported by independent information. However, it is also possible that sites identified by COREX analysis as undergoing nucleotide-dependent changes in the helical domain (i.e. switch IV and the &#945;E-&#945;F sequence) are not linked to other functional domains in the GTPase region as they are not covered by the allosteric core where all residues are mutually coupled. Only by experiments can these two possibilities be distinguished. The second observation is that there is a nucleotide-independent linkage between the &#945;3-&#946;5 and &#945;4-&#946;6 loops, which was revealed by both COREX and statistical coupling analyses. This suggests a direct (nucleotide-independent) coupling between effector and receptor on G&#945;. This linkage may result in a nucleotide exchange-independent modulation of effector-G&#945; interaction by the receptor. As mentioned above, experiments have shown that this may indeed be the case for &#946;2-adrenoceptor-induced adenylyl cyclase activation through G&#945;s <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>.</p>
         <p>At this point, it may be worth mentioning the nature of allosteric linkage that was revealed by the present analyses. For practical purposes, the process of allosteric interactions can be seen from two different points of view: 1) The mechanistic approach, which sees the process as the propagation of a perturbation in one site to another via a series of structural distortions that extend from one site to the other, and 2) the thermodynamical or statistical mechanical approach, which considers the process as arising from the perturbation of the state distribution by ligand binding (or other effects such as mutations) in the equilibrium conformational ensemble of the protein. The latter view (to which the COREX approach belongs) can eventually represent the process only as an energetic linkage between different domains of the protein without referring to any specific mechanism whatsoever. The information provided by statistical coupling analysis in MSA also belongs to that second category. From this point of view, the effects of perturbations in one part of a protein on all other parts are due to a redistribution of the conformational ensemble <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B35">35</abbr></abbrgrp>. Whether a particular substructure is stabilized by the perturbation in a particular conformational state will depend on how those ensemble-averaged properties change as a result of the redistribution. Regions (or more specifically, particular conformations of a region) that are positively coupled will be stabilized as a result of the perturbation, and regions that are negatively coupled will be destabilized <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. In consequence, the pattern of allosteric coupling in the protein is determined by the energetic hierarchy of states (i.e. which states are the most stable and what structural and functional attributes those states possess). This may also have implications for the interpretation of mutational screening data. For example, any mutation that causes a change in the energetic hierarchy of states in the ensemble can change the allosteric coupling between two sites, even if the network of structural elements that physically connect the two sites is unaffected, which complicates the interpretation of the effect of mutation from a mechanistic point of view. Hence, the ensemble view does not imply any mechanical coupling between energetically (or statistically) linked sites. The information summarized in Figure <figr fid="F6">6</figr> should also be understood in this context.</p>
         <p>Another practical implication of the present analysis concerns the functional (or structural) importance that has generally been attributed to evolutionarily conserved sites in proteins. Unconserved sites have attracted less attention than the conserved for obvious reasons. However, the local conservation of the sites in the allosteric core indicated as gray-shaded residues in Figure <figr fid="F6">6</figr> was quite weak: Average frequency of most abundant amino acids at these sites was 40%, or the average entropy of their conservations was 0.33 on a normalized scale, where 0 and 1 correspond to none and full conservation, respectively. It follows that unconserved sites may play particularly important roles in the mediation of allosteric coupling. This assertion finds a firm basis when we consider the following observation made in the present analysis: We found that conservation of sites in the G protein family, on average, scales proportionally with the thermodynamic stability of these sites within G&#945;i (Figure <figr fid="F7">7</figr>). In other words, there is (on average) a positive association between evolutionary and thermodynamic stability in G proteins. On the other hand, it has also been demonstrated that residues in unstable regions are important for mediating allosteric coupling <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B35">35</abbr></abbrgrp>. Thus, it can be inferred that locally unconserved sites are indeed important for allosteric effects. This may explain why some allosterically important sites elude attention when we consider the fact that the literature of screening by point mutations has been mostly concentrated on conserved sites. However, this should not be understood as an implication that the conserved sites are generally unimportant in allosteric coupling.</p>
         <fig id="F7">
            <title>
               <p>Figure 7</p>
            </title>
            <caption>
               <p>Association between thermodynamic and evolutionary stability</p>
            </caption>
            <text>
               <p><b>Association between thermodynamic and evolutionary stability</b>. Residues are classified into four (equally-spaced) categories according to their conservation entropy in G&#945;-MSA (1 to 4 correspond to weakest to strongest conservation), and plotted against average residue-specific stability constants in the categories. Stability constants are calculated in GDP- or GTP&#947;S-bound forms of G&#945;i. Bars are standard deviation of means. There are approximately equal numbers of sites in each category (n &#8776; 80). Note that in average, there is a remarkable positive association between conservation of the residues and their local thermodynamic stabilities (local stabilities of the GDP-bound form are slightly but systematically higher than those of the GTP&#947;S-bound form).</p>
            </text>
            <graphic file="1472-6807-8-23-7"/>
         </fig>
         <p>Besides the above discussion, we believe that the observed association between evolutionary and thermodynamic stabilities is interesting in itself from a theoretical point of view, as there is no obvious reason for such an association: The former entity has to do with the variability of amino acid identity at a site in the G protein family, whereas the latter is associated with the folding stability of that site in a single member of the family (i.e. G&#945;i). Although a detailed discussion of the issue is beyond the scope of the present work, we may be permitted to speculate that such an association is expected (on average) in the following sense: If the protein has such a structural, energetic and functional design that a thermodynamic flexibility (rather than a particular fold) is required at a site (which is observed as a low stability of the site), then evolution may permit any amino acid at that site (which results in an unconserved site), as the folding state of that site is determined by the entire design of the protein, but not by the actual identity of the amino acid that occupies that site. The thermodynamic instability of a site may well be a part of the functional design of the protein, as discussed above. Among many other possible mechanisms, this one alone can lead to a tendency of association between evolutionary and thermodynamic stabilities of the sites. It would be interesting to investigate whether such an association exists in other protein families.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We propose that the sites indicated in Figure <figr fid="F6">6</figr> (i.e. those that are linked to nucleotide binding sites and those that are statistically coupled to each other) constitute an allosteric network in the energetic sense within the G protein. Recent progress in G protein expression, purification and spectroscopy <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>, together with molecular biological techniques, provide a potential tool to bring the information summarized in Figure <figr fid="F6">6</figr> to experimental test, which may eventually contribute to a more comprehensive understanding of the functional design of G proteins.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>GPCR: G protein coupled receptors; RGS: regulators of G protein signaling; AGS: activator of G protein signaling; MSA: multiple sequence alignment.</p>
      </sec>
      <sec>
         <st>
            <p>Availability &amp; requirements</p>
         </st>
         <p>Pfam database: <url>http://pfam.sanger.ac.uk</url></p>
         <p>BEST @ UTMD: <url>http://best.utmb.edu/BEST/</url></p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>KS performed statistical analyses and participated in the interpretation of data, TL and VJH performed high resolution COREX calculations for Gi&#945;, VJH helped in drafting the manuscript, &#214;U and OO designed the study, participated in all statistical analyses, interpreted the data, and wrote the paper.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors would like to thank to Tommaso Costa (ISS, Rome, Italy) and Francesca Fanelli (University of Modena, Italy) for their insightful comments and criticism, and Frederic Stark for editing the manuscript in terms of English. This work was supported, in part, by research grants from the Turkish Scientific and Technical Research Council 104s472 and 107s086.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Structural basis of function in heterotrimeric G proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Oldham</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Hamm</snm>
                  <fnm>HE</fnm>
               </au>
            </aug>
            <source>Q Rev Biophys</source>
            <pubdate>2006</pubdate>
            <volume>39</volume>
            <fpage>117</fpage>
            <lpage>166</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0033583506004306</pubid>
                  <pubid idtype="pmpid" link="fulltext">16923326</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Cellular Regulation of RGS proteins: Modulators and integrators of G protein signaling</p>
            </title>
            <aug>
               <au>
                  <snm>Hollinger</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hepler</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Pharmacol Rev</source>
            <pubdate>2002</pubdate>
            <volume>54</volume>
            <fpage>527</fpage>
            <lpage>559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1124/pr.54.3.527</pubid>
                  <pubid idtype="pmpid" link="fulltext">12223533</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>AGS proteins, GPR motifs and the signals processed</p>
            </title>
            <aug>
               <au>
                  <snm>Lanier</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>Biology of the Cell</source>
            <pubdate>2004</pubdate>
            <volume>96</volume>
            <fpage>369</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.biolcel.2004.03.012</pubid>
                  <pubid idtype="pmpid" link="fulltext">15207906</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The 2.2 A crystal structure of transducin-alpha complexed with GTP&#947;S</p>
            </title>
            <aug>
               <au>
                  <snm>Noel</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Hamm</snm>
                  <fnm>HE</fnm>
               </au>
               <au>
                  <snm>Sigler</snm>
                  <fnm>PB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1993</pubdate>
            <volume>366</volume>
            <fpage>654</fpage>
            <lpage>663</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/366654a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8259210</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Structures of active conformations of Gi&#945;1 and the mechanism of GTP hydrolysis</p>
            </title>
            <aug>
               <au>
                  <snm>Coleman</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Berghuis</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Linder</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Gilman</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Sprang</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1994</pubdate>
            <volume>269</volume>
            <fpage>1405</fpage>
            <lpage>1412</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1126/science.8073283</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Crystal structure of adenylyl cyclase activator Gs&#945;</p>
            </title>
            <aug>
               <au>
                  <snm>Sunahara</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Tesmer</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Gilman</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Sprang</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>278</volume>
            <fpage>1943</fpage>
            <lpage>1947</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.278.5345.1943</pubid>
                  <pubid idtype="pmpid" link="fulltext">9395396</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A new approach to producing functional G&#945; subunits yields the activated and deactivated structures of G&#945; (12/13) proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Kreutz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yau</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Nance</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Tanabe</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tesmer</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Kozasa</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2006</pubdate>
            <volume>45</volume>
            <fpage>167</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi051729t</pubid>
                  <pubid idtype="pmpid" link="fulltext">16388592</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Structure of G&#945;q-p63RhoGEF-RhoA complex reveals a pathway for the activation of RhoA by GPCRs</p>
            </title>
            <aug>
               <au>
                  <snm>Lutz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shankaranarayanan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coco</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ridilla</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nance</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Vettel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Baltus</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Evelyn</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Neubig</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Wieland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tesmer</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>318</volume>
            <fpage>1923</fpage>
            <lpage>1927</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1147554</pubid>
                  <pubid idtype="pmpid" link="fulltext">18096806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Structural determinants for GoLoco induced inhibition of nucleotide release by G&#945; subunits</p>
            </title>
            <aug>
               <au>
                  <snm>Kimple</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Kimple</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Betts</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sondek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Siderovski</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>416</volume>
            <fpage>878</fpage>
            <lpage>881</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/416878a</pubid>
                  <pubid idtype="pmpid" link="fulltext">11976690</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Structural determinants for regulation of phosphodiesterase by a G protein at 2.0 &#197;</p>
            </title>
            <aug>
               <au>
                  <snm>Slep</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Kercher</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Cowan</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Wensel</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Sigler</snm>
                  <fnm>PB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>1071</fpage>
            <lpage>1077</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35059138</pubid>
                  <pubid idtype="pmpid" link="fulltext">11234020</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Mutations at the domain interface of Gs&#945; impair receptor-mediated activation by altering receptor and guanine nucleotide binding</p>
            </title>
            <aug>
               <au>
                  <snm>Grishina</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Berlot</snm>
                  <fnm>CH</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1998</pubdate>
            <volume>273</volume>
            <fpage>15053</fpage>
            <lpage>15060</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.273.24.15053</pubid>
                  <pubid idtype="pmpid" link="fulltext">9614114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Crystalisation and preliminary crystalographic studies of G<sub>i&#945;1 </sub>and muatnts of G<sub>i&#945;1 </sub>in the GTP and GDP bound states</p>
            </title>
            <aug>
               <au>
                  <snm>Coleman</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mixon</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Linder</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Berghuis</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Gilman</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Sprang</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>238</volume>
            <fpage>630</fpage>
            <lpage>634</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1994.1320</pubid>
                  <pubid idtype="pmpid" link="fulltext">8176751</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Structural determinants for activation of the alpha-subunit of a heterotrimeric G protein</p>
            </title>
            <aug>
               <au>
                  <snm>Lambright</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Noel</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Hamm</snm>
                  <fnm>HE</fnm>
               </au>
               <au>
                  <snm>Sigler</snm>
                  <fnm>PB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1994</pubdate>
            <volume>369</volume>
            <fpage>621</fpage>
            <lpage>628</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/369621a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8208289</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The structure of G protein heterotrimer G<sub>i&#945;1 </sub>&#946;<sub>1&#947;2</sub></p>
            </title>
            <aug>
               <au>
                  <snm>Wall</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Coleman</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Iniguez-Llhui</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Posner</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Gilman</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Sprang</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1995</pubdate>
            <volume>83</volume>
            <fpage>1047</fpage>
            <lpage>1058</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(95)90220-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">8521505</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The 2.0 &#197; crystal sturucture of a heterotrimeric G protein</p>
            </title>
            <aug>
               <au>
                  <snm>Lambright</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Sondek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bohm</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Skiba</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Hamm</snm>
                  <fnm>HE</fnm>
               </au>
               <au>
                  <snm>Sigler</snm>
                  <fnm>PB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1996</pubdate>
            <volume>379</volume>
            <fpage>311</fpage>
            <lpage>319</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/379311a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8552184</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Allosterism without conformational change</p>
            </title>
            <aug>
               <au>
                  <snm>Cooper</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dryden</snm>
                  <fnm>DTF</fnm>
               </au>
            </aug>
            <source>Eur Biophys J</source>
            <pubdate>1982</pubdate>
            <volume>11</volume>
            <fpage>103</fpage>
            <lpage>109</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00276625</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Binding sites in E. Coli dihydrofolate reductase communicate by modulating the conformational ensemble</p>
            </title>
            <aug>
               <au>
                  <snm>Pan</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Hilser</snm>
                  <fnm>VJ</fnm>
               </au>
            </aug>
            <source>Proc Nat Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>12020</fpage>
            <lpage>12025</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17287</pubid>
                  <pubid idtype="pmpid" link="fulltext">11035796</pubid>
                  <pubid idtype="doi">10.1073/pnas.220240297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Strategy for analysing the co-operativity of intramolecular interactions in peptides and proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Horovitz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fersht</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>214</volume>
            <fpage>613</fpage>
            <lpage>617</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(90)90275-Q</pubid>
                  <pubid idtype="pmpid" link="fulltext">2388258</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Correlated mutations and residue contacts in proteins</p>
            </title>
            <aug>
               <au>
                  <snm>G&#246;bel</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Valenzia</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1994</pubdate>
            <volume>18</volume>
            <fpage>309</fpage>
            <lpage>317</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.340180402</pubid>
                  <pubid idtype="pmpid">8208723</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Can three-dimensional contacts in protein structures be predicted by analysis of correlated mutations?</p>
            </title>
            <aug>
               <au>
                  <snm>Shindyalov</snm>
                  <fnm>IN</fnm>
               </au>
               <au>
                  <snm>Kolchanov</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Protein Eng</source>
            <pubdate>1994</pubdate>
            <volume>7</volume>
            <fpage>349</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/protein/7.3.349</pubid>
                  <pubid idtype="pmpid" link="fulltext">8177884</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Effective use of sequence correlation and conservation in fold recognition</p>
            </title>
            <aug>
               <au>
                  <snm>Olmea</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Rost</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Valenzia</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>293</volume>
            <fpage>1221</fpage>
            <lpage>1239</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.3208</pubid>
                  <pubid idtype="pmpid" link="fulltext">10547297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Prediction of an inter-residue interaction in the chaperonin GroEL from multiple sequence alignment is confirmed by double-mutant cycle analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Horovitz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bochkareva</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Yifrach</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Girshovich</snm>
                  <fnm>AS</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1994</pubdate>
            <volume>238</volume>
            <fpage>133</fpage>
            <lpage>138</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1994.1275</pubid>
                  <pubid idtype="pmpid" link="fulltext">7908986</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Evolutionarily conserved pathways of energetic connectivity in protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Lockless</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Ranganathan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>295</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.286.5438.295</pubid>
                  <pubid idtype="pmpid" link="fulltext">10514373</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Natural-like function in WW domains</p>
            </title>
            <aug>
               <au>
                  <snm>Russ</snm>
                  <fnm>WP</fnm>
               </au>
               <au>
                  <snm>Lowery</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mishra</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Yaffe</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Ranganathan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>437</volume>
            <fpage>579</fpage>
            <lpage>583</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03990</pubid>
                  <pubid idtype="pmpid" link="fulltext">16177795</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A perturbation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Dekker</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Fodor</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aldrich</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Yellen</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>1565</fpage>
            <lpage>1572</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth128</pubid>
                  <pubid idtype="pmpid" link="fulltext">14962924</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Mapping pathways of allosteric communication in GroEL by analysis of correlated mutations</p>
            </title>
            <aug>
               <au>
                  <snm>Kass</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Horovitz</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2002</pubdate>
            <volume>48</volume>
            <fpage>611</fpage>
            <lpage>617</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.10180</pubid>
                  <pubid idtype="pmpid" link="fulltext">12211028</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Detection and reduction of evolutionary noise in correlated mutation analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Noivirt1</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Eisenstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horovitz</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Protein Engineering, Design &amp; Selection</source>
            <pubdate>2005</pubdate>
            <volume>18</volume>
            <fpage>247</fpage>
            <lpage>253</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/protein/gzi029</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>A perturbation-based method for calculating explicit likelihood of evolutionary co-variance in multiple sequence alignments</p>
            </title>
            <aug>
               <au>
                  <snm>S&#252;el</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Lockless</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Wall</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Ranganathan</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>2003</pubdate>
            <volume>10</volume>
            <fpage>59</fpage>
            <lpage>69</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsb881</pubid>
                  <pubid idtype="pmpid" link="fulltext">12483203</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Conformational substates in proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Frauenfelder</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Parak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Annu Rev Biophys Biophys Chem</source>
            <pubdate>1988</pubdate>
            <volume>17</volume>
            <fpage>451</fpage>
            <lpage>479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.bb.17.060188.002315</pubid>
                  <pubid idtype="pmpid">3293595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Str