<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1472-6807-8-19</ui>
   <ji>1472-6807</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Charge environments around phosphorylation sites in proteins</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Kitchen</snm>
               <fnm>James</fnm>
               <insr iid="I1"/>
               <email>j.kitchen@student.manchester.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Saunders</snm>
               <mi>E</mi>
               <fnm>Rebecca</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>becky.saunders@cancer.org.uk</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Warwicker</snm>
               <fnm>Jim</fnm>
               <insr iid="I1"/>
               <email>jim.warwicker@manchester.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Faculty of Life Sciences, University of Manchester, Michael Smith Building, Oxford Road, Manchester M13 9PT, UK</p>
            </ins>
            <ins id="I2">
               <p>High Throughput Screening, Cancer Research UK, 44 Lincoln's Inn Fields, London, WC2A 3PX, UK</p>
            </ins>
         </insg>
         <source>BMC Structural Biology</source>
         <issn>1472-6807</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>19</fpage>
         <url>http://www.biomedcentral.com/1472-6807/8/19</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18366741</pubid>
               <pubid idtype="doi">10.1186/1472-6807-8-19</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>26</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>25</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>25</day>
               <month>3</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Kitchen et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Phosphorylation is a central feature in many biological processes. Structural analyses have identified the importance of charge-charge interactions, for example mediating phosphorylation-driven allosteric change and protein binding to phosphopeptides. Here, we examine computationally the prevalence of charge stabilisation around phosphorylated sites in the structural database, through comparison with locations that are not phosphorylated in the same structures.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A significant fraction of phosphorylated sites appear to be electrostatically stabilised, largely through interaction with sidechains. Some examples of stabilisation across a subunit interface are evident from calculations with biological units. When considering the immediately surrounding environment, in many cases favourable interactions are only apparent after conformational change that accompanies phosphorylation. A simple calculation of potential interactions at longer-range, applied to non-phosphorylated structures, recovers the separation exhibited by phosphorylated structures. In a study of sites in the Phospho.ELM dataset, for which structural annotation is provided by non-phosphorylated proteins, there is little separation of the known phospho-acceptor sites relative to background, even using the wider interaction radius. However, there are differences in the distributions of patch polarity for acceptor and background sites in the Phospho.ELM dataset.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>In this study, an easy to implement procedure is developed that could contribute to the identification of phospho-acceptor sites associated with charge-charge interactions and conformational change. Since the method gives information about potential anchoring interactions subsequent to phosphorylation, it could be combined with simulations that probe conformational change. Our analysis of the Phospho.ELM dataset also shows evidence for mediation of phosphorylation effects through (i) conformational change associated with making a solvent inaccessible phospho-acceptor site accessible, and (ii) modulation of protein-protein interactions.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The phosphoproteome is a window into key biological processes, such as gene expression and cell growth, that are mediated by kinase and phosphatase activities <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. There is widespread interest in mediators of phosphorylation as therapeutic targets <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Studies of specific phosphorylation pathways are being supplemented with measurements from the developing field of phosphoproteomics <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, indicating that around one third of proteins may be phosphorylated in a given cellular snapshot <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. A number of general databases are being developed to collate these data, such as Phospho.ELM (originally PhosphoBase) <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, the Phosphorylation Site Database <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, and PhosphoSite <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Some resources are linked to certain streams of data, such as plant phosphorylation (PlantsP <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>) and specific phosphoproteome experiments (Phosida <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>). Other databases, such as dbPTM <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, collect phosphorylations along with other post-translational modifications.</p>
         <p>Phosphorylation site databases serve as a reference for construction of methods aimed at sequence-based prediction of phosphorylation sites. A range of machine-learning techniques have been used, including neural networks (NetPhos <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>), hidden Markov models (KinasePhos 1.0 <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>), support vector machines (PredPhospho and KinasePhos 2.0 <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>), Bayesian decision theory (PPSP <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>), and other forms of sequence profile analysis (Scansite 2.0, Phosite, GPS <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>). Such predictors perform particularly well when kinase specificities are taken into account <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, or when species-dependence is included <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Experimental studies of phosphorylation in yeast are consistent with strong preferences of kinases for specific substrates, but also indicate that predictions based on phosphorylation site patterns can lead to substantial over-prediction, and the conclusion that many sequences may not be accessible and that other regions may be involved in substrate recognition <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The current set of phosphorylation site databases and prediction servers has been recently reviewed <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>Whilst it is clear that amino acid sequence and kinase-specific features play key roles in many phosphorylations, a number of studies have also looked at 3D structural encoding of features around phosphorylation sites. Local structure around phosphorylated and non-phosphorylated sites was used, in the form of contact maps, to train a neural network <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. An algorithm to predict local structural segments from sequence has been used to enhance prediction accuracy for phosphorylation sites <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. A database of 3D structures of protein phosphorylation sites has been developed, including structural annotation of the Phospho.ELM database (Phospho3D <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>). It has been noticed that the amino acid sequence properties around phosphorylation sites are similar to those of intrinsically disordered protein regions, and this information used to construct a prediction algorithm, DISPHOS <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Further work has demonstrated that intrinsic disorder is a characteristic of partners that bind 14-3-3 proteins <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and that it may be a general feature of linear motifs that determine protein-protein interactions <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. A survey of proteins for which structures are known in the phosphorylated and non-phosphorylated forms found that the effect of phosphorylation is mediated in many of these cases by conformational change, and by alteration of interfacial properties in some others <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
         <p>Charge interactions have been investigated around subsets of phosphorylation sites, in phosphopeptide binding systems <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, at interfaces mediated by phosphorylated sites in general <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, and around activation loops containing phosphorylation sites <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. In all of these cases, favourable charge-charge interactions were identified at the phosphorylated sites. For the phosphopeptide binding systems this is the major feature in a prediction tool <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, whilst complementary charge interactions drive the successful modelling of changes in activation loop conformation <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         <p>Model peptide studies have shown that phosphorylation stabilises &#945;-helix formation when located at the amino terminus <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, consistent with calculations and the location of phosphate or sulphate ions in crystal structures <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Other work has highlighted the strength of non-covalent interactions that can be mediated by phosphorylated groups, salt-bridging in a model peptide <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, or mediating protein-protein interactions <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Calculations have been used to estimate the relative strengths of hydrogen bonds that involve phosphorylated amino acid sidechains <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. We have previously used Finite Difference Poisson-Boltzmann (FDPB) calculations of charge interactions in studies of peptide phosphorylation <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The current study uses phosphorylated proteins in the structural database to address the question of whether phosphorylation sites can be distinguished in terms of charge interactions, relative to their non-phosphorylated counterparts within the same proteins. We find a subset of stabilised phosphorylated sites, many of which are within kinases, and for which the favourable interaction is derived largely from interactions with basic sidechains. When studied in the context protein structures that are not phosphorylated at these sites, an indication of favourable interactions (and conformational change) can be recovered by analysing the charge environment beyond the nearest neighbours. In an extension to phosphorylation sites from the Phospho.ELM database <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> that can be structurally annotated, we find that these do not have as large a signal indicative of conformational change as those from the structural database. On the other hand, the Phospho.ELM sites are more buried on average than the set derived from phosphorylated protein structures. There is also an interesting difference in the distributions of patch polarity around Phospho.ELM sites and background sites.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Phosphorylated proteins in the structural database</p>
            </st>
            <p>Figure <figr fid="F1">1</figr> shows the scheme for extracting proteins phosphorylated on Serine (Sep), Threonine (Tpo) or Tyrosine (Ptr) sidechains, from the Protein Data Bank (PDB <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>). Phosphorylated sets were made by reduction according to sequence identity at 100%, 90% and 25% levels, using the PISCES server <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. In order to compare phosphorylated and non-phosphorylated sites, without model-building torsional angles for phosphate groups on the non-phosphorylated sites, phosphate groups were removed from the phosphorylated sites. Calculations of charge interactions were then made between a notional -2e charge on the OG, OG1 and OH atoms (Ser, Thr and Tyr), and the surrounding environment. Interactions were therefore equivalent in terms of acceptor site geometry for phosphorylated and non-phosphorylated sidechains.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Scheme for collecting proteins from the PDB, that are phosphorylated on serine, threonine or tyrosine sidechains, using phosphorylated residue identifiers, Sep, Tpo or Ptr, respectively</p>
               </caption>
               <text>
                  <p>
                     <b>Scheme for collecting proteins from the PDB, that are phosphorylated on serine, threonine or tyrosine sidechains, using phosphorylated residue identifiers, Sep, Tpo or Ptr, respectively.</b>
                  </p>
               </text>
               <graphic file="1472-6807-8-19-1"/>
            </fig>
            <p>Figure <figr fid="F2">2</figr> shows results for the 100% and 25% sequence identity-culled sets. Separation between the distributions is evident for both sets. Figure <figr fid="F3">3</figr> shows, for the non-redundant set at 25% sequence identity, the relative importance of interactions from surrounding mainchain and sidechain components. The major determinant of stabilisation is (basic) sidechain interaction with phosphorylated sites. These basic sidechains often, but not exclusively, occur within the local sequence neighbourhood of the phosphorylated site, as is the case for substrates of basophilic protein kinases. Favourable interaction with sidechains contrasts to the relatively well-known occurrences of sulphate ion stabilisation at an &#945;-helix terminus in protein crystal structures <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, and with the role of P-loop mainchain interaction with the phosphates of ATP and GTP <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. Mainchain interactions may be underestimated relative to sidechain interactions, since our model omits specific hydrogen-bonds associated with phosphate oxygens, concentrating on net charge.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Charge interactions of (Ser, Thr, Tyr) phospho-acceptor sites</p>
               </caption>
               <text>
                  <p><b>Charge interactions of (Ser, Thr, Tyr) phospho-acceptor sites.</b> Distributions are compared for sites that are phosphorylated in the PDB (phos) and those that are not phosphorylated (non-phos) in the same coordinate files. In these, and succeeding phos/non-phos plots, probability (p) values of the phos and non-phos distributions arising from the same underlying population were estimated with the Mann-Whitney U test. (a) 100%, no removal of sequences. (b) Sequences culled at 25% sequence identity.</p>
               </text>
               <graphic file="1472-6807-8-19-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Charge interactions for (Ser, Thr, Tyr) acceptor sides divided according to interactions with surrounding sidechain (a) or mainchain (b) groups</p>
               </caption>
               <text>
                  <p>
                     <b>Charge interactions for (Ser, Thr, Tyr) acceptor sides divided according to interactions with surrounding sidechain (a) or mainchain (b) groups.</b>
                  </p>
               </text>
               <graphic file="1472-6807-8-19-3"/>
            </fig>
            <p>We examined the effect of replacing single protein chains with biological units (i.e. physiologically relevant oligomers, Figure <figr fid="F4">4a</figr>). The results are similar to those for the single chains (Figure <figr fid="F3">3</figr>), but with greater separation of the phosphorylated and non-phosphorylated sets, indicative of some phosphate-mediated stabilisation between protein chains. Figure <figr fid="F4">4b</figr> shows an example, where the binding of phosphorylated serotonin N-acetyltransferase and isoform &#950; of 14-3-3, 1ib1 <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, is analagous to the phosphopeptide binding complexes, in which charge interactions were observed to be a key feature of binding <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. The majority of these systems were excluded through the use of a minimum polypeptide length of 40 amino acids. Other examples of stabilisation between protein chains also relate to segments that would be unstructured in the absence of phosphorylation: &#946;-catenin repeat interactions with both E-cadherin, 1i7w <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, and phosphorylated APC, 1v18 <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, and the forkhead-associated domain of Ki67 with a phosphorylated fragment of human nucleolar protein, 2aff <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Charge stabilisation of phosphorylated sites in biological units</p>
               </caption>
               <text>
                  <p><b>Charge stabilisation of phosphorylated sites in biological units.</b> (a) Phosphorylated and background sites in the dataset culled at 25% sequence identity. (b) Phosphorylation site stabilisation at a protein-protein interface. An extended segment of phosphorylated serotonin N-acetyltransferase (AANAT) (mainchain, cyan and TPO 31, pink) binds to 14-3-3 (orange mainchain), with surrounding residues colour-coded by type (blue basic, red acidic, yellow polar uncharged, grey non-polar).</p>
               </text>
               <graphic file="1472-6807-8-19-4"/>
            </fig>
            <p>Many of the structures studied show little stabilisation, or show destabilisation, in our estimate of charge-charge interactions. In a few instances, small overall interaction results from near cancelling of stabilising and destabilising contributions, as at the phosphorylated active site of phosphoglucomutase, 1vkl <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>, with opposing contributions from a metal ion and its ligands. In a larger number of cases, the surrounding charge interactions are generally small. Some of these at least may be indicative of phosphorylation mediating differences in the formation of higher order structures or complexes that are not included in the current analysis. Examples are the surface properties and fibre formation in pilin, 2pil <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, and the phosphorylation of nucleotide-binding domain 1 and potential interactions with other components of the cystic fibrosis transmembrane conductance regulator, 1r0z <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. As more structures become available for complexes, and using comparative models built on protein-protein interaction templates <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, a more thorough study of the extent to which phosphorylation mediates interfacial properties will be possible.</p>
            <p>For the PDB set, non-redundant at 25% sequence identity, and using a threshold interaction of -6 kJ/mole, the current method gives (sensitivity, specificity) of (0.48, 0.90), compared with (0.52, 0.67) for NetPhos <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and (0.65, 0.79) for DISPHOS <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, using the default thresholds.</p>
         </sec>
         <sec>
            <st>
               <p>Phosphorylation sites in Phospho.ELM</p>
            </st>
            <p>A scheme was employed to structurally annotate phosphorylated sequences contained in the Phospho.ELM dataset (see Methods). Figure <figr fid="F5">5a</figr> shows that little separation is apparent between the phosphorylated sites and other Ser, Thr, Tyr locations in the same set. Proteins identified in this structural annotation are generally non-phosphorylated. Considering possible structural adaptation to phosphorylation, the most simple to model would be movement of sidechains on a fixed backbone framework. Thus, in the absence of the negative charge of a phosphorylated group binding to a basic patch, an alternative acidic sidechain could occupy a similar location. In order to test whether this simple picture would aid distinction of phospho-acceptor sites, we calculated interactions with positively-charged residues only. Again no separation is apparent (not shown). In terms of (sensitivity, specificity), the current method, using the -6 kJ/mole threshold, gives (0.15, 0.92), compared with (0.66, 0.68) for NetPhos <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and (0.64, 0.77) for DISPHOS <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. These results emphasise how poorly the present charge-based algorithm performs for the Phospho.ELM set, in contrast to the PDB set, leading us to consider other features that could mediate the effects of phosphorylation.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Calculations with structurally annotated phosphorylation sites from the Phospho.ELM dataset</p>
               </caption>
               <text>
                  <p><b>Calculations with structurally annotated phosphorylation sites from the Phospho.</b>ELM dataset. (a) Summed charge interactions. (b) Solvent accessibility for phospho-acceptor atoms in the 25% sequence identity culled PDB set, in the Phospho.ELM set, and in a control set (other Ser, Thr, Tyr in the 25% PDB set).</p>
               </text>
               <graphic file="1472-6807-8-19-5"/>
            </fig>
            <p>Solvent accessibility of phosphorylation sites could be an important factor, although conformational change will again play a role. Figure <figr fid="F5">5b</figr> shows that phosphorylated sites in the PDB set that is non-redundant at 25% sequence identity, are on average more accessible than the control set. Interestingly, phosphorylated sites from the structurally annotated Phospho.ELM set appear to be more buried, on average, than do those from the 25% PDB set. Both sets of calculations displayed in Figure <figr fid="F5">5b</figr> were made with single protein chains, rather than biological units, and should therefore be comparable.</p>
            <p>We examined some of the most buried sites in the structurally annotated Phospho.ELM dataset. In order to probe the burial in more detail, a burial depth was calculated from exterior solvent (see the Methods section), for the least solvent accessible sites (0 &#8211; 5 &#197;<sup>2 </sup>ASA, Figure <figr fid="F6">6</figr>). This is a grid-based method, with a 2 &#197; burial simply due to van der Waals radius, and about +/- 0.5 &#197; due to the grid spacing. We find that many sites lie at a burial depth of around 2 &#197;, effectively at the exterior surface. Even at larger burial depth, we find sites with some solvent accessibility, i.e. solvent pockets within a protein. Of particular interest are the sites that are most buried, according to our algorithm, with least accessibility. Amongst such sites, at zero calculated solvent accessibility, we see known examples of phosphorylation control of function, e.g. Y15 in cyclin-dependent kinase, 1unl, shown in Figure <figr fid="F6">6</figr>, <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>; Y19 in the adipocyte lipid-binding protein, 1a18, <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>; and S579 in human plasmin, 1bml, <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. In each of these cases the structure from our annotation procedure is inconsistent with phosphorylation (zero solvent accessibility and substantial burial), indicative of conformational change. Although the effects of phosphorylation in these Phospho.ELM entries are generally well-characterised, this work demonstrates the possibility for prediction of conformational change in less well-studied examples, such as those emerging from mass spectrometry data.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Burial depth (from exterior solvent surface) of Phospho.ELM sites that have low solvent accessibility</p>
               </caption>
               <text>
                  <p><b>Burial depth (from exterior solvent surface) of Phospho.ELM sites that have low solvent accessibility.</b> Scatter plot of burial depth (see text) against accessibility for all sites with ASA &lt; 5 &#197;<sup>2</sup>, with a surface depiction of the environment around a buried tyrosine phospho-acceptor site in PDB file 1unl.</p>
               </text>
               <graphic file="1472-6807-8-19-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phosphorylated and non-phosphorylated structures</p>
            </st>
            <p>Figure <figr fid="F7">7</figr> presents a comparison of sites from the PDB set, for which resolved structures exist in both phosphorylated and non-phosphorylated states. It is evident that whilst there is separation of the phosphorylated sites (from other Ser, Thr, Tyr), this is not the case for the equivalent non-phosphorylated sites. Thus conformational change between ordered segments leads to charge stabilisation in these systems. This result is consistent with our study of the Phospho.ELM dataset, where known phospho-acceptor sites do not exhibit charge stabilisation when studied in the context of non-phosphorylated structures.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Comparison of structure-based calculations for proteins with phosphorylated and non-phosphorylated structures</p>
               </caption>
               <text>
                  <p><b>Comparison of structure-based calculations for proteins with phosphorylated and non-phosphorylated structures.</b> (a) Charge interactions of the phos and non-phos sites in coordinate files that are phosphorylated. (b) The equivalent sites to panel (a), in coordinate files that are not phosphorylated.</p>
               </text>
               <graphic file="1472-6807-8-19-7"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Potential peaks around phospho-acceptor sites</p>
            </st>
            <p>Since mainchain conformational change plays a role in some phosphorylations, we adapted previous work that studied electrostatic potential values on a solvent accessible shell <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. In this case we are looking for the most favourable interacting site (highest positive potential) within a given radius of each site (centred on Ser OG, Thr OG1 or Tyr OH). Figure <figr fid="F8">8</figr> shows this analysis for the non-phosphorylated coordinate sets of the previous section, at three radii. At 30 &#197; radius, the interaction values are generally favourable and there is little separation between the real and background sets, since the large search radius links all Ser, Thr, Tyr sites to favourable patches for charge interaction. Of more interest are the smaller radii. Some separation is apparent at 5 &#197; radius, which increases at 10 &#197; radius, and is largely associated with the proteins (including kinases) that underpin separation in earlier Figures (e.g. Figure <figr fid="F7">7a</figr>). The procedure of electrostatic peak finding within a sphere around a phospho-acceptor site, but for non-phosphorylated molecules, largely recovers the result of calculations with phosphorylated coordinate sets. This occurs without knowledge of the loops that undergo conformational change. The method could therefore aid identification of such regions, and allied with simulation methods <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, provide structural models for conformational change that is coupled to phosphorylation.</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Searching for favourable interactions around phospho-acceptor sites in proteins that are not phosphorylated</p>
               </caption>
               <text>
                  <p><b>Searching for favourable interactions around phospho-acceptor sites in proteins that are not phosphorylated.</b> Calculations were made with the non-phosphorylated set of proteins, from the phosphorylated/non-phosphorylated structure pairs, many of which undergo conformational change upon phosphorylation. A search was made, within a given radius, for the most positive potential peak around each site (real/phos or background/non-phos). Interaction energy is given for a single unit negative charge in the positive potential field. (a) Radius = 5 &#197;. (b) Radius = 10 &#197;. (c) Radius = 30 &#197;.</p>
               </text>
               <graphic file="1472-6807-8-19-8"/>
            </fig>
            <p>The same, patch-based, calculation was made for the 101 members of the Phospho.ELM set that could be structurally annotated (Figure <figr fid="F9">9</figr>). Separation of the Ser, Thr, Tyr phos from non-phos sets is small at both the radii shown.</p>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>Favourable interactions around Phospho.ELM sites compared with background sites</p>
               </caption>
               <text>
                  <p><b>Favourable interactions around Phospho.ELM sites compared with background sites.</b> Structural annotation was made with non-phosphorylated coordinate files. (a) Maximal charge interactions around structurally annotated Ser, Thr and Tyr sites at 5 &#197; radius. (b) Maximal interactions around Ser, Thr, Tyr sites at 10 &#197; radius.</p>
               </text>
               <graphic file="1472-6807-8-19-9"/>
            </fig>
            <p>The patch search method appears able to identify some regions that are susceptible to phosphorylation/charge-driven conformational change, and will be studied in more detail in future work, both with conformationally characterised systems and with high-throughput proteomics data. However, there also many proteins for which charge-charge effects are not clearly apparent in transducing phosphorylation activity, particularly in the Phospho.ELM set. In some cases, it is possible that low solvent accessibility is important (Figure <figr fid="F5">5b</figr>), and in others complexation may play a role. In a preliminary analysis of this last factor, we calculated the non-polar percentage of ASA in patches around phos and non-phos sites for the structurally annotated Phospho.ELM dataset (Figure <figr fid="F10">10</figr>). The distributions are different, with a bimodal shape for the phos sites and more unimodal for the non-phos subset. It is not clear at this stage what underpins this difference, but one possibility is that the effects of phosphorylation for many of these proteins could be mediated via interfacial regions in protein-protein interactions.</p>
            <fig id="F10">
               <title>
                  <p>Figure 10</p>
               </title>
               <caption>
                  <p>Polarity of patches around phos and non-phos sites in the structurally annotated Phospho.ELM dataset</p>
               </caption>
               <text>
                  <p><b>Polarity of patches around phos and non-phos sites in the structurally annotated Phospho.</b>ELM dataset. Distributions are plotted for surface patches consisting of residues of ASA > 5 &#197;<sup>2</sup>, within 10 &#197; of a central Ser, Thr, Tyr.</p>
               </text>
               <graphic file="1472-6807-8-19-10"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have studied the charge environments around phospho-acceptor sites in phosphorylated and non-phosphorylated protein structures, and extended the study to structurally annotated sites in Phospho.ELM. Our results show that only a subset of phosphorylated structures (around a third in our analysis of the PDB) appear to be making use of particularly favourable charge interactions at the site of phosphorylation, when considered in the context of biological unit. Several of these examples are protein kinases with phosphorylated activation loops. When non-phosphorylated structures are studied, much of the signal of excess favourable charge interactions relative to a control set is lost, consistent with a degree of conformational change (e.g. the activation loop of protein kinases). We introduce a simple algorithm, using the charge/electrostatic potential neighbourhood, to recover the signal for non-phosphorylated structures. This method has the potential to aid studies of conformational change associated with phosphorylation, since it predicts the target area for phosphate binding. The result is consistent with molecular simulations in which charge interaction was seen to be the key factor in establishing an anchor point for phosphorylated residues <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         <p>For a set of phosphorylated sites structurally annotated from Phospho.ELM, we find little signal for charge stabilisation around phospho-acceptor sites, even when the neighbourhood algorithm is applied. This result suggests that any phosphorylation-induced conformational change is relatively complex or that the effects of phosphorylation are mediated in other ways, for many of the Phospho.ELM proteins. We find indirect evidence in both of these directions. The Phospho.ELM set has a tendency towards buried sites (in non-phosphorylated forms), possibly indicative of substantial conformational change. A preliminary analysis of patch surface polarity in the Phospho.ELM set shows a different distribution for phos sites relative to non-phos sites, suggesting that complexation could play a role i.e. where phosphorylation alters the propensity for forming an interface.</p>
         <p>Further work will study phosphorylation sites in the context of protein-protein interaction databases (e.g. PIBASE, <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>). There are also issues of detailed modelling, such as inclusion of phosphate group torsion angles, emphasising phosphate interactions with mainchain groups, and adding Asp and His sidechains as acceptors. Our longer term aims are to probe molecular mechanisms by which phosphorylation mediates biological effects, and improve predictive algorithms that are based on sequence properties, using for example kinase specificity or disorder <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B27">27</abbr></abbrgrp>. The issue of disorder is of particular interest since our structural annotation of Phospho.ELM loses many phosphorylated sites as unstructured regions within known folds. It will be of interest to extend our patch algorithm to such examples, given that we also find examples in the structural database where sites that are ordered when phosphorylated, are unstructured when non-phosphorylated.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>PDB datasets</p>
            </st>
            <p>Structures containing as ligand type Sep, Tpo or Ptr representing phosphoserine, phosphothreonine and phosphotyrosine respectively, were identified in the PDB <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Single chains were selected to remove redundant copies of phosphorylated sites within a coordinate file. Redundancy between PDB sets was investigated using the PISCES server <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, selecting crystal structures with a minimum polypeptide length of 40 amino acids, and resolution better than 3.0 &#197;. We carried through sets of phosphorylation sites at 100%, 90% and 25% non-redundancy, based on sequence identity. These sets represent the complete phosphorylation data (100%), removal of near-identical copies (90%), and removal of copies that are clearly homologous (25%). This scheme is shown in Figure <figr fid="F1">1</figr>, with the numbers of phosphorylated sites retrieved. In addition to the phosphorylated sites, other Ser, Thr and Tyr residues in the same proteins constituted a non-phosphorylated control set, which was submitted to the same calculations. Generally for any set of PDB files containing phosphorylated sites, the control set was derived from that set of proteins, although it is possible that some of these other sites could be phosphorylated under different conditions. For calculations with biological units, the relevant coordinate files were obtained from the PDB.</p>
            <p>An additional dataset was formed by searching for non-phosphorylated structures that are related PDB entries to the phosphorylated structures, with the caveat that the site of phosphorylation should be ordered in both cases. This dataset was derived from the 90% sequence identity culled set of phosphorylated coordinate files.</p>
         </sec>
         <sec>
            <st>
               <p>Phospho.ELM dataset</p>
            </st>
            <p>Phosphorylation sites from the Phospho.ELM database were also studied, using version 5.0, released May 2006 <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The approximately 7000 instances of phosphorylated serine, threonine and tyrosine were reduced to just 118 PDB files, with our structural annotation procedure.</p>
         </sec>
         <sec>
            <st>
               <p>Structural annotation of sequences</p>
            </st>
            <p>The UNIPROT protein sequence database was used to underpin the structural annotation process <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Phosphoprotein accession numbers from Phospho.ELM were matched to UNIPROT, and then to 3D structures where available. Phosphopeptide sequences extracted from Phospho.ELM were matched to the amino acid sequence present in the coordinate section of the PDB file for verification. Where a phosphopeptide sequence matches to more than one PDB file, the first match was used.</p>
         </sec>
         <sec>
            <st>
               <p>Charge calculations</p>
            </st>
            <p>Charge interactions are compared at phosphorylated and non-phosphorylated Ser, Thr, Tyr sites. This is achieved in a model where a notional -2e charge is placed on the OG atom of Ser, OG1 of Thr and OH of Tyr. For this purpose, the phosphate group is removed from any phosphorylated residue, allowing direct comparison with non-phosphorylated residues. There are number of approximations in this process, notably the lack of phosphate group modelling and the -2e charge. The pKas of phosphorylated sidechains for the -1e to -2e transition suggest that the -2e state will predominate at neutral pH <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Nevertheless, our focus is on comparison of phosphorylated and non-phosphorylated residues, rather than absolute values. The same rationale underlies the omission of a phosphate group model. To maintain direct comparisons, without removing phosphates, these groups would need to be modelled on non-phosphorylated residues, and results averaged over (or selected from) torsional variation.</p>
            <p>The Debye-H&#252;ckel (DH) model for charge-charge interactions is used, with a relative dielectric (78.4) corresponding to water, and an ionic strength of 0.15 Molar. This model does not account for charge interactions with the less polarisable protein, but it generally works well for charge-charge interactions at a protein surface <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, which are the current target. We compared the DH model with Finite Difference Poisson-Boltzmann calculations of charge-charge interactions. The relative distributions of charge interaction energies for phosphorylated and non-phosporylated sites were similar to those for the DH model (not shown).</p>
            <p>Distributions are displayed as histograms of interaction energy, summed over all phosphorylated (phos) or non-phosphorylated (non-phos) residues in the coordinate files of a dataset. Percentages are plotted, facilitating comparison of the unequal-sized datasets. The Mann-Whitney U test is applied to the series of interaction values for each phos and non-phos set. The probability of occurrence of these series, if there is no difference in the underlying distributions, is given for each phos/non-phos plot. A significant difference is generally inferred at the 5% (p &lt; 0.05) level.</p>
         </sec>
         <sec>
            <st>
               <p>Solvent accessibility and burial depth</p>
            </st>
            <p>Solvent accessibility was calculated for phospho-acceptor atoms, OG (Ser), OG1 (Thr), OH (Tyr), using an in-house program, and the data were binned for histogram display. Burial depth from the exterior solvent surface was calculated with a grid-based algorithm. Molecule points are assigned on the grid for regions that are not solvent accessible. Then starting from outside of the molecule, a grid of accessibility to the exterior solvent is constructed. The burial distance (to the exterior surface) is calculated between each atom and the nearest exterior accessible grid point. A value of around 2 &#197; represents exterior surface located, taking into account the united atom van der Waals radius set.</p>
         </sec>
         <sec>
            <st>
               <p>Polarity of surface patches in the Phospho.ELM dataset</p>
            </st>
            <p>The surface polarity around Ser, Thr and Tyr residues was calculated for amino acids within a 10 &#197; radius (based on C<sub>&#945; </sub>atoms), and with ASA > 5 &#197;<sup>2</sup>. Distributions of non-polar percentage (i.e. 100 &#215; non-polar area/total area) were compared between phos and non-phos sites.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project name: Phospatch</p>
         <p>Project home page: <url>http://www.bioinf.manchester.ac.uk/phospatch/</url></p>
         <p>The software used in this study is also available for download from: <url>http://personalpages.manchester.ac.uk/staff/j.warwicker/resources.html</url></p>
         <p>Operating system(s): Linux</p>
         <p>Programming language: Perl, Fortran</p>
         <p>License: GNU GPL</p>
         <p>No additional restrictions for non-academic users.</p>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>DH, Debye-H&#369;ckel; FDPB, Finite Difference Poisson-Boltzmann; PDB, Protein Data Bank; ASA, Accessible Surface Area; ATP, Adenosine 5'-triphosphate; GTP, Guanosine 5'-triphosphate; 3D, 3-Dimensional.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>RES and JW carried out an initial study. JK and JW planned and performed subsequent work, and wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was funded by a UK BBSRC studentship to JK. We thank Mr Stephen Hall and Dr Nick Gresham for technical support.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Protein post-translational modification: the chemistry of proteome diversifications</p>
            </title>
            <aug>
               <au>
                  <snm>Walsh</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Garneau-Tsodikova</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gatto</snm>
                  <fnm>GJ</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Angew. Chem Int Ed Engl</source>
            <pubdate>2005</pubdate>
            <volume>44</volume>
            <fpage>7342</fpage>
            <lpage>7372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/anie.200501023</pubid>
                  <pubid idtype="pmpid" link="fulltext">16267872</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Emerging applications for phospho-proteomics in cancer molecular therapeutics</p>
            </title>
            <aug>
               <au>
                  <snm>Moran</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Tong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ewing</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>2006</pubdate>
            <volume>1766</volume>
            <fpage>230</fpage>
            <lpage>241</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16889898</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Exploring the phosphoproteome with mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Peters</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Brock</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ficarro</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Mini Rev Med Chem</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>313</fpage>
            <lpage>324</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2174/1389557043487330</pubid>
                  <pubid idtype="pmpid" link="fulltext">15032677</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Charging it up: global analysis of protein phosphorylation</p>
            </title>
            <aug>
               <au>
                  <snm>Ptacek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>545</fpage>
            <lpage>554</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.08.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16908088</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Analysis of protein phosphorylation using mass spectrometry: deciphering the phosphoproteome</p>
            </title>
            <aug>
               <au>
                  <snm>Mann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ong</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Gronborg</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Steen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>ON</fnm>
               </au>
               <au>
                  <snm>Pandey</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Biotechnol</source>
            <pubdate>2002</pubdate>
            <volume>20</volume>
            <fpage>261</fpage>
            <lpage>268</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0167-7799(02)01944-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">12007495</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Phospho.ELM: A database of experimentally verified phosphorylation sites in eukaryotic proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Diella</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gemund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Linding</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Via</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kuster</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sicheritz-Ponten</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Blom</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>79</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">449700</pubid>
                  <pubid idtype="pmpid" link="fulltext">15212693</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-79</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The Phosphorylation Site Database: A guide to the serine-, threonine-, and/or tyrosine-phosphorylated proteins in prokaryotic organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Wurgler-Murphy</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Kennelly</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Proteomics</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>1562</fpage>
            <lpage>1570</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/pmic.200300711</pubid>
                  <pubid idtype="pmpid" link="fulltext">15174126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>PhosphoSite: A bioinformatics resource dedicated to physiological protein phosphorylation</p>
            </title>
            <aug>
               <au>
                  <snm>Hornbeck</snm>
                  <fnm>PV</fnm>
               </au>
               <au>
                  <snm>Chabra</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kornhauser</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Skrzypek</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Proteomics</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>1551</fpage>
            <lpage>1561</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/pmic.200300772</pubid>
                  <pubid idtype="pmpid" link="fulltext">15174125</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Phosphoproteomics of the <it>Arabidopsis </it>plasma membrane and a new phosphorylation site database</p>
            </title>
            <aug>
               <au>
                  <snm>N&#252;hse</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Stensballe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>ON</fnm>
               </au>
               <au>
                  <snm>Peck</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>2394</fpage>
            <lpage>2405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">520941</pubid>
                  <pubid idtype="pmpid" link="fulltext">15308754</pubid>
                  <pubid idtype="doi">10.1105/tpc.104.023150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Global, <it>in vivo</it>, and site-specific phosphorylation dynamics in signaling networks</p>
            </title>
            <aug>
               <au>
                  <snm>Olsen</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Blagoev</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gnad</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Macek</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mortensen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>127</volume>
            <fpage>635</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.09.026</pubid>
                  <pubid idtype="pmpid" link="fulltext">17081983</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>dbPTM: an information repository of protein post-translational modification</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Hung</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>TH</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D622</fpage>
            <lpage>D627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347446</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381945</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj083</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>NetphosYeast: prediction of protein phosphorylation sites in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Ingrell</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>ON</fnm>
               </au>
               <au>
                  <snm>Blom</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>895</fpage>
            <lpage>897</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm020</pubid>
                  <pubid idtype="pmpid" link="fulltext">17282998</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Sequence and structure-based prediction of eukaryotic protein phosphorylation sites</p>
            </title>
            <aug>
               <au>
                  <snm>Blom</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gammeltoft</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>294</volume>
            <fpage>1351</fpage>
            <lpage>1362</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.3310</pubid>
                  <pubid idtype="pmpid" link="fulltext">10600390</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>KinasePhos: a web tool for identifying protein kinase-specific phosphorylation sites</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Tzeng</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Horng</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W226</fpage>
            <lpage>W229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160232</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980458</pubid>
                  <pubid idtype="doi">10.1093/nar/gki471</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Predicting the phosphorylation sites using hidden Markov models and machine learning methods</p>
            </title>
            <aug>
               <au>
                  <snm>Senawongse</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dalby</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZR</fnm>
               </au>
            </aug>
            <source>J Chem Inf Model</source>
            <pubdate>2005</pubdate>
            <volume>45</volume>
            <fpage>1147</fpage>
            <lpage>1152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ci050047+</pubid>
                  <pubid idtype="pmpid" link="fulltext">16045309</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Prediction of phosphorylation sites using SVMs</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kimm</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Koh</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>3179</fpage>
            <lpage>3184</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth382</pubid>
                  <pubid idtype="pmpid" link="fulltext">15231530</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Molecular modeling of phosphorylation sites in proteins using a database of structure segments</p>
            </title>
            <aug>
               <au>
                  <snm>Plewczynski</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jaroszewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Godzik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kloczkowski</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rychlewski</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>J Mol Model</source>
            <pubdate>2005</pubdate>
            <volume>11</volume>
            <fpage>431</fpage>
            <lpage>438</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00894-005-0235-z</pubid>
                  <pubid idtype="pmpid">16094535</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>KinasePhos 2.0: a web server for identifying protein kinase-specific phosphorylation sites based on sequences and coupling patterns</p>
            </title>
            <aug>
               <au>
                  <snm>Wong</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Ko</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Hwang</snm>
                  <fnm>JK</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>W588</fpage>
            <lpage>W594</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1933228</pubid>
                  <pubid idtype="pmpid" link="fulltext">17517770</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm322</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>PPSP: prediction of PK-specific phosphorylation sites with Bayesian decision theory</p>
            </title>
            <aug>
               <au>
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>163</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1435943</pubid>
                  <pubid idtype="pmpid" link="fulltext">16549034</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-163</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Scansite 2.0: proteome-wide prediction of cell signaling interactions using short sequence motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Obenauer</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Cantley</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Yaffe</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3635</fpage>
            <lpage>3641</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">168990</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824383</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg584</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Highly specific prediction of phosphorylation sites in proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Koenig</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grabe</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>3620</fpage>
            <lpage>3627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth455</pubid>
                  <pubid idtype="pmpid" link="fulltext">15297298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>GPS: a novel group-based phosphorylation predicting and scoring method</p>
            </title>
            <aug>
               <au>
                  <snm>Zhou</snm>
                  <fnm>FF</fnm>
               </au>
               <au>
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Comm</source>
            <pubdate>2004</pubdate>
            <volume>325</volume>
            <fpage>1443</fpage>
            <lpage>1448</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bbrc.2004.11.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">15555589</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Global analysis of phosphorylation in yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Ptacek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Devgan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Michaud</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Fasolo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jona</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sopko</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>438</volume>
            <fpage>679</fpage>
            <lpage>684</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04187</pubid>
                  <pubid idtype="pmpid" link="fulltext">16319894</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Phosphoproteomics toolbox: Computational biology, protein chemistry and mass spectrometry</p>
            </title>
            <aug>
               <au>
                  <snm>Hjerrild</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gammeltoft</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>FEBS Letters</source>
            <pubdate>2006</pubdate>
            <volume>580</volume>
            <fpage>4764</fpage>
            <lpage>4770</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.febslet.2006.07.068</pubid>
                  <pubid idtype="pmpid" link="fulltext">16914146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A support vector machine approach to the identification of phosphorylation sites</p>
            </title>
            <aug>
               <au>
                  <snm>Plewczynski</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tkacz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Godzik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rychiewski</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Cell Mol Biol Lett</source>
            <pubdate>2005</pubdate>
            <volume>10</volume>
            <fpage>73</fpage>
            <lpage>89</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15809681</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Phospho3D: a database of three-dimensional structures of protein phosphorylation sites</p>
            </title>
            <aug>
               <au>
                  <snm>Zanzoni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ausiello</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Via</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gherardini</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Helmer-Citterich</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D229</fpage>
            <lpage>D231</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669737</pubid>
                  <pubid idtype="pmpid" link="fulltext">17142231</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl922</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The importance of intrinsic disorder for protein phosphorylation</p>
            </title>
            <aug>
               <au>
                  <snm>Iakoucheva</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Radivojac</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>O'Connor</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Sikes</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Obradovic</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Dunker</snm>
                  <fnm>AK</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>1037</fpage>
            <lpage>1049</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">373391</pubid>
                  <pubid idtype="pmpid" link="fulltext">14960716</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh253</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Intrinsic disorder is a key characteristic in partners that bind 14-3-3 proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Bustos</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Iglesias</snm>
                  <fnm>AA</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2006</pubdate>
            <volume>63</volume>
            <fpage>35</fpage>
            <lpage>42</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.20888</pubid>
                  <pubid idtype="pmpid" link="fulltext">16444738</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Local structural disorder imparts plasticity on linear motifs</p>
            </title>
            <aug>
               <au>
                  <snm>Fuxreiter</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tompa</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>950</fpage>
            <lpage>956</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm035</pubid>
                  <pubid idtype="pmpid" link="fulltext">17387114</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Structural basis for control by phosphorylation</p>
            </title>
            <aug>
               <au>
                  <snm>Johnson</snm>
                  <fnm>LN</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>Chem Rev</source>
            <pubdate>2001</pubdate>
            <volume>101</volume>
            <fpage>2209</fpage>
            <lpage>2242</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/cr000225s</pubid>
                  <pubid idtype="pmpid" link="fulltext">11749371</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>A computational method for the analysis and prediction of protein:phosphopeptide-binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Joughin</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Tidor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Yaffe</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Protein Science</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <fpage>131</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.04964705</pubid>
                  <pubid idtype="pmpid" link="fulltext">15576564</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Lessons from Nature: On the molecular recognition elements of the phosphoprotein binding-domains</p>
            </title>
            <aug>
               <au>
                  <snm>Roque</snm>
                  <fnm>ACA</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Biotechnol Bioeng</source>
            <pubdate>2005</pubdate>
            <volume>91</volume>
            <fpage>546</fpage>
            <lpage>555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bit.20561</pubid>
                  <pubid idtype="pmpid" link="fulltext">15959902</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Structural modes of stabilization of permissive phosphorylation sites in protein kinases: Distinct strategies in Ser/Thr and Tyr kinases</p>
            </title>
            <aug>
               <au>
                  <snm>Krupa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Preethi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>339</volume>
            <fpage>1025</fpage>
            <lpage>1039</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.04.043</pubid>
                  <pubid idtype="pmpid" link="fulltext">15178245</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Conformational changes in protein loops and helices induced by post-translational phosphorylation</p>
            </title>
            <aug>
               <au>
                  <snm>Groban</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Narayanan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jacobson</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>PloS Comput Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e32</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1440919</pubid>
                  <pubid idtype="pmpid" link="fulltext">16628247</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0020032</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Effect of phosphorylation on &#945;-helix stability as a function of position</p>
            </title>
            <aug>
               <au>
                  <snm>Andrew</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Warwicker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Doig</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2002</pubdate>
            <volume>41</volume>
            <fpage>1897</fpage>
            <lpage>1905</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi0113216</pubid>
                  <pubid idtype="pmpid" link="fulltext">11827536</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Phosphorylation stabilizes the N-termini of &#945;-helices</p>
            </title>
            <aug>
               <au>
                  <snm>Smart</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>McCammon</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Biopolymers</source>
            <pubdate>1999</pubdate>
            <volume>49</volume>
            <fpage>225</fpage>
            <lpage>233</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0282(199903)49:3&lt;225::AID-BIP4>3.0.CO;2-B</pubid>
                  <pubid idtype="pmpid" link="fulltext">9990840</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>A phosphoserine-lysine salt bridge within an &#945;-helical peptide, the strongest &#945;-helix side-chain interaction measured to date</p>
            </title>
            <aug>
               <au>
                  <snm>Errington</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Doig</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>2005</pubdate>
            <volume>44</volume>
            <fpage>7553</fpage>
            <lpage>7558</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi050297j</pubid>
                  <pubid idtype="pmpid" link="fulltext">15895998</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Amazing stability of the arginine-phosphate electrostatic interaction</p>
            </title>
            <aug>
               <au>
                  <snm>Woods</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ferre</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Proteome Research</source>
            <pubdate>2005</pubdate>
            <volume>4</volume>
            <fpage>1397</fpage>
            <lpage>1402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/pr050077s</pubid>
                  <pubid idtype="pmpid" link="fulltext">16083292</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Strengths of hydrogen bonds involving phosphorylated amino acid sidechains</p>
            </title>
            <aug>
               <au>
                  <snm>Mandell</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Chorny</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Groban</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Levine</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rapp</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Jacobson</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>2007</pubdate>
            <volume>129</volume>
            <fpage>820</fpage>
            <lpage>827</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ja063019w</pubid>
                  <pubid idtype="pmpid" link="fulltext">17243818</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The Protein Data Bank</p>
            </title>
            <aug>
               <au>
                  <snm>Berman</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Westbrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gilliland</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bhat</snm>
                  <fnm>TN</fnm>
               </au>
               <au>
                  <snm>Weissig</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Shindyalov</snm>
                  <fnm>IN</fnm>
               </au>
               <au>
                  <snm>Bourne</snm>
                  <fnm>PE</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>235</fpage>
            <lpage>242</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102472</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592235</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.235</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>PISCES: a protein sequence culling server</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dunbrack</snm>
                  <fnm>R</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>1589</fpage>
            <lpage>1591</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg224</pubid>
                  <pubid idtype="pmpid" link="fulltext">12912846</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Dipoles localized at helix termini of proteins stabilize charges</p>
            </title>
            <aug>
               <au>
                  <snm>Aqvist</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Luecke</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Quiocho</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Warshel</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1991</pubdate>
            <volume>88</volume>
            <fpage>2026</fpage>
            <lpage>2030</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">51159</pubid>
                  <pubid idtype="pmpid" link="fulltext">2000410</pubid>
                  <pubid idtype="doi">10.1073/pnas.88.5.2026</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The P-loop &#8211; a common motif in ATP- and GTP-binding proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Saraste</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sibbald</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Wittinghofer</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1990</pubdate>
            <volume>15</volume>
            <fpage>430</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0968-0004(90)90281-F</pubid>
                  <pubid idtype="pmpid">2126155</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Crystal structure of the 14-3-3&#950;: serotonin N-acetyltransferase complex. A role for scaffolding in enzyme regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Obsil</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ghirlando</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Klein</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Ganguly</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dyda</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2001</pubdate>
            <volume>105</volume>
            <fpage>257</fpage>
            <lpage>267</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(01)00316-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">11336675</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The structure of the &#946;-catenin/E-cadherin complex and the molecular basis of diverse ligand recognition by &#946;-catenin</p>
            </title>
            <aug>
               <au>
                  <snm>Huber</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Weis</snm>
                  <fnm>WI</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2001</pubdate>
            <volume>105</volume>
            <fpage>391</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(01)00330-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">11348595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Mechanism of phosphorylation-dependent binding of APC to &#946;-catenin and its role in &#946;-catenin degradation</p>
            </title>
            <aug>
               <au>
                  <snm>Ha</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Tonozuka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Stamos</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Weis</snm>
                  <fnm>WI</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>15</volume>
            <fpage>511</fpage>
            <lpage>521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2004.08.010</pubid>
                  <pubid idtype="pmpid" link="fulltext">15327768</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Sequential phosphorylation and multisite interactions characterize specific target recognition by the FHA domain of Ki67</p>
            </title>
            <aug>
               <au>
                  <snm>Byeon</snm>
                  <fnm>IJ</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gronenborn</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>MD</fnm>
               </au>
            </aug>
            <source>Nat Struct Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>12</volume>
            <fpage>987</fpage>
            <lpage>993</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16244663</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Structural changes at the metal binding site during the phosphoglucomutase reaction</p>
            </title>
            <aug>
               <au>
                  <snm>Ray</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Post</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Rhyu</snm>
                  <fnm>GI</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1993</pubdate>
            <volume>32</volume>
            <fpage>48</fpage>
            <lpage>57</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi00052a008</pubid>
                  <pubid idtype="pmpid">8418859</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Crystallographic structure reveals phosphorylated pilin from <it>Neisseria</it>: phosphoserine sites modify type IV pilus surface chemistry and fibre morphology</p>
            </title>
            <aug>
               <au>
                  <snm>Forest</snm>
                  <fnm>KT</fnm>
               </au>
               <au>
                  <snm>Dunham</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Koomey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tainer</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>31</volume>
            <fpage>743</fpage>
            <lpage>752</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1999.01184.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10048019</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Structure of nucleotide-binding domain 1 of the cystic fibrosis transmembrane conductance regulator</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Buchanan</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Burley</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Conners</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dickey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dorwart</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fowler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Guggino</snm>
                  <fnm>WB</fnm>
               </au>
               <au>
                  <snm>Hendrickson</snm>
                  <fnm>WA</fnm>
               </au>
               <etal/>
            </aug>
            <source>EMBO J</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <fpage>282</fpage>
            <lpage>293</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1271750</pubid>
                  <pubid idtype="pmpid" link="fulltext">14685259</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Structural systems biology: modelling protein interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Aloy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>RB</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>188</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1859</pubid>
                  <pubid idtype="pmpid" link="fulltext">16496021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Regulation of the human WEE1Hu CDK tyrosine 15-kinase during the cell cycle</p>
            </title>
            <aug>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Broome</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hunter</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>1995</pubdate>
            <volume>14</volume>
            <fpage>1878</fpage>
            <lpage>1891</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">398287</pubid>
                  <pubid idtype="pmpid">7743995</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Structural and functional characterization of the phosphorylated adipocyte lipid-binding protein (pp15)</p>
            </title>
            <aug>
               <au>
                  <snm>Buelt</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Banaszak</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Bernlohr</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1992</pubdate>
            <volume>31</volume>
            <fpage>3493</fpage>
            <lpage>3499</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi00128a025</pubid>
                  <pubid idtype="pmpid">1372828</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Serine-578 is a major phosphorylation locus in human plasma plasminogen</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Prorok</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bretthauer</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Castellino</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Biochemistry</source>
            <pubdate>1997</pubdate>
            <volume>36</volume>
            <fpage>8100</fpage>
            <lpage>8106</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/bi970328d</pubid>
                  <pubid idtype="pmpid" link="fulltext">9201958</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Enzyme/non-enzyme discrimination and prediction of enzyme active site location using charge-based methods</p>
            </title>
            <aug>
               <au>
                  <snm>Bate</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Warwicker</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>340</volume>
            <fpage>263</fpage>
            <lpage>276</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.04.070</pubid>
                  <pubid idtype="pmpid" link="fulltext">15201051</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>PIBASE: a comprehensive database of structurally defined protein interfaces</p>
            </title>
            <aug>
               <au>
                  <snm>Davis</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Sali</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>1901</fpage>
            <lpage>1907</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti277</pubid>
                  <pubid idtype="pmpid" link="fulltext">15657096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Protein sequence databases</p>
            </title>
            <aug>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CH</fnm>
               </au>
            </aug>
            <source>Curr Opin Chem Biol</source>
            <pubdate>2004</pubdate>
            <volume>8</volume>
            <fpage>76</fpage>
            <lpage>80</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cbpa.2003.12.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">15036160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Improved pKa calculations through flexibility-based sampling of a water-dominated interaction scheme</p>
            </title>
            <aug>
               <au>
                  <snm>Warwicker</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Protein Sci</source>
            <pubdate>2004</pubdate>
            <volume>13</volume>
            <fpage>2793</fpage>
            <lpage>2805</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1110/ps.04785604</pubid>
                  <pubid idtype="pmpid" link="fulltext">15388865</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
