<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-10-r144</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Comparative phosphoproteomics reveals evolutionary and functional conservation of phosphorylation across eukaryotes</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Boekhorst</snm>
               <fnm>Jos</fnm>
               <insr iid="I1"/>
               <email>J.Boekhorst@uu.nl</email>
            </au>
            <au id="A2">
               <snm>van Breukelen</snm>
               <fnm>Bas</fnm>
               <insr iid="I2"/>
               <email>b.vanbreukelen@uu.nl</email>
            </au>
            <au id="A3">
               <snm>Heck</snm>
               <mi>JR</mi>
               <fnm>Albert</fnm>
               <insr iid="I2"/>
               <email>a.j.r.heck@uu.nl</email>
            </au>
            <au id="A4">
               <snm>Snel</snm>
               <fnm>Berend</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>b.snel@uu.nl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics, Department of Biology, Faculty of Science, Utrecht University, Padualaan, 3584 CH, The Netherlands</p>
            </ins>
            <ins id="I2">
               <p>Biomolecular Mass Spectrometry and Proteomics Group, Bijvoet Center for Biomolecular Research and Utrecht Institute for Pharmaceutical Sciences, Utrecht University, Sorbonnelaan, 3584 CA Utrecht, The Netherlands</p>
            </ins>
            <ins id="I3">
               <p>Academic Biomedical Centre, Utrecht University, Yalelaan, 3584 CL Utrecht, The Netherlands</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>10</issue>
         <fpage>R144</fpage>
         <url>http://genomebiology.com/2008/9/10/R144</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18828897</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-10-r144</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>08</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>03</day>
               <month>9</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>01</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>01</day>
               <month>10</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Boekhorst et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Phosphorylation in eukaryote evolution</p>
      </shorttitle>
      <shortabs>
         <p>A comparison of phosphoproteomics datasets of six eukaryotes shows significant overlap between phosphoproteomes.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Reversible phosphorylation of proteins is involved in a wide range of processes, ranging from signaling cascades to regulation of protein complex assembly. Little is known about the structure and evolution of phosphorylation networks. Recent high-throughput phosphoproteomics studies have resulted in the rapid accumulation of phosphopeptide datasets for many model organisms. Here, we exploit these novel data for the comparative analysis of phosphorylation events between different species of eukaryotes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Comparison of phosphoproteomics datasets of six eukaryotes yields an overlap ranging from approximately 700 sites for human and mouse (two large datasets of closely related species) to a single site for fish and yeast (distantly related as well as two of the smallest datasets). Some conserved events appear surprisingly old; those shared by plants and animals suggest conservation over the time scale of a billion years. In spite of the presumed incompleteness of phosphoproteomics datasets and differences in experimental procedures, we show that the overlap between phosphoproteomes is greater than expected by chance and indicates increased functional relevance. Despite the dynamic nature of the evolution of phosphorylation, clustering the datasets by their relative overlap recovers the phylogeny of the species studied.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This analysis provides a framework for the generation of biological insights by comparative analysis of high-throughput phosphoproteomics datasets. We expect the rapidly growing body of data from high-throughput mass spectrometry analysis to make comparative phosphoproteomics a powerful tool for elucidating the evolutionary and functional dynamics of reversible phosphorylation.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010001">Biochemistry and structural biology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Post-translational modifications play important roles in a wide range of cellular functions. Reversible phosphorylation has been studied extensively and is known to influence protein function by changing protein-protein binding properties, activity, stability, and spatial organization <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Phosphorylation plays a key role in signal transduction cascades <abbrgrp><abbr bid="B2">2</abbr></abbrgrp> and allows the fine tuning of protein complex assembly <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. It is estimated that about one-third of all proteins in eukaryotic cells are phosphorylated at any given time <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>.</p>
         <p>Recent developments in high-throughput phosphoproteomics studies have resulted in the availability of phosphopeptide datasets for many model organisms. As a result, tools for the comparison of phosphoproteomes are emerging <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. Although these high-throughput datasets do not capture all phosphorylated peptides of a species under a given condition, large advances in enrichment strategies and mass spectrometry techniques have been made in the past few years, and studies comparing partial phosphoproteomes are emerging <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Even though both the incompleteness of the data and differences in experimental procedures complicate comparative analysis, we can now start to exploit these data. Comparative analysis of phosphoproteomics data could increase our understanding of phosphorylation and the evolution of the phosphorylation network as a systems-level property.</p>
         <p>Not only do comparative analyses aid in elucidating the evolution of phosphorylation, but they also are a powerful tool with which to improve function prediction from sometimes noisy high-throughput datasets. For example, the use of conserved gene order has been shown to be a much stronger signal for protein function prediction than the order of genes in a single genome <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. Similarly, the conservation of co-expression has been shown to aid function prediction from microarray data <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>In this study we perform comparative analysis of phosphorylation events in eukaryotes. Our aim is to determine whether the quality of the data is sufficient to detect functionally significant overlap between high-throughput phosphoproteomics datasets, and to identify an evolutionarily significant pattern in this overlap. To address these questions, we compare recent high-throughput phosphoproteomics datasets of human, mouse, zebrafish, fruit fly, yeast, and plant. We determine the overlap between these datasets and show that this overlap is statistically, functionally, and evolutionarily relevant.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Measuring the overlap in phosphoproteomes</p>
            </st>
            <p>We analyzed the overlap between high-throughput phosphoproteomics datasets from six species of eukaryotes. These datasets were created by different laboratories, using different experimental procedures (Table <tblr tid="T1">1</tblr>). To make these datasets suitable for comparative analysis, we imposed a relatively strict set of cutoffs on phosphopeptide calls, which improves uniformity and reduces noise caused by differences in scoring methods and thresholds (more details are provided in the Materials and methods section, below). The sizes of the individual datasets range from 724 to 3,296 phosphorylation sites (Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1" hint_layout="single">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Phosphoproteomics datasets</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="left">
                        <p>Reference</p>
                     </c>
                     <c ca="left">
                        <p>Proteins (<it>n</it>)<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Sites (<it>n</it>)<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Human</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B34">34</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1,419</p>
                     </c>
                     <c ca="left">
                        <p>3,296</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B23">23</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1,605</p>
                     </c>
                     <c ca="left">
                        <p>3,142</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fly</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B14">14</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="left">
                        <p>991</p>
                     </c>
                     <c ca="left">
                        <p>2,080</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Yeast</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B35">35</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="left">
                        <p>481</p>
                     </c>
                     <c ca="left">
                        <p>850</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plant</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B22">22</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="left">
                        <p>470</p>
                     </c>
                     <c ca="left">
                        <p>724</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Zebrafish</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B11">11</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                     <c ca="left">
                        <p>668</p>
                     </c>
                     <c ca="left">
                        <p>759</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Can be less than the number mentioned in the original papers, because we imposed a relatively strict set of cutoffs on phosphopeptide calls to improve the uniformity and reduce noise.</p>
               </tblfn>
            </tbl>
            <p>We identified homologous sequences by an all-against-all Smith-Waterman search of all full-length proteins for which one or more phosphopeptides were present in the datasets. Phosphosites are considered homologous when a phosphosite in the query is aligned with the same type of phosphosite in the target sequence (workflow illustrated in Figure <figr fid="F1">1</figr>). For each dataset (the query) we counted the number of phosphorylation sites in the query dataset with at least one homolog in each of the target datasets (Table <tblr tid="T2">2</tblr>). The overlap between the datasets ranges from approximately 700 sites for human and mouse (two large datasets from closely related species) to a single site for fish and yeast (distantly related as well as two of the smallest datasets). Despite the virtually nonexistent overlap between fish and yeast, larger datasets of distantly related species exhibit considerable conservation; for example, mouse and plant share 27 phosphosites. We detect an overlap that is substantially larger than the overlap reported in specific phosphoproteomics experiments; the analysis conducted by Lemeer and coworkers <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> resulted in 50 phosphosites in zebrafish that had already been reported in human or mouse, whereas we find an overlap of more than 150.</p>
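            <p>As an illustration only, the site-matching step described above can be sketched in Python. The sketch assumes that a pairwise alignment is already available as two gapped strings covering both proteins from their first residue, and that "the same type of phosphosite" means the same phosphorylated residue (S, T, or Y). The function names <it>map_alignment_columns</it> and <it>conserved_sites</it> are hypothetical and are not taken from the pipeline used in this study.</p>
            <p>
```python
def map_alignment_columns(aligned_query, aligned_target):
    """Map 1-based query positions to 1-based target positions.

    Only alignment columns that pair two residues (no gap) are recorded.
    """
    mapping = {}
    q_pos = 0
    t_pos = 0
    for q_char, t_char in zip(aligned_query, aligned_target):
        if q_char != "-":
            q_pos += 1
        if t_char != "-":
            t_pos += 1
        if q_char != "-" and t_char != "-":
            mapping[q_pos] = t_pos
    return mapping


def conserved_sites(aligned_query, aligned_target, query_sites, target_sites):
    """Return (query position, target position) pairs of homologous phosphosites.

    Assumption: a query site counts as conserved when it is aligned with a
    phosphorylated target residue of the same amino acid type.
    """
    query_seq = aligned_query.replace("-", "")
    target_seq = aligned_target.replace("-", "")
    mapping = map_alignment_columns(aligned_query, aligned_target)
    conserved = []
    for site in sorted(query_sites):
        partner = mapping.get(site)
        if partner is None or partner not in target_sites:
            continue
        if query_seq[site - 1] == target_seq[partner - 1]:
            conserved.append((site, partner))
    return conserved


# Toy alignment: query sites at S3, T5, Y6; target sites at S3, Y6.
pairs = conserved_sites("MKS-PTYA", "MRSAP-YA", {3, 5, 6}, {3, 6})
print(pairs)
```
            </p>
            <p>In the toy alignment, the query threonine faces a gap and is therefore not counted. Counting, per query dataset, the sites with at least one such partner in a target dataset gives numbers of the kind reported in Table <tblr tid="T2">2</tblr>.</p>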
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Workflow for determining conservation between two phosphoproteomics datasets</p>
               </caption>
               <text>
                  <p>Workflow for determining conservation between two phosphoproteomics datasets. Black letters are amino acid residues, and a white p in a red circle indicates a phosphogroup. A more detailed description of this procedure can be found in the Materials and methods section.</p>
               </text>
               <graphic file="gb-2008-9-10-r144-1"/>
            </fig>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Number of query phosphorylation sites with at least one conserved site in the target species</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Query\target</p>
                     </c>
                     <c ca="center">
                        <p>Plant</p>
                     </c>
                     <c ca="center">
                        <p>Fly</p>
                     </c>
                     <c ca="center">
                        <p>Human</p>
                     </c>
                     <c ca="center">
                        <p>Mouse</p>
                     </c>
                     <c ca="center">
                        <p>Yeast</p>
                     </c>
                     <c ca="center">
                        <p>Fish</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plant</p>
                     </c>
                     <c ca="center">
                        <p>&#215;</p>
                     </c>
                     <c ca="center">
                        <p>9 (3.4)</p>
                     </c>
                     <c ca="center">
                        <p>13 (6.1)</p>
                     </c>
                     <c ca="center">
                        <p>27 (9.6)</p>
                     </c>
                     <c ca="center">
                        <p>3 (3.1)</p>
                     </c>
                     <c ca="center">
                        <p>4 (1.8)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fly</p>
                     </c>
                     <c ca="center">
                        <p>9 (3.1)</p>
                     </c>
                     <c ca="center">
                        <p>&#215;</p>
                     </c>
                     <c ca="center">
                        <p>85 (32.0)</p>
                     </c>
                     <c ca="center">
                        <p>72 (28.0)</p>
                     </c>
                     <c ca="center">
                        <p>4 (3.2)</p>
                     </c>
                     <c ca="center">
                        <p>35 (6.5)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Human</p>
                     </c>
                     <c ca="center">
                        <p>13 (5.6)</p>
                     </c>
                     <c ca="center">
                        <p>88 (33.7)</p>
                     </c>
                     <c ca="center">
                        <p>&#215;</p>
                     </c>
                     <c ca="center">
                        <p>700 (155.5)</p>
                     </c>
                     <c ca="center">
                        <p>8 (6.3)</p>
                     </c>
                     <c ca="center">
                        <p>157 (27.6)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="center">
                        <p>27 (9.3)</p>
                     </c>
                     <c ca="center">
                        <p>79 (28.8)</p>
                     </c>
                     <c ca="center">
                        <p>706 (151.5)</p>
                     </c>
                     <c ca="center">
                        <p>&#215;</p>
                     </c>
                     <c ca="center">
                        <p>13 (6.7)</p>
                     </c>
                     <c ca="center">
                        <p>151 (19.7)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Yeast</p>
                     </c>
                     <c ca="center">
                        <p>2 (2.8)</p>
                     </c>
                     <c ca="center">
                        <p>4 (3.1)</p>
                     </c>
                     <c ca="center">
                        <p>6 (5.9)</p>
                     </c>
                     <c ca="center">
                        <p>11 (6.4)</p>
                     </c>
                     <c ca="center">
                        <p>&#215;</p>
                     </c>
                     <c ca="center">
                        <p>1 (1.6)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fish</p>
                     </c>
                     <c ca="center">
                        <p>3 (1.5)</p>
                     </c>
                     <c ca="center">
                        <p>38 (6.5)</p>
                     </c>
                     <c ca="center">
                        <p>149 (26.0)</p>
                     </c>
                     <c ca="center">
                        <p>132 (18.9)</p>
                     </c>
                     <c ca="center">
                        <p>1 (1.6)</p>
                     </c>
                     <c ca="center">
                        <p>&#215;</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The number in parentheses is the average number of conserved sites over 1,000 randomization trials in which the positions of the phosphorylation sites were shuffled. Please note that the overlap is not symmetric, because a site in a query dataset can have multiple homologs in a target dataset.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>The overlap between phosphoproteomics sets is significant</p>
            </st>
            <p>Chance alone would result in a certain amount of overlap, both in a scenario in which reversible phosphorylation evolves so rapidly that the species have diverged too far for real homologous phosphosites to be detected, and in one in which species completely re-wire their phosphoproteome after speciation. We thus randomized, for every protein in the datasets, the positions of the phosphorylated residues across 1,000 trials and computed the average overlap. Note that this is a conservative null model, because it assumes that different species phosphorylate the same protein, whereas cases have been described in which different species use phosphorylation of different proteins for the regulation of the assembly of homologous protein complexes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. The observed overlap is larger than the average random overlap for almost all species comparisons (Table <tblr tid="T2">2</tblr>), strongly suggesting that the observed overlap is the result of significant evolutionary conservation.</p>
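            <p>A minimal Python sketch of such a randomization, for a single aligned protein pair, is given below. It rests on assumptions that are not stated in this section: shuffled sites are drawn only from residues of the same type (S, T, or Y) within the same protein, the sites of both proteins are shuffled, and the alignment is supplied as a dictionary from query positions to target positions. All function names are hypothetical.</p>
            <p>
```python
import random


def shuffle_sites(sequence, sites, rng):
    """Redistribute phosphosites within one protein (1-based positions).

    Assumption: the number of sites per residue type (S, T, Y) is kept,
    and new positions are drawn from residues of that same type.
    """
    shuffled = set()
    for residue in "STY":
        n_sites = sum(1 for pos in sites if sequence[pos - 1] == residue)
        candidates = [i + 1 for i, aa in enumerate(sequence) if aa == residue]
        shuffled.update(rng.sample(candidates, n_sites))
    return shuffled


def count_overlap(q_seq, q_sites, t_seq, t_sites, pairs):
    """Count query sites aligned with a target site of the same residue type."""
    return sum(
        1
        for pos in q_sites
        if pairs.get(pos) in t_sites and q_seq[pos - 1] == t_seq[pairs[pos] - 1]
    )


def mean_random_overlap(q_seq, q_sites, t_seq, t_sites, pairs, trials=1000, seed=0):
    """Average overlap over repeated shuffles of the sites in both proteins."""
    rng = random.Random(seed)
    total = 0
    for _ in range(trials):
        total += count_overlap(
            q_seq,
            shuffle_sites(q_seq, q_sites, rng),
            t_seq,
            shuffle_sites(t_seq, t_sites, rng),
            pairs,
        )
    return total / trials


# Toy example: identical proteins aligned position by position.
identity = {i: i for i in range(1, 6)}
observed = count_overlap("SASAS", {1}, "SASAS", {1}, identity)
expected = mean_random_overlap("SASAS", {1}, "SASAS", {1}, identity)
print(observed, expected)
```
            </p>
            <p>Summing the observed and the shuffled counts over all aligned protein pairs of two datasets would give the two kinds of numbers that are compared in Table <tblr tid="T2">2</tblr>.</p>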
            <p>Given the difficulty of formulating a null model for the significance of conservation between two species, we next considered the conservation of phosphorylation events over three or more species; if evolution plays no role in the overlap between datasets, then the chance of a specific site being conserved in one species will be independent of the presence or absence of that same site in other species. We thus compare the number of sites with homologs in two or more species with the number of sites that we would expect if we assume the chances of being conserved in different species to be independent (Table <tblr tid="T3">3</tblr>). For all datasets we observe that the number of sites observed in three, four, or five different species exceeds the number of sites expected assuming independence. Although we do not observe any phosphosites with homologs in all six species, we do observe a number of phosphorylation sites in <it>Arabidopsis thaliana </it>with homologs in one or more of the other datasets. These sites predate the evolutionary split between plants and opisthokonts, making them more than a billion years old <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
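            <p>One way to make the independence assumption concrete is sketched below in Python. It assumes that the chance of a query site being conserved in a given target species is estimated as the fraction of query sites with a homolog in that species, and that the expected count refers to sites conserved in exactly a given number of target species. Both points are assumptions of this sketch, which is not guaranteed to reproduce the expected values in Table <tblr tid="T3">3</tblr>; the function name <it>expected_site_counts</it> is hypothetical.</p>
            <p>
```python
def expected_site_counts(n_query_sites, conserved_per_target):
    """Expected number of query sites conserved in exactly k target species.

    Returns a list indexed by k = 0 .. number of targets. Each target is
    treated as an independent trial with success probability
    (conserved sites in that target) / (query sites).
    """
    probs = [count / n_query_sites for count in conserved_per_target]
    dist = [1.0]  # dist[k]: probability of conservation in exactly k targets
    for p in probs:
        updated = [0.0] * (len(dist) + 1)
        for k, mass in enumerate(dist):
            updated[k] += mass * (1.0 - p)   # not conserved in this target
            updated[k + 1] += mass * p       # conserved in this target
        dist = updated
    return [n_query_sites * mass for mass in dist]


# Toy example: 100 query sites, half of them conserved in each of two targets.
print(expected_site_counts(100, [50, 50]))  # [25.0, 50.0, 25.0]
```
            </p>
            <p>Because Table <tblr tid="T3">3</tblr> counts the query organism itself, conservation in exactly two target species corresponds to the column "three different species".</p>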
            <tbl id="T3" hint_layout="double">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Number of sites found in three or more different species</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Three different species<sup>a</sup></p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Four different species</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Five different species</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Observed</p>
                     </c>
                     <c ca="center">
                        <p>Expected<sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>Observed</p>
                     </c>
                     <c ca="center">
                        <p>Expected</p>
                     </c>
                     <c ca="center">
                        <p>Observed</p>
                     </c>
                     <c ca="center">
                        <p>Expected</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plant</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>1.55</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.02</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fly</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>7.55</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>0.18</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Human</p>
                     </c>
                     <c ca="center">
                        <p>103</p>
                     </c>
                     <c ca="center">
                        <p>59.45</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>1.31</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>0.01</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mouse</p>
                     </c>
                     <c ca="center">
                        <p>106</p>
                     </c>
                     <c ca="center">
                        <p>63.59</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>1.80</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>0.02</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Yeast</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.27</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0.00</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fish</p>
                     </c>
                     <c ca="center">
                        <p>72</p>
                     </c>
                     <c ca="center">
                        <p>40.15</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>1.86</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>0.01</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Total number of species in which a phosphosite was present, including the query organism. We did not identify any sites with homologs in all six datasets. <sup>b</sup>The number of expected sites assuming independent chances of conservation (the chance of a specific site being conserved in one species is independent of the presence or absence of that same site in other species).</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Relative overlap between phosphoproteomics data sets contains a strong evolutionary signal</p>
            </st>
            <p>Two independent tests suggest that the phosphorylation overlap is quantitatively significant. As a next step, we tested for qualitative relevance by searching for an evolutionary pattern in the conservation of phosphoproteomics datasets. Specifically, we asked whether a purportedly dynamic, system-level property such as the phosphorylation repertoire reflects the species phylogeny. Interpreting the relative differences in overlap is far from trivial, however, because a myriad of biological and technical factors, ranging from the sensitivity of the mass spectrometry analysis to the experimental conditions under which phosphoproteomes were sampled, can obscure a potential signal.</p>
            <p>To extract this potential signal, we determined the relative number of conserved phosphorylation sites by comparing the overlap with the number of sites that can potentially be conserved, given the proteins in the specific datasets: the relative overlap. The relative overlap is obtained by dividing the number of conserved phosphorylation events of the query and target datasets by the number of sites in the query dataset with one or more homologous positions in full-length proteins of the target dataset. We subsequently clustered the six species on the basis of their relative overlap with the neighbor-joining algorithm, using 1 - (relative overlap) as the distance measure (Figure <figr fid="F2">2</figr> and Additional data file 1). The topology of the resulting unrooted neighbor-joining tree is identical to the topology of the tree of life for this small sample of six species.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Phosphorylation follows phylogeny</p>
               </caption>
               <text>
                  <p>Phosphorylation follows phylogeny. The distance measure used in the construction of this neighbor-joining tree is 1 - relative overlap (described in detail in the main text). If the tree is rooted at the branch marked with the x, the topology of this tree is identical to the topology of the tree of life of these six species. The tree was generated with Quicktree <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and visualized using Treeview <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>.</p>
               </text>
               <graphic file="gb-2008-9-10-r144-2"/>
            </fig>
            <p>Variations in experimental conditions and protocols potentially obscure the evolutionary signal in the overlap between datasets. If this evolutionary signal is relatively strong, then the relative overlap between datasets from a single species should be greater than the relative overlap between datasets from different species. We determined the relative overlap between an additional dataset from fly <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and the other six datasets (Figure <figr fid="F3">3a</figr>). This additional dataset contains many more phosphosites than the fly dataset that is already part of our analysis (the additional dataset contains 10,293 sites, as compared with the 2,080 of the original dataset), and the two datasets were constructed by different laboratories using different techniques <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. Nevertheless, the relative overlap between both fly datasets is more than twice that with any of the other datasets (Figure <figr fid="F3">3a</figr>), and an extended neighbor-joining tree groups these two datasets together (Figure <figr fid="F3">3b</figr>). The relative overlap between the datasets is thus not only higher than expected by random chance; the relative overlap also follows phylogeny and thus contains a qualitatively strong and relevant evolutionary signal.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>An additional dataset from fly</p>
               </caption>
               <text>
                  <p>An additional dataset from fly. <b>(a) </b>Overlap between the additional fly dataset <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and the original six datasets. <b>(b) </b>Neighbor-joining tree of the relative overlap between these seven datasets.</p>
               </text>
               <graphic file="gb-2008-9-10-r144-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Low-throughput experiments as a gold standard, and conserved phosphosites and protein function</p>
            </st>
            <p>Conservation in sequence and gene order generally has functional meaning <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Low-throughput experiments are in general considered to be more reliable than high-throughput experiments, because they tend to be more suited to controls and validation. Several databases collect experimental data on reversible phosphorylation, for example Phospho.ELM <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and Phosida <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Of all of the phosphosites in the human dataset, 2.5% have also been observed by a low-throughput experiment in the Phospho.ELM database; for the mouse dataset this is 2.0%. In contrast, 4.8% of the conserved sites in human and 4.2% of the conserved sites in mouse have been measured using low-throughput techniques, a significant increase (&#967;<sup>2 </sup>test <it>P </it>&lt; 0.0001). This observation shows that putative phosphorylation events with homologs in other high-throughput experiments are less likely to be false positives. This increase in reliability suggests that the overlap between phosphoproteomics datasets could be used as a tool with which to assess the reliability of putative phosphosites identified in high-throughput experiments, similar to the use of comparative methods for improving reliability of interactomes <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
            <p>Because some functional classes of proteins have been shown to be more conserved than others, we wondered whether this also holds for phosphorylation events. We used the functional classification provided by the Clusters of Orthologous Groups database <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to study over-representation of biological processes among proteins with well conserved phosphosites (Figure <figr fid="F4">4a,b</figr>). These data reveal a clear functional trend in conserved phosphorylation sites; compared with sites that are found in only a single species, a relatively high percentage of phosphosites with homologs in two or more species are found in proteins with functions related to information storage and processing. Most striking is the over-representation of proteins that are involved in replication, chromatin structure, and cell cycle related processes, classes that contain functions that could be considered to be most fundamental for the survival of the cell. The presence of highly conserved phosphorylation events in these functional categories suggests that the fine-tuning mechanisms provided by phosphorylation arose early in evolution. Although on the basis of these data we cannot exclude the possibility that this over-representation is influenced by other factors (for example, proteins with functions related to information storage and processing being more likely to have homologs in all six species studied), a link between conservation of phosphorylation events and protein function is in accordance with other observations (for example, protein function and duplication rate <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Functional classification of conserved phosphosites</p>
               </caption>
               <text>
                  <p>Functional classification of conserved phosphosites. <b>(a) </b>Main classes. The height of the bars represents the percentage of phosphosites with homologs in a specific number of different species (indicated by the color of the bar) belonging to the different classes. The black arrows indicate groups with homologs in a specific number of species that are significantly over-represented (arrows pointing up) or under-represented (arrows pointing down) compared with all phosphorylation events in that functional category. Significance was determined using a Fisher's exact test; scores with a <it>P </it>value below 0.05 after Bonferroni correction were considered significant. <b>(b) </b>Subclasses. The numbers in the cells are the fold increase of the fraction of phosphosites in that subclass relative to the fraction in that subclass of phosphosites without homologs in other species (<sup>2</sup>log [sites in <it>n </it>species] - <sup>2</sup>log [sites in 1 species]). Over-representation is presented in red, and under-representation in blue. Only classes with a total of 80 or more sites and with at least one site found in a total of four species are shown. The black boxes indicate significant under-representation or over-representation (Fisher's exact test, <it>P </it>&lt; 0.05 after Bonferroni correction).</p>
               </text>
               <graphic file="gb-2008-9-10-r144-4"/>
            </fig>
            <p>Phosphorylation events identified in single high-throughput experiments are known to cluster outside globular domains, as measured by PFAM <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Of the events we analyzed, 15% are found inside a domain predicted from the PFAM database <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. When we consider only conserved phosphorylation events, this increases slightly to 17%. The similar percentage shows that the low occurrence of phosphorylation in known globular domains holds true for evolutionarily conserved events, and hence is not the result of spurious phosphorylations in unconfirmed high-throughput data.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Both the incomplete nature of high-throughput phosphoproteomics experiments and the idiosyncrasies of the experimental pipelines used by different laboratories complicate the comparison of high-throughput phosphoproteomics datasets. In addition, the data we are comparing result from experiments designed with different biological questions in mind; the plant experiment, for example, focuses on the phosphorylation of membrane associated proteins from cells grown in culture <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, whereas the mouse experiment uses protein extract from homogenized liver tissue <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. All of these differences will undoubtedly introduce dissimilarities in the observed phosphoproteomes that do not reflect the evolutionary changes in phosphorylation networks between the different species, making the overlap that we found a minimal estimate. Randomization trials, functional bias in highly conserved phosphorylation events, and the relative differences in overlap between the six high-throughput phosphoproteomics datasets all suggest that the overlap between these datasets is biologically relevant, and we successfully identified the evolutionary signal in this overlap. We find a number of phosphorylation events that are likely to predate the evolutionary split between plants and animals. These sites thus appear to be ancient in origin, which is perhaps surprising, given that phosphorylation is thought to be a subtle regulatory mechanism.</p>
         <p>Our work suggests that our understanding of reversible phosphorylation can be increased by comparing the results of high-throughput phosphoproteomics analysis with those from large-scale <it>in vitro </it>phosphorylation assays (for example <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>) or computationally predicted phosphoproteomes. In the current setup (comparing different mass spectrometry based high-throughput phosphoproteomics datasets), experimental idiosyncrasies already loom large over any comparison; hence, we did not include such datasets in this study. However, because we have now shown that the overlap is biologically significant, this restraint can be relaxed; comparative analysis in fact enables the use of the ever-increasing amount of data on phosphorylation obtained by high-throughput mass spectrometry experiments that were not designed specifically for this particular purpose.</p>
         <p>Previous studies have described the conservation across multiple species of amino acid residues that are known to be phosphorylated in a specific organism <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and have studied the conservation of the phosphorylation events themselves on a small scale (for example <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>). PhosphoBlast <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> provides a powerful tool with which to compare (phosphorylated) peptides, which its authors illustrated by comparing human and mouse phosphopeptide datasets. These studies revealed a relatively high conservation of amino acid residues that are known to be phosphorylated in one or more phosphoproteomics experiments, and identified a substantial overlap between the phosphoproteomes of different species. We extend this observation to larger evolutionary distances and show that the overlap is statistically, functionally, and evolutionarily relevant. These insights can be applied, for example, to discriminating between noise and real phosphorylation events in high-throughput mass spectrometry experiments (analogous to the use of conserved gene order in the evaluation of BLAST significance scores <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>).</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The presence of functionally and evolutionarily significant overlap between high-throughput phosphoproteomics experiments allows the use of comparative phosphoproteomics in the prediction and evaluation of phosphorylation networks, similar to the established use of comparative genomics and transcriptomics in the elucidation of protein functions and biological networks. We expect the rapidly growing amount of data from high-throughput mass spectrometry analysis to make comparative phosphoproteomics a powerful tool in predicting, evaluating, and understanding reversible phosphorylation.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Datasets</p>
            </st>
            <p>Table <tblr tid="T1">1</tblr> lists the datasets compared in this study. Because our comparison of high-throughput datasets is already complicated by many factors, ranging from the incomplete nature of the data to differences in experimental procedures, we made an effort to keep putative false-positive phosphorylation sites from further confounding the analysis. We filtered the input data using criteria that in many cases are more stringent than those used in the original publications. Each dataset was preprocessed by removing all phosphopeptides with ambiguous sites (phosphogroups that could not be attributed to a specific amino acid residue), by removing peptides that could not be traced unambiguously to one specific protein, and by applying a strict threshold on the peptide identification scores. For the human, fly, Arabidopsis, and zebrafish datasets we used a Mascot peptide score threshold of 35; for the mouse dataset we used an Ascore threshold of 19; and from the yeast dataset we took only phosphorylation sites with e-values of 1 &#215; 10<sup>-4 </sup>or lower. For the additional fly dataset we used a dCn threshold of 0.1 and a PeptideProphet threshold of 0.9. Data handling was done with <it>ad hoc </it>Python scripts.</p>
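            <p>The three filters described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the scripts used in this study: the record layout (Phosphopeptide and its fields), the score labels, and the function name keep are hypothetical, and treating the thresholds as inclusive lower bounds is an assumption. Only the Mascot and Ascore thresholds from the text are shown.</p>
            <p><![CDATA[
```python
from dataclasses import dataclass

# Score thresholds quoted in the text. The keys and the record layout are
# hypothetical; treating the thresholds as inclusive is an assumption.
MIN_SCORE = {"mascot": 35.0, "ascore": 19.0}


@dataclass
class Phosphopeptide:
    sequence: str
    score: float          # peptide identification score
    score_type: str       # "mascot" or "ascore" (assumed labels)
    site_ambiguous: bool  # True if the phosphogroup cannot be localized
    n_proteins: int       # number of proteins the peptide maps to


def keep(peptide: Phosphopeptide) -> bool:
    """Return True if the peptide passes all three filters from the text."""
    if peptide.site_ambiguous:
        return False  # phosphogroup not attributable to one residue
    if peptide.n_proteins != 1:
        return False  # peptide not traceable to one specific protein
    return peptide.score >= MIN_SCORE[peptide.score_type]
```
]]></p>
            <p>A list of records could then be reduced with a comprehension such as [p for p in peptides if keep(p)], where peptides is a hypothetical list of Phosphopeptide records.</p>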
         </sec>
         <sec>
            <st>
               <p>Overlap</p>
            </st>
            <p>Homologous phosphosites were identified in two steps. First, we performed an all-against-all similarity search, using the Paralign implementation of the Smith-Waterman algorithm <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, of all of the full-length proteins for which one or more phosphopeptides were present in the datasets. Second, we identified high-scoring segment pairs with an e-value of 1 &#215; 10<sup>-10 </sup>or lower in which both the query and the target had the same type of phosphosite at exactly the same position in the alignment (for example, a phosphorylated serine residue aligned with a phosphorylated serine residue). Because this procedure does not include any (reciprocal) best hit criteria, all we conclude is that similar sites are homologous; the exact nature of this relationship (orthologous, paralogous) remains unclear. The e-value threshold of 1 &#215; 10<sup>-10 </sup>is deliberately strict. A more liberal threshold would increase the overlap (we are now probably missing some homologous phosphorylation events because we did not consider the surrounding sequence to be sufficiently conserved) but would also introduce more noise into an already noisy dataset. In addition, a strict cutoff means that we do not erroneously assume convergently evolved small linear motifs to be homologous (motifs involved in recognition of phosphosites by their kinases tend to be extremely short <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>).</p>
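            <p>The position-matching step described above can be illustrated with a short sketch. It assumes that each high-scoring segment pair is available as a dictionary holding the two gapped alignment strings, their 1-based start coordinates, and the e-value; this layout, the key names, and both function names are hypothetical and do not reflect the Paralign output format or the original scripts.</p>
            <p><![CDATA[
```python
def aligned_positions(query_aln, target_aln, query_start, target_start):
    """Yield (query_pos, target_pos, query_res, target_res) per aligned column.

    Positions are 1-based coordinates in the ungapped proteins; columns
    with a gap ('-') in either sequence are skipped.
    """
    q_pos, t_pos = query_start, target_start
    for q_res, t_res in zip(query_aln, target_aln):
        if q_res != "-" and t_res != "-":
            yield q_pos, t_pos, q_res, t_res
        if q_res != "-":
            q_pos += 1
        if t_res != "-":
            t_pos += 1


def conserved_sites(hsp, query_sites, target_sites, max_evalue=1e-10):
    """Return query phosphosite positions conserved in the target.

    A site counts as conserved when it is aligned to a target phosphosite
    of the same residue type in a segment pair passing the e-value cutoff.
    """
    if hsp["evalue"] > max_evalue:
        return []
    hits = []
    for q_pos, t_pos, q_res, t_res in aligned_positions(
        hsp["query_aln"], hsp["target_aln"],
        hsp["query_start"], hsp["target_start"],
    ):
        if q_pos in query_sites and t_pos in target_sites and q_res == t_res:
            hits.append(q_pos)
    return hits
```
]]></p>
            <p>Here query_sites and target_sites are sets of phosphorylated positions in the two proteins. Requiring identical residues at the aligned column is how the sketch encodes the condition that a phosphorylated serine must be aligned with a phosphorylated serine.</p>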
         </sec>
         <sec>
            <st>
               <p>Expected overlap between datasets assuming independence</p>
            </st>
            <p>The probability that a phosphorylation event in a query dataset is conserved in a target dataset is given by Equation 1.</p>
            <p>
               <display-formula id="M1">P(q &#8712; O<sub>Q, T</sub>) = N<sub>Q, T</sub>/N<sub>Q</sub></display-formula>
            </p>
            <p>Where Q is the query dataset, q is a phosphorylation event in Q, T is the target dataset, &#8712; means 'element of', &#8713; means 'not an element of', O<sub>Q, T </sub>is the overlap of Q and T (events from Q with a homologous event in T), N<sub>Q, T </sub>is the number of events in O<sub>Q, T</sub>, and N<sub>Q </sub>is the total number of events in Q.</p>
            <p>The probability that q has homologs in x of the target datasets is the sum of all possible combinations of presence and absence in all of the target datasets, given x. As an example, we consider target datasets A, B, and C. The probability <it>P </it>that q has homologs in two out of these three datasets is given by Equation 2.</p>
            <p>
               <display-formula id="M2">P(q|x = 2) = P(q &#8712; O<sub>Q, A </sub>&#8898; q &#8712; O<sub>Q, B </sub>&#8898; q &#8713; O<sub>Q, C</sub>)+ P(q &#8712; O<sub>Q, A </sub>&#8898; q &#8713; O<sub>Q, B </sub>&#8898; q &#8712; O<sub>Q, C</sub>) + P(q &#8713; O<sub>Q, A </sub>&#8898; q &#8712; O<sub>Q, B </sub>&#8898; q &#8712; O<sub>Q, C</sub>)</display-formula>
            </p>
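            <p>Where <it>P</it>(q|x = 2) is the probability that q has homologs in two target datasets, and &#8898; is the 'and' operator. Because conservation in each target dataset is assumed to be independent, each joint probability is the product of the corresponding per-dataset probabilities from Equation 1 (or of their complements, for datasets in which q has no homolog).</p>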
            <p>The expected number of phosphorylation events from a query dataset with homologs in x target datasets is now given by Equation 3.</p>
            <p>
               <display-formula id="M3">E(x = i) = <it>P</it>(q|x = i) &#215; N<sub>Q</sub></display-formula>
            </p>
            <p>In which E is the expected value, and i is a number lower than the total number of datasets.</p>
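            <p>Equations 1 to 3 can be combined in a few lines of code. The sketch below is an illustration under the independence assumption stated in this section, not the original analysis script; the function names and the list p_conserved (one Equation 1 probability per target dataset) are hypothetical.</p>
            <p><![CDATA[
```python
from itertools import combinations
from math import prod


def prob_homologs_in_x(p_conserved, x):
    """Probability that a site has homologs in exactly x target datasets.

    p_conserved holds one Equation 1 probability per target dataset.
    Under independence, each presence/absence pattern is a product of
    per-dataset terms; Equation 2 sums over all patterns with x hits.
    """
    n = len(p_conserved)
    total = 0.0
    for present in combinations(range(n), x):
        total += prod(
            p_conserved[i] if i in present else 1.0 - p_conserved[i]
            for i in range(n)
        )
    return total


def expected_sites(n_query, p_conserved, x):
    """Equation 3: expected number of query sites with homologs in x targets."""
    return n_query * prob_homologs_in_x(p_conserved, x)
```
]]></p>
            <p>As a purely illustrative check with made-up probabilities, prob_homologs_in_x([0.5, 0.5, 0.5], 2) sums three patterns of 0.125 each and evaluates to 0.375, which matches the three terms of Equation 2.</p>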
         </sec>
         <sec>
            <st>
               <p>Relative overlap</p>
            </st>
            <p>Relative overlap was calculated by dividing the number of conserved phosphorylation events of the query and target datasets by the number of sites in the query dataset with one or more homologous positions in the target dataset. We identified homologous positions using the results of the all-against-all similarity search described above; a site has a homologous position in a target dataset when the site is part of one or more high-scoring segment pairs in that dataset, irrespective of the specific residue type the site is aligned with.</p>
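            <p>The ratio defined above, and the distance derived from it for tree building, amount to the following. The function and parameter names are hypothetical, and raising an error when no query site has a homologous position is a choice made for this sketch; the text does not say how that case was handled.</p>
            <p><![CDATA[
```python
def relative_overlap(n_conserved, n_alignable):
    """Conserved query sites divided by query sites with a homologous position.

    n_conserved: query sites aligned to a target phosphosite of the same type.
    n_alignable: query sites that fall inside one or more high-scoring
                 segment pairs with the target dataset, whatever residue
                 they are aligned with.
    """
    if n_alignable == 0:
        raise ValueError("no query site has a homologous position")
    return n_conserved / n_alignable


def overlap_distance(n_conserved, n_alignable):
    """Distance used for neighbor joining: 1 - (relative overlap)."""
    return 1.0 - relative_overlap(n_conserved, n_alignable)
```
]]></p>
            <p>With made-up counts of 12 conserved sites out of 48 alignable sites, the relative overlap is 0.25 and the distance is 0.75. These numbers are for illustration only and are not taken from the datasets.</p>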
         </sec>
         <sec>
            <st>
               <p>Domains</p>
            </st>
            <p>We identified known domains in the full-length sequence of all proteins with one or more phosphorylation events. Domains were identified with HMMER <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, using models provided by version 23 of the PFAM database <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The location of phosphorylation events relative to these domains was determined using Python scripts.</p>
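            <p>Locating phosphorylation events relative to predicted domains reduces to an interval test. The sketch below assumes that domain predictions have already been parsed into inclusive, 1-based (start, end) pairs per protein; the data layout and function names are hypothetical, and parsing of HMMER output is not shown.</p>
            <p><![CDATA[
```python
def inside_domain(position, domains):
    """True if a 1-based residue position lies within any (start, end) pair."""
    return any(end >= position >= start for start, end in domains)


def fraction_inside_domains(sites, domains_by_protein):
    """Fraction of (protein_id, position) sites located inside a domain.

    Proteins without any predicted domain contribute sites that count
    as outside.
    """
    if not sites:
        raise ValueError("no phosphosites given")
    n_inside = sum(
        inside_domain(position, domains_by_protein.get(protein_id, []))
        for protein_id, position in sites
    )
    return n_inside / len(sites)
```
]]></p>
            <p>Applying fraction_inside_domains once to all sites and once to the conserved sites only would give the two percentages that are compared in the Results.</p>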
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BS, AH, and JB conceived the study. BS, AH, and BvB participated in its design and coordination, and contributed to manuscript preparation. JB performed the analysis and drafted the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional data files</p>
         </st>
         <p>The following additional data are available with the online version of this paper. Additional data file <supplr sid="S1">1</supplr> provides the number of conserved phosphosites per query phosphosite with one or more homologous sites in the target dataset.</p>
         <suppl id="S1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Number of conserved phosphosites per query phosphosite</p>
            </caption>
            <text>
               <p>Provided is the number of conserved phosphosites per query phosphosite with one or more homologous sites in the target dataset.</p>
            </text>
            <file name="gb-2008-9-10-r144-S1.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by BioRange project SP 2.3.1 of the Netherlands Bioinformatics Centre (NBIC) and by the Netherlands Proteomics Centre. We thank S Mohammed, M Pinkse, and S Lemeer for their phosphorylation data and valuable comments.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The regulation of protein function by multisite phosphorylation--a 25 year update.</p>
            </title>
            <aug>
               <au>
                  <snm>Cohen</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>596</fpage>
            <lpage>601</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(00)01712-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">11116185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Protein-protein interactions define specificity in signal transduction.</p>
            </title>
            <aug>
               <au>
                  <snm>Pawson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nash</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2000</pubdate>
            <volume>14</volume>
            <fpage>1027</fpage>
            <lpage>1047</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10809663</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Dynamic complex formation during the yeast cell cycle.</p>
            </title>
            <aug>
               <au>
                  <snm>de Lichtenberg</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <fpage>724</fpage>
            <lpage>727</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1105103</pubid>
                  <pubid idtype="pmpid" link="fulltext">15692050</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>PhosphoBlast, a computational tool for comparing phosphoprotein signatures among large datasets.</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Klemke</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Mol Cell Proteomics</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>145</fpage>
            <lpage>162</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17934212</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Phosphoproteome analysis of fission yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Wilson-Grady</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Villen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gygi</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>J Proteome Res</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>1088</fpage>
            <lpage>1097</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/pr7006335</pubid>
                  <pubid idtype="pmpid" link="fulltext">18257517</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Conservation of gene order: a fingerprint of proteins that physically interact.</p>
            </title>
            <aug>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <fpage>324</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01274-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9787636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Predicting protein function by genomic context: quantitative evaluation and qualitative inferences.</p>
            </title>
            <aug>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lathe</snm>
                  <fnm>W</fnm>
                  <suf>III</suf>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>1204</fpage>
            <lpage>1210</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310926</pubid>
                  <pubid idtype="pmpid" link="fulltext">10958638</pubid>
                  <pubid idtype="doi">10.1101/gr.10.8.1204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The use of gene clusters to infer functional coupling.</p>
            </title>
            <aug>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fonstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>D'Souza</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pusch</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Maltsev</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>2896</fpage>
            <lpage>2901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">15866</pubid>
                  <pubid idtype="pmpid" link="fulltext">10077608</pubid>
                  <pubid idtype="doi">10.1073/pnas.96.6.2896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Predicting gene function by conserved co-expression.</p>
            </title>
            <aug>
               <au>
                  <snm>van Noort</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>238</fpage>
            <lpage>242</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(03)00056-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">12711213</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A gene-coexpression network for global discovery of conserved genetic modules.</p>
            </title>
            <aug>
               <au>
                  <snm>Stuart</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Segal</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Koller</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>302</volume>
            <fpage>249</fpage>
            <lpage>255</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1087447</pubid>
                  <pubid idtype="pmpid" link="fulltext">12934013</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Online automated in vivo zebrafish phosphoproteomics: from large-scale analysis down to a single embryo.</p>
            </title>
            <aug>
               <au>
                  <snm>Lemeer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pinkse</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Mohammed</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Breukelen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>den Hertog</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Slijper</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Heck</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>J Proteome Res</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>1555</fpage>
            <lpage>1564</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/pr700667w</pubid>
                  <pubid idtype="pmpid" link="fulltext">18307296</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p><it>Bangiomorpha pubescens</it> n. gen., n. sp.: implications for the evolution of sex, multicellularity, and the Mesoproterozoic/Neoproterozoic radiation of eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Butterfield</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Paleobiology</source>
            <pubdate>2000</pubdate>
            <volume>26</volume>
            <fpage>386</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1666/0094-8373(2000)026&lt;0386:BPNGNS&gt;2.0.CO;2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>PhosphoPep: a phosphoproteome resource for systems biology research in <it>Drosophila</it> Kc167 cells.</p>
            </title>
            <aug>
               <au>
                  <snm>Bodenmiller</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Malmstrom</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gerrits</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lam</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rinner</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LN</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>PT</fnm>
               </au>
               <au>
                  <snm>Pedrioli</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Panse</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>HK</fnm>
               </au>
               <au>
                  <snm>Schlapbach</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Aebersold</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Mol Syst Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <fpage>139</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2063582</pubid>
                  <pubid idtype="pmpid" link="fulltext">17940529</pubid>
                  <pubid idtype="doi">10.1038/msb4100182</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Highly robust, automated, and sensitive online TiO2-based phosphoproteomics applied to study endogenous phosphorylation in <it>Drosophila melanogaster</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Pinkse</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Mohammed</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gouw</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>van Breukelen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Vos</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Heck</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>J Proteome Res</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>687</fpage>
            <lpage>697</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/pr700605z</pubid>
                  <pubid idtype="pmpid" link="fulltext">18034456</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Phospho.ELM: a database of experimentally verified phosphorylation sites in eukaryotic proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Diella</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gemund</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Linding</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Via</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kuster</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sicheritz-Ponten</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Blom</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>79</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">449700</pubid>
                  <pubid idtype="pmpid" link="fulltext">15212693</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-79</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>PHOSIDA (phosphorylation site database): management, structural and evolutionary investigation, and prediction of phosphosites.</p>
            </title>
            <aug>
               <au>
                  <snm>Gnad</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Macek</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Oroshi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R250</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2258193</pubid>
                  <pubid idtype="pmpid" link="fulltext">18039369</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-11-r250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Conserved patterns of protein interaction in multiple species.</p>
            </title>
            <aug>
               <au>
                  <snm>Sharan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Suthram</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>McCuine</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Uetz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sittler</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>1974</fpage>
            <lpage>1979</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548573</pubid>
                  <pubid idtype="pmpid" link="fulltext">15687504</pubid>
                  <pubid idtype="doi">10.1073/pnas.0409522102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The COG database: an updated version includes eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sverdlov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Vasudevan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-41</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Natural history and evolutionary principles of gene duplication in fungi.</p>
            </title>
            <aug>
               <au>
                  <snm>Wapinski</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pfeffer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Friedman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Regev</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>449</volume>
            <fpage>54</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06107</pubid>
                  <pubid idtype="pmpid" link="fulltext">17805289</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Phosphoproteomics of the <it>Arabidopsis</it> plasma membrane and a new phosphorylation site database.</p>
            </title>
            <aug>
               <au>
                  <snm>Nuhse</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Stensballe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>ON</fnm>
               </au>
               <au>
                  <snm>Peck</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>2394</fpage>
            <lpage>2405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">520941</pubid>
                  <pubid idtype="pmpid" link="fulltext">15308754</pubid>
                  <pubid idtype="doi">10.1105/tpc.104.023150</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The Pfam protein families database.</p>
            </title>
            <aug>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Tate</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Coggill</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Sammut</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Hotz</snm>
                  <fnm>HR</fnm>
               </au>
               <au>
                  <snm>Ceric</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Forslund</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2008</pubdate>
            <volume>36</volume>
            <fpage>D281</fpage>
            <lpage>D288</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2238907</pubid>
                  <pubid idtype="pmpid" link="fulltext">18039703</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm960</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Quantitative phosphoproteomics of early elicitor signaling in <it>Arabidopsis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Benschop</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Mohammed</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>O'Flaherty</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Heck</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Slijper</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Menke</snm>
                  <fnm>FL</fnm>
               </au>
            </aug>
            <source>Mol Cell Proteomics</source>
            <pubdate>2007</pubdate>
            <volume>6</volume>
            <fpage>1198</fpage>
            <lpage>1214</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/mcp.M600429-MCP200</pubid>
                  <pubid idtype="pmpid" link="fulltext">17317660</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Large-scale phosphorylation analysis of mouse liver.</p>
            </title>
            <aug>
               <au>
                  <snm>Villen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Beausoleil</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Gygi</snm>
                  <fnm>SP</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>1488</fpage>
            <lpage>1493</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1785252</pubid>
                  <pubid idtype="pmpid" link="fulltext">17242355</pubid>
                  <pubid idtype="doi">10.1073/pnas.0609836104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Evidence for a minimal eukaryotic phosphoproteome?</p>
            </title>
            <aug>
               <au>
                  <snm>Diks</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Parikh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>van der Sijde</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Joore</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ritsema</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Peppelenbosch</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>PLoS ONE</source>
            <pubdate>2007</pubdate>
            <volume>2</volume>
            <fpage>e777</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1945084</pubid>
                  <pubid idtype="pmpid" link="fulltext">17712425</pubid>
                  <pubid idtype="doi">10.1371/journal.pone.0000777</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Global analysis of protein phosphorylation in yeast.</p>
            </title>
            <aug>
               <au>
                  <snm>Ptacek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Devgan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Michaud</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Fasolo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Guo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jona</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Breitkreutz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sopko</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>McCartney</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Rachidi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Mah</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Meng</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stark</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>De Virgilio</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tyers</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schweitzer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Predki</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>438</volume>
            <fpage>679</fpage>
            <lpage>684</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04187</pubid>
                  <pubid idtype="pmpid" link="fulltext">16319894</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Comparative conservation analysis of the human mitotic phosphoproteome.</p>
            </title>
            <aug>
               <au>
                  <snm>Malik</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nigg</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Korner</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <fpage>1426</fpage>
            <lpage>1432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btn197</pubid>
                  <pubid idtype="pmpid" link="fulltext">18426804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phosphoproteome analysis of <it>E. coli</it> reveals evolutionary conservation of bacterial Ser/Thr/Tyr phosphorylation.</p>
            </title>
            <aug>
               <au>
                  <snm>Macek</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gnad</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Soufi</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Mijakovic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Cell Proteomics</source>
            <pubdate>2008</pubdate>
            <volume>7</volume>
            <fpage>299</fpage>
            <lpage>307</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17938405</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties.</p>
            </title>
            <aug>
               <au>
                  <snm>Boekhorst</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>356</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2048517</pubid>
                  <pubid idtype="pmpid" link="fulltext">17888146</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-8-356</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Six-fold speed-up of Smith-Waterman sequence database searches using parallel processing on common microprocessors.</p>
            </title>
            <aug>
               <au>
                  <snm>Rognes</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Seeberg</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>699</fpage>
            <lpage>706</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.8.699</pubid>
                  <pubid idtype="pmpid" link="fulltext">11099256</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A curated compendium of phosphorylation motifs.</p>
            </title>
            <aug>
               <au>
                  <snm>Amanchy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Periaswamy</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mathivanan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reddy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tattikota</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Pandey</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2007</pubdate>
            <volume>25</volume>
            <fpage>285</fpage>
            <lpage>286</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt0307-285</pubid>
                  <pubid idtype="pmpid" link="fulltext">17344875</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Profile hidden Markov models.</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.755</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>QuickTree: building huge neighbour-joining trees of protein sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>1546</fpage>
            <lpage>1547</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/18.11.1546</pubid>
                  <pubid idtype="pmpid" link="fulltext">12424131</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>TreeView: an application to display phylogenetic trees on personal computers.</p>
            </title>
            <aug>
               <au>
                  <snm>Page</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <fpage>357</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8902363</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Global, in vivo, and site-specific phosphorylation dynamics in signaling networks.</p>
            </title>
            <aug>
               <au>
                  <snm>Olsen</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Blagoev</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gnad</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Macek</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mortensen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>127</volume>
            <fpage>635</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2006.09.026</pubid>
                  <pubid idtype="pmpid" link="fulltext">17081983</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Analysis of phosphorylation sites on proteins from Saccharomyces cerevisiae by electron transfer dissociation (ETD) mass spectrometry.</p>
            </title>
            <aug>
               <au>
                  <snm>Chi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Huttenhower</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Geer</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Coon</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Syka</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Bai</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Shabanowitz</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Burke</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Troyanskaya</snm>
                  <fnm>OG</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>DF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <fpage>2193</fpage>
            <lpage>2198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1892997</pubid>
                  <pubid idtype="pmpid" link="fulltext">17287358</pubid>
                  <pubid idtype="doi">10.1073/pnas.0607084104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
