<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-10-24</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Signature proteins for the major clades of Cyanobacteria</p>
         </title>
         <aug>
            <au ca="yes" id="A1">
               <snm>Gupta</snm>
               <mi>S</mi>
               <fnm>Radhey</fnm>
               <insr iid="I1"/>
               <email>gupta@mcmaster.ca</email>
            </au>
            <au id="A2">
               <snm>Mathews</snm>
               <mi>W</mi>
               <fnm>Divya</fnm>
               <insr iid="I1"/>
               <email>divyarw@hotmail.com</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biochemistry and Biomedical Sciences, McMaster University, Hamilton, Ontario, Canada L8N 3Z5</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2010</pubdate>
         <volume>10</volume>
         <issue>1</issue>
         <fpage>24</fpage>
         <url>http://www.biomedcentral.com/1471-2148/10/24</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">20100331</pubid>
               <pubid idtype="doi">10.1186/1471-2148-10-24</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>4</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>25</day>
               <month>1</month>
               <year>2010</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>25</day>
               <month>1</month>
               <year>2010</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2010</year>
         <collab>Gupta and Mathews; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The phylogeny and taxonomy of cyanobacteria is currently poorly understood due to paucity of reliable markers for identification and circumscription of its major clades.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A combination of phylogenomic and protein signature based approaches was used to characterize the major clades of cyanobacteria. Phylogenetic trees were constructed for 44 cyanobacteria based on 44 conserved proteins. In parallel, Blastp searches were carried out on each ORF in the genomes of <it>Synechococcus WH8102, Synechocystis PCC6803, Nostoc PCC7120, Synechococcus JA-3-3Ab, Prochlorococcus MIT9215 </it>and <it>Prochlor. marinus subsp. marinus CCMP1375 </it>to identify proteins that are specific for various main clades of cyanobacteria. These studies have identified 39 proteins that are specific for all (or most) cyanobacteria and large numbers of proteins for other cyanobacterial clades. The identified signature proteins include: (i) 14 proteins for a deep branching clade (Clade A) of <it>Gloebacter violaceus </it>and two diazotrophic <it>Synechococcus </it>strains (JA-3-3Ab and JA2-3-B'a); (ii) 5 proteins that are present in all other cyanobacteria except those from Clade A; (iii) 60 proteins that are specific for a clade (Clade C) consisting of various marine unicellular cyanobacteria (viz. <it>Synechococcus </it>and <it>Prochlorococcus</it>); (iv) 14 and 19 signature proteins that are specific for the Clade C <it>Synechococcus </it>and <it>Prochlorococcus </it>strains, respectively; (v) 67 proteins that are specific for the Low B/A ecotype <it>Prochlorococcus </it>strains, containing lower ratio of <it>chl b/a</it><sub>2 </sub>and adapted to growth at high light intensities; (vi) 65 and 8 proteins that are specific for the <it>Nostocales </it>and <it>Chroococcales </it>orders, respectively; and (vii) 22 and 9 proteins that are uniquely shared by various <it>Nostocales </it>and <it>Oscillatoriales </it>orders, or by these two orders and the <it>Chroococcales</it>, respectively. We also describe 3 conserved indels in flavoprotein, heme oxygenase and protochlorophyllide oxidoreductase proteins that are specific for either Clade C cyanobacteria or for various subclades of <it>Prochlorococcus</it>. Many other conserved indels for cyanobacterial clades have been described recently.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>These signature proteins and indels provide novel means for circumscription of various cyanobacterial clades in clear molecular terms. Their functional studies should lead to discovery of novel properties that are unique to these groups of cyanobacteria.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="refman" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Cyanobacteria are the sole prokaryotic group that carries out oxygenic photosynthesis. The species from this phylum exhibit enormous diversity in terms of their morphology, physiology and other characteristics (e.g. motility, thermophily, cell division characteristic, nitrogen fixation ability, etc.) <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. The taxonomy and evolutionary relationships among cyanobacteria is presently poorly understood. In the 16S rRNA trees, which provides the current basis for understanding microbial phylogeny, cyanobacteria species/strains form 14 unresolved clusters <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Although cyanobacteria is a large phylum with >4000 isolates <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, only a small number of species and higher taxonomic groups within this phylum have been validly described <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Except for 16S rRNA, sequence information for cyanobacteria for other genes/proteins sequences until recently was very limited. Hence, the availability of genome sequences has provided new opportunities for understanding cyanobacterial phylogeny and taxonomy. Based upon these sequences, several investigators have assembled phylogenetic trees for cyanobacteria based upon combined sequences for different large sets of proteins. These studies have included analyses of 14 cyanobacteria based upon 34 proteins by Sanchez-Barcaldo et al. <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, trees for 24 cyanobacteria based upon 583 orthologous proteins by Swingley et al. <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, and branching patterns of 13 cyanobacteria based upon 682 proteins by Shi and Falkowski <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Additionally, Zhaxybayeva et al. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> have examined individual phylogenies of 1128 protein-coding genes from 11 cyanobacterial genomes to identify phylogenetic signal exhibited by the plurality of these proteins and to recognize the incidence of lateral gene transfers. These studies have proven very useful in establishing the existence of certain important clades within the sequenced cyanobacteria and in clarifying their relative branching positions <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>The studies of the above kind, although very useful, are limited to species whose genomes are sequenced. Further, as indicated by earlier work <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, integration of sequence information from any new genome by this approach requires reassembly of the entire phylogenomic tree(s). Based upon the phylogenomic approach it is also difficult to circumscribe various cyanobacterial clades in definitive biochemical or molecular terms, which is important for developing a stable taxonomy <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Hence, it is important to identify other reliable molecular markers that are consistent with the results of phylogenomic studies, but which can also be used to circumscribe different phylogenetic clades in more definitive (molecular) terms. One approach that has proven very useful in this regard consists of identifying molecular markers or synapomorphies that are specific for different phylogenetically defined clades. Two different kinds of molecular markers are proving very useful for these studies. The first of these consists of conserved inserts and deletions (indels) in widely distributed proteins that are distinctive characteristics of either a given phylum or its different main subgroups <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Our recent work has identified >40 conserved indels in important proteins that are exclusively present in either all cyanobacteria or many of its major clades that are observed in phylogenomic trees <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. The presence of several of these indels in the plants/plastids homologs has also provided evidence for the derivation of plastids from cyanobacterial ancestors <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. The second kind of molecular markers consists of whole proteins that are uniquely found in various species from a given phylogenetic clade <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>. Martin et al. <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> have earlier reported Blast analysis on 8 cyanobacterial genomes (6 finished and 2 unfinished) to identify 181 proteins that were uniquely found in at least 7 out of 8 of these cyanobacteria. A later study by Mulkidjanian et al. <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> on 15 cyanobacterial genomes identified 50 proteins that were uniquely present in at least 14 out of 15 cyanobacteria and 84 others that were exclusively present in plants/plastids and cyanobacteria.</p>
         <p>These earlier studies primarily looked for proteins that were uniquely found in most cyanobacteria and no work was carried out on identifying proteins that are specific for various main clades of cyanobacteria, observed in phylogenetic trees. In the past 2-3 years, the number of sequenced cyanobacterial genomes has also more than doubled to a total of 36 genomes. Hence, it was of much interest to carry out both phylogenomic as well as gene content analyses on these genomes to identify signature proteins that are distinctive characteristics of either all cyanobacteria or its various main clades in the phylogenomic trees.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Phylogenomic/phylogenetic analyses on Cyanobacteria</p>
            </st>
            <p>Prior to undertaking studies on identifying proteins that are specific for different cyanobacterial clades, it was necessary to determine the branching pattern of sequenced cyanobacteria in phylogenetic trees. Although detailed phylogenetic studies have been previously reported for a limited numbers of cyanobacteria <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, sequence information for many other genomes has become available in the past 2-3 years (see Table <tblr tid="T1">1</tblr>). Hence, it was necessary to carry out phylogenetic studies on all of these cyanobacteria to determine their branching pattern. The phylogenetic trees are now commonly constructed based on concatenated sequences for large number of proteins <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B31">31</abbr></abbrgrp>. Their main advantage is that because they are based on large numbers of characters derived from many independent proteins, they are generally considered to provide a better reflection of organismal phylogeny than trees based on any single gene or protein, where the observed relationship could be affected by various factors including lateral gene transfer, differences in evolutionary rates among species, long branch attraction effect, etc. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. However, it should be recognized that the trees based on concatenated sequences, due to the possibility of their lumping together gene sequences with discordant evolutionary histories, can sometime result in unreliable inferences <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. In the present work, phylogenetic trees were constructed based on a combined sequence alignment for 44 widely distributed proteins (see additional file <supplr sid="S1">1</supplr>) from 44 cyanobacterial species/isolates for which sequence information was available (see Materials and Methods). Most of these proteins carry out important housekeeping functions, and they are universally present in various species <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, making them a good choice for phylogenetic analysis.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>List of Cyanobacterial Genomes Studied in this work</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Species Name</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Genome size (Mb)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>GC content %</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Protein Number</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Genome</b>
                        </p>
                        <p>
                           <b>Reference</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Center/Pubmed ID</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Acaryochloris marina MBIC11017</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8.36</p>
                     </c>
                     <c ca="center">
                        <p>47.0</p>
                     </c>
                     <c ca="center">
                        <p>6254</p>
                     </c>
                     <c ca="left">
                        <p>NC_009925.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B45">45</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Anabaena variabilis ATCC 29413</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7.07</p>
                     </c>
                     <c ca="center">
                        <p>41.4</p>
                     </c>
                     <c ca="center">
                        <p>5043</p>
                     </c>
                     <c ca="left">
                        <p>NC_007413.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE JGI</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Gloeobacter violaceus PCC 7421</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4.66</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c ca="center">
                        <p>4430</p>
                     </c>
                     <c ca="left">
                        <p>NC_005125.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B36">36</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Cyanothece sp. ATCC 51142</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5.43</p>
                     </c>
                     <c ca="center">
                        <p>37.9</p>
                     </c>
                     <c ca="center">
                        <p>4762</p>
                     </c>
                     <c ca="left">
                        <p>NC_010546.1</p>
                     </c>
                     <c ca="left">
                        <p>Washington University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Cyanothece sp. PCC 8801</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4.81</p>
                     </c>
                     <c ca="center">
                        <p>39.8</p>
                     </c>
                     <c ca="center">
                        <p>4260</p>
                     </c>
                     <c ca="left">
                        <p>NC_011726.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE JGI</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nostoc sp. PCC 7120</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7.21</p>
                     </c>
                     <c ca="center">
                        <p>41.3</p>
                     </c>
                     <c ca="center">
                        <p>5366</p>
                     </c>
                     <c ca="left">
                        <p>NC_003272.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B48">48</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Microcystis aeruginosa NIES-843</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>5.8</p>
                     </c>
                     <c ca="center">
                        <p>42.3</p>
                     </c>
                     <c ca="center">
                        <p>6312</p>
                     </c>
                     <c ca="left">
                        <p>NC_010296.1</p>
                     </c>
                     <c ca="left">
                        <p>Kazusa</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Nostoc punctiforme PCC73102</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>8.2</p>
                     </c>
                     <c ca="center">
                        <p>41.4</p>
                     </c>
                     <c ca="center">
                        <p>6087</p>
                     </c>
                     <c ca="left">
                        <p>NC_010628.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE JGI</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. AS9601</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>31.3</p>
                     </c>
                     <c ca="center">
                        <p>1921</p>
                     </c>
                     <c ca="left">
                        <p>NC_008816.1</p>
                     </c>
                     <c ca="left">
                        <p>J. Craig Venter Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9211</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>39.7</p>
                     </c>
                     <c ca="center">
                        <p>1855</p>
                     </c>
                     <c ca="left">
                        <p>NC_009976.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B40">40</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9215</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>31.1</p>
                     </c>
                     <c ca="center">
                        <p>1983</p>
                     </c>
                     <c ca="left">
                        <p>NC_009840.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE JGI</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9301</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.6</p>
                     </c>
                     <c ca="center">
                        <p>31.3</p>
                     </c>
                     <c ca="center">
                        <p>1907</p>
                     </c>
                     <c ca="left">
                        <p>NC_009091.1</p>
                     </c>
                     <c ca="left">
                        <p>GBM Foundation</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9303</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.7</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>2997</p>
                     </c>
                     <c ca="left">
                        <p>NC_008820.1</p>
                     </c>
                     <c ca="left">
                        <p>J. Craig Venter Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9312</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.71</p>
                     </c>
                     <c ca="center">
                        <p>31.2</p>
                     </c>
                     <c ca="center">
                        <p>1810</p>
                     </c>
                     <c ca="left">
                        <p>NC_007577.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE JGI.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9313</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.41</p>
                     </c>
                     <c ca="center">
                        <p>50.7</p>
                     </c>
                     <c ca="center">
                        <p>2269</p>
                     </c>
                     <c ca="left">
                        <p>NC_005071.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B40">40</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. MIT 9515</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>30.8</p>
                     </c>
                     <c ca="center">
                        <p>1906</p>
                     </c>
                     <c ca="left">
                        <p>NC_008817.1</p>
                     </c>
                     <c ca="left">
                        <p>J. Craig Venter Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. NATL1A</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.9</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>2193</p>
                     </c>
                     <c ca="left">
                        <p>NC_008819.1</p>
                     </c>
                     <c ca="left">
                        <p>J. Craig Venter Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus str. NATL2A</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.8</p>
                     </c>
                     <c ca="center">
                        <p>35.1</p>
                     </c>
                     <c ca="center">
                        <p>2163</p>
                     </c>
                     <c ca="left">
                        <p>NC_007335.2</p>
                     </c>
                     <c ca="left">
                        <p>DOE Joint Genome Inst.</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus subsp. marinus str. CCMP1375</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.75</p>
                     </c>
                     <c ca="center">
                        <p>36.4</p>
                     </c>
                     <c ca="center">
                        <p>1883</p>
                     </c>
                     <c ca="left">
                        <p>NC_005042.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B51">51</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Prochloro. marinus subsp. pastoris str. CCMP1986</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1.7</p>
                     </c>
                     <c ca="center">
                        <p>30.8</p>
                     </c>
                     <c ca="center">
                        <p>1717</p>
                     </c>
                     <c ca="left">
                        <p>NC_005072.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B40">40</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus elongatus PCC 6301</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.7</p>
                     </c>
                     <c ca="center">
                        <p>55.5</p>
                     </c>
                     <c ca="center">
                        <p>2527</p>
                     </c>
                     <c ca="left">
                        <p>NC_006576.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B83">83</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus elongatus PCC 7942</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.75</p>
                     </c>
                     <c ca="center">
                        <p>55.4</p>
                     </c>
                     <c ca="center">
                        <p>2612</p>
                     </c>
                     <c ca="left">
                        <p>NC_007604.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE JGI</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. CC9311</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.61</p>
                     </c>
                     <c ca="center">
                        <p>52.4</p>
                     </c>
                     <c ca="center">
                        <p>2892</p>
                     </c>
                     <c ca="left">
                        <p>NC_008319.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B53">53</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. CC9605</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.51</p>
                     </c>
                     <c ca="center">
                        <p>59.2</p>
                     </c>
                     <c ca="center">
                        <p>2645</p>
                     </c>
                     <c ca="left">
                        <p>NC_007516.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B84">84</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. CC9902</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.23</p>
                     </c>
                     <c ca="center">
                        <p>54.2</p>
                     </c>
                     <c ca="center">
                        <p>2307</p>
                     </c>
                     <c ca="left">
                        <p>NC_007513.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B84">84</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. JA-2-3B'a(2-13)</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>3.05</p>
                     </c>
                     <c ca="center">
                        <p>58.5</p>
                     </c>
                     <c ca="center">
                        <p>2862</p>
                     </c>
                     <c ca="left">
                        <p>NC_007776.1</p>
                     </c>
                     <c ca="left">
                        <p>TIGR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. JA-3-3Ab</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.93</p>
                     </c>
                     <c ca="center">
                        <p>60.2</p>
                     </c>
                     <c ca="center">
                        <p>2760</p>
                     </c>
                     <c ca="left">
                        <p>NC_007775.1</p>
                     </c>
                     <c ca="left">
                        <p>TIGR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. RCC307</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.2</p>
                     </c>
                     <c ca="center">
                        <p>60.8</p>
                     </c>
                     <c ca="center">
                        <p>2535</p>
                     </c>
                     <c ca="left">
                        <p>NC_009482.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B84">84</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. WH7803</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.4</p>
                     </c>
                     <c ca="center">
                        <p>60.2</p>
                     </c>
                     <c ca="center">
                        <p>2533</p>
                     </c>
                     <c ca="left">
                        <p>NC_009481.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B84">84</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. PCC 7002</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>3.4</p>
                     </c>
                     <c ca="center">
                        <p>49.2</p>
                     </c>
                     <c ca="center">
                        <p>2823</p>
                     </c>
                     <c ca="left">
                        <p>NC_010475.1</p>
                     </c>
                     <c ca="left">
                        <p>Penn. State University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechococcus sp. WH8102</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.43</p>
                     </c>
                     <c ca="center">
                        <p>59.4</p>
                     </c>
                     <c ca="center">
                        <p>2519</p>
                     </c>
                     <c ca="left">
                        <p>NC_005070.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B52">52</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Synechocystis sp. PCC 6803</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>3.95</p>
                     </c>
                     <c ca="center">
                        <p>47.4</p>
                     </c>
                     <c ca="center">
                        <p>3172</p>
                     </c>
                     <c ca="left">
                        <p>NC_000911.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B85">85</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermosynechococcus elongatus BP-1</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2.59</p>
                     </c>
                     <c ca="center">
                        <p>53.9</p>
                     </c>
                     <c ca="center">
                        <p>2476</p>
                     </c>
                     <c ca="left">
                        <p>NC_004113.1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <abbrgrp>
                              <abbr bid="B46">46</abbr>
                           </abbrgrp>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Trichodesmium erythraeum IMS101</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>7.8</p>
                     </c>
                     <c ca="center">
                        <p>34.1</p>
                     </c>
                     <c ca="center">
                        <p>4451</p>
                     </c>
                     <c ca="left">
                        <p>NC_008312.1</p>
                     </c>
                     <c ca="left">
                        <p>DOE Joint Genome Inst.</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Abbreviations: DOE-JGI, Department of Energy Joint Genome Institute; TIGR, The Institute of Genome Research; GBM, Gordon &amp; Betty Moore. The genome of <it>Crocosphaera watsonii </it>WH8501 was not fully sequenced.</p>
               </tblfn>
            </tbl>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>List of proteins used in phylogenetic analyses</b>. The information for various proteins regarding their lengths, accession numbers, Gene bank IDs, locus tag for Nostoc sp. PCC7120 and COG groups is provided.</p>
               </text>
               <file name="1471-2148-10-24-S1.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>A rooted maximum likelihood (ML) distance tree based on the combined sequences for these proteins is shown in Fig. <figr fid="F1">1</figr> and a neighbour-joining (NJ) tree for the same dataset is provided as additional file <supplr sid="S2">2</supplr>. A number of distinct clades of cyanobacteria were observed in both these trees. Very similar branching patterns and the grouping of cyanobacterial species in various clades have been observed in earlier studies based on other large and independent datasets of protein sequences <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>, giving confidence in the observed results. One of the observed clades, referred to here as Clade A, consists of <it>Gloebacter violaceus </it>and <it>Synechococcus sps</it>. (JA-3-3Ab and JA2-3-B'a). The ML and NJ tree differ from each other in the branching position of this clade. In the ML tree, the Clade A species/strains formed the deepest branching lineage within cyanobacteria. In contrast, in the NJ tree, the cyanobacteria were divided into two main clades at the deepest level and the Clade A formed the outermost branch of one of these clades, separated from all other species/strains by a long branch (additional file <supplr sid="S2">2</supplr>). However, the branching of Clade A in this position is not reliable, as in our recent studies based on the same dataset of protein sequences but with smaller numbers of cyanobacteria, the clade A species/strains branched in the same position as seen here in the ML tree <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The deep branching of Clade A species/strains has also been observed in a number of earlier studies based on different datasets of protein sequences <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B6">6</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B23">23</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. Further strong and independent evidence that the Clade A species/strains constitutes the earliest branching lineage within sequenced cyanobacteria is provided by our recent identification of several conserved indels in broadly distributed proteins (viz. 18 aa insert in DNA polymerase I, 4-5 aa insert in the tryptophan synthase beta chain, 4 aa insert in tryptophanyl-tRNA synthetase and a 2 aa insert in the DNA polymerase III) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The indicated conserved inserts in these proteins are commonly shared by all other sequenced cyanobacteria, but they are lacking in Clade A as well as all other phyla of bacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. The species distributions of these conserved indels strongly indicate that these synapomorphies were introduced in a common ancestor of various other cyanobacteria after the branching of Clade A. In a recent proposal for the classification of cyanobacteria, the thylakoids lacking <it>Gloebacterales </it>are placed into a separate subclass (Gloebacterophycidae) <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. It is unclear whether the <it>Synechococcus sps</it>. (JA-3-3Ab and JA2-3-B'a), which group with <it>G. violaceus</it>, also lack thylakoids or not.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>A maximum-likelihood distance tree for sequenced cyanobacteria based on concatenated sequences for 44 conserved proteins</p>
               </caption>
               <text>
                  <p><b>A maximum-likelihood distance tree for sequenced cyanobacteria based on concatenated sequences for 44 conserved proteins</b>. The distance scale (bar = 0.1 substitutions per site) is shown in the top right hand corner. The tree was rooted using <it>B. subtilis </it>and <it>S. aureus </it>sequences. The numbers at the nodes indicate % of puzzling quartets supporting various nodes. The low B/A ecotype clade refers to the <it>Prochlorococcus </it>spp. containing lower ratio of chlorophyll <b>b/a<sub>2 </sub></b>that are adapted to growth at high light intensities.</p>
               </text>
               <graphic file="1471-2148-10-24-1" hint_layout="double"/>
            </fig>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Neighbour-joining tree for the sequenced Cyanobacteria</b>. A neighbour-joining, bootstrapped tree for 44 cyanobacteria based on concatenated sequences for 44 proteins listed in additional file <supplr sid="S1">1</supplr>. The sequences for <it>B. subtilis </it>and <it>S. aureus </it>were used to root this tree.</p>
               </text>
               <file name="1471-2148-10-24-S2.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Most other cyanobacteria could be grouped into two main clades in these trees. One of these clades (designated here as Clade B) is comprised of diverse cyanobacteria including <it>Thermosynechococcus, Acaryochloris</it>, as well as other cyanobacterial groups such as <it>Chroococcales </it>(<it>Synechocystis/Crocosphaera/Microcystis/Cyanothece</it>), <it>Nostocales </it>(<it>Nostoc/Nodularia/Anabaena</it>) and <it>Oscillatoriales </it>(<it>Trichodesmium/Lynbya</it>)<abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Within Clade B, a subclade comprising of the <it>Chroococcales, Nostocales </it>and <it>Oscillatoriales </it>is also observed in both ML and NJ trees (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>). The other main clade (clade C) is composed entirely of different strains/isolates of marine unicellular <it>Prochlorococcus </it>and <it>Synechococcus </it>cyanobacteria. This latter clade has been referred to as the Syn/Pro clade <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and it corresponds to the subclass Synechococcophycidae in the proposal by Hoffman et al. <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Within clade C, different <it>Prochlorococcus </it>and <it>Synechococcus </it>strains/isolates were not completely separated from each other. In particular, two of the <it>Prochlorococcus </it>strains, MIT 9303 and MIT 9313, branched within the <it>Synechoccous </it>strains/isolates, in both ML and NJ trees (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>). Similar polyphyletic branching of these strains has been observed in earlier studies <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B23">23</abbr></abbrgrp>. However, in both these trees, one subclade of <it>Prochlorococcus </it>strains, which is referred to as the low B/A ecotype subgroup <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>, was separated from all others <it>Prochlorococcus </it>strains by a long-branch. The branching position of the freshwater unicellular cyanobacterium <it>Synechococcus elongatus </it>(strains PCC 6301 and PCC 7942), although it appeared as a deep branching lineage of Clade C, was uncertain in these trees (discussed later).</p>
         </sec>
         <sec>
            <st>
               <p>Signature proteins for Cyanobacteria and its major subgroups</p>
            </st>
            <p>These phylogenetic trees provide a framework for identifying proteins that are specific for either all cyanobacteria or their different well-resolved clades. Based upon earlier studies, within any given group of bacteria or organisms, signature proteins are present at various phylogenetic depths <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Hence, to identify proteins that are specific for different main clades of cyanobacteria, Blastp searches were carried out on each ORF in the genomes of the following 6 cyanobacteria: <it>Synechococcus sp. WH8102, Synechocystis sp. PCC6803, Nostoc sp. PCC7120, Synechococcus sp. JA-3-3Ab, Prochlorococcus sp. MIT9215 </it>and <it>Pro. marinus subsp. marinus str. CCMP1375</it>. These cyanobacteria are present at the tips of various clades in phylogenetic trees (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>). Hence, blast searches with the proteins in them should enable us to identify proteins that are specific for various main clades of cyanobacteria at different phylogenetic depths. The results of these studies are summarized below.</p>
         </sec>
         <sec>
            <st>
               <p>Signature proteins that are specific for Cyanobacteria</p>
            </st>
            <p>Blast searches on the above genomes have identified 39 proteins that are specific for cyanobacteria and which are present in virtually all of the sequenced genomes (Table <tblr tid="T2">2a</tblr>). Thirty-three of these proteins are present in all sequenced cyanobacteria (Table <tblr tid="T2">2a</tblr>) whereas the remaining 6 (marked with *) are missing in 1-2 isolated species/strains. The homologs of some of these proteins are also found in a few algae or plants. Because of their specific presence in practically all cyanobacteria, but generally no other bacteria, these proteins could be regarded as the cyanobacterial signature proteins. The number of cyanobacterial signature proteins identified in the present work is much smaller than those reported in earlier studies <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. However, this difference is mainly due to the large increase in the number of sequenced cyanobacterial as well as other genomes in the past few years. In earlier work, we have also described 15 conserved indels in broadly distributed proteins that are distinctive characteristics of all available cyanobacteria and which are not found in any other bacterial groups/phyla <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Cyanobacterial Signature Proteins</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(a) Protein that are Uniquely found in All (or most) Cyanobacteria</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_439901/slr0613</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (173)</p>
                     </c>
                     <c ca="left">
                        <p>NP_441893/ssl0242</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (78)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_439967/slr1122</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (329)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442014/sll0350*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (803)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_439995/slr0729<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (101)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442026/slr0376</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (116)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440139/slr1796</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (201)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442147/sll0208*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (231)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440262/ssl1972</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (93)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442176/sll0413*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (207)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440437/slr2049<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (192)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442207/ssr0109</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (78)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440459/slr1915</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (104)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442330/sll0372 <sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (196)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440545/ssr2843<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (87)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442365/ssr0332</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (70)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440678/slr1900 <sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (247)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442366/slr0211</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (403)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440903/sll1271</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (572)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442402/slr0921</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (128)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440946/sll0860</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (173)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442464/sll0822<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (129)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441021/ssr3189</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (55)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442734/slr0042</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (576)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441047/slr2144*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (301)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442826/sll1340</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (85)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441164/ssr2087</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (84)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442884/slr1557</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (369)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441199/slr1990</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (240)</p>
                     </c>
                     <c ca="left">
                        <p>NP_442932/slr0748<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (230)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441265/ssl0461*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (83)</p>
                     </c>
                     <c ca="left">
                        <p>NP_443015/sll1109</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (194)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441307/sll1979</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (142)</p>
                     </c>
                     <c ca="left">
                        <p>NP_484529/asr0485<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (92)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441346/ssr2551</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (94)</p>
                     </c>
                     <c ca="left">
                        <p>NP_440513/slr1384</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (391)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441647/slr1160*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (204)</p>
                     </c>
                     <c ca="left">
                        <p>NP_0010358/slr1146</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (89)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441848/sll0359</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (155)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(b) Proteins Specific for Various Cyanobacteria Except those from Clade A</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_439997/slr0731</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (402)</p>
                     </c>
                     <c ca="left">
                        <p>NP_441174/slr1260</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (177)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440149/slr1800</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (355)</p>
                     </c>
                     <c ca="left">
                        <p>NP_441937/slr1949</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (212)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441115/sll0854</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (308)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(c) Proteins Specific for Various Cyanobacteria Except those from Clade C</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440495/sll0984</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (148)</p>
                     </c>
                     <c ca="left">
                        <p>NP_441597/slr1276</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (275)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440591/slr2025</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (153)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485360/all1317</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (147)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440594/sll1915</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (183)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488024/all3984</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (231)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440896/sll1274</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (171)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488046/all4006</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (127)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441155/sll1155*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (113)</p>
                     </c>
                     <c ca="left">
                        <p>NP_484683/asl0639</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (73)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484163/all0119*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (137)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485187/alr1144*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (290)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484255/all0211*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (126)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* - missing in 1-2 species</p>
                  <p><sup>a </sup>significant similarity also seen for 1-2 other bacteria</p>
                  <p><sup>+ </sup>also found in some algae and mosses</p>
                  <p>Clade A is comprised of <it>G. violaceus, Synechococcus sp. JA-3-3Ab and Synechococcus sp. JA-2-3B'a</it></p>
                  <p>Clade C is comprised of most of the <it>Synechococcus </it>and all <it>Prochlorococcus sps</it>.</p>
               </tblfn>
            </tbl>
            <p>These analyses have also identified 5 proteins whose homologs are present in all other cyanobacteria, except those from Clade A (Table <tblr tid="T2">2b</tblr>). Based upon solely the genomic distributions of these proteins, it is difficult to interpret whether the genes for these proteins first evolved in a common ancestor of all cyanobacteria followed by their loss in Clade A species/strains, or they originally evolved in a common ancestor of the Clade B and C cyanobacteria after the branching of Clade A. However, based upon the results of phylogenomic analyses, and more importantly the species distribution patterns of several conserved indels in widely distributed proteins that provide evidence that the Clade A is ancestral to other cyanobacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, the most parsimonious explanation for the observed distribution of these genes is that they first evolved in a common ancestor of the Clade B and C cyanobacteria, as indicated in Fig. <figr fid="F2">2</figr>. Table <tblr tid="T2">2c</tblr> lists 13 other proteins for which high scoring homologs are present in all (or most) cyanobacteria from Clades A and B, but which are lacking in Clade C strains/isolates. Because of the deep branching of Clade A, it is likely that the genes for these proteins also first evolved in a common ancestor of cyanobacteria, followed by their loss in an ancestor of Clade C. The alternate possibility that the Clade A and B cyanobacteria shared a common ancestor exclusive of Clade C is not supported by the species distribution pattern of conserved indels in several proteins, as noted above. Blast searches with proteins in the genome of <it>Synechococcus sp. JA-3-3Ab </it>have also identified 14 proteins that are specific for the Clade A cyanobacteria (additional file <supplr sid="S3">3</supplr>). The Clade A species/strains can also be distinguished from other cyanobacteria based upon a 15 aa conserved insert in the protein synthesis elongation factor-G that is specific for this clade <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>An interpretive cladogram indicating the evolutionary stages where genes for different signature proteins described in this work, which are specific for different groups of cyanobacteria, likely evolved</p>
               </caption>
               <text>
                  <p><b>An interpretive cladogram indicating the evolutionary stages where genes for different signature proteins described in this work, which are specific for different groups of cyanobacteria, likely evolved</b>. Many conserved indels that are specific for the same groups/clades of cyanobacteria, have also been described in recent work <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
               </text>
               <graphic file="1471-2148-10-24-2" hint_layout="double"/>
            </fig>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Proteins that are specific for the Clade A of Cyanobacteria.</b>. All of the proteins listed in this Table are specific for Clade A, which consists of <it>G. violaceus </it>and <it>Synechococcus sps. JA-3-3Ab </it>and <it>JA-2-3B'a</it>.</p>
               </text>
               <file name="1471-2148-10-24-S3.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Signature proteins for the Clade B cyanobacteria</p>
            </st>
            <p>The Clade B comprises the majority of known cyanobacteria except the unicellular marine cyanobacteria (Clade C) and some deep branching cyanobacteria (see Fig. <figr fid="F1">1</figr>). This clade as defined in our work includes all of the species/strains from the orders <it>Chroococcales, Nostocales </it>and <it>Oscillatoriales </it>as well as the deeper branching cyanobacteria, <it>A. marina </it>and <it>Thermosyn. elongatus</it>. Of these latter cyanobacteria, <it>Acaryochloris </it>is unique in containing chlorophyll d as its primary photosynthetic pigment <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, whereas <it>Thermosynechococcus </it>is a unicellular thermophilic cyanobacterium <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Our analyses have identified 38 proteins that are uniquely shared by all or most of the species/strains from this clade. Two of the <it>Synechococcus </it>strains viz. PCC7002 and PCC7335, also consistently appeared in this group and of these <it>Synechococcus </it>PCC7002, for which sequence information was available from various cyanobacteria, branched with the <it>Chroococcales </it>in phylogenetic trees (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>).</p>
            <p>The branching position of <it>Syn. elongatus </it>(strains PCC 6301 and PCC 7942) is not resolved in phylogenetic trees <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B37">37</abbr><abbr bid="B47">47</abbr></abbrgrp>. It generally branches in between the Clades B and C species/strains in phylogenetic trees (Fig. <figr fid="F1">1</figr>, additional file <supplr sid="S2">2</supplr>) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Our analyses have identified 22 proteins, which in addition to various Clade B cyanobacteria are also present in <it>Syn. elongatus </it>(Table <tblr tid="T3">3b</tblr>). It is known from earlier work that a number of cyanobacteria contain split DnaE protein due to the presence of intervening inteins <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. Examination of DnaE gene/protein from various cyanobacteria indicates that the split DnaE proteins are found in all of the Clades B cyanobacteria as well as <it>Syn. elongatus</it>, whereas all other species/strains from clade A and C do not contain split DnaE <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>(Gupta, R. S., results not shown). This rare genetic characteristic together with the various proteins in Table <tblr tid="T3">3b</tblr> suggests that <it>Syn. elongatus </it>and Clade B cyanobacteria probably shared a common ancestor exclusive of other cyanobacteria.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Proteins Specific for Clade B Cyanobacteria</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(a) Protein that are Uniquely found in All (or most) Clade B Cyanobacteria</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_439990/slr0723*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (363)</p>
                     </c>
                     <c ca="left">
                        <p>NP_484675/all0631*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (130)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440199/slr0971</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (451)</p>
                     </c>
                     <c ca="left">
                        <p>NP_484710/all0666*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (348)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440305/slr0695<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (173)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485162/all1119*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (255)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440382/sll1642*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (163)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485285/alr1242*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (221)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440557/sll1573*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (104)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485393/alr1350*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (359)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440936/slr0888*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (168)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485508/all1467*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (247)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441490/sll1247*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (457)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486386/alr2346*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (104)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441696/slr1686*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (141)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486393/asl2353*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (98)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441913/sll1858*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (627)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486647/asr2607*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (65)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_442061/slr0779*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (206)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487221/all3181*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (322)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_442144/slr0217<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (140)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487892/all3852</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (281)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484091/all0047*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (531)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488032/asr3992*</p>
                     </c>
                     <c ca="left">
                        <p>photosystem II reaction center</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484127/alr0083*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (137)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488333/alr4293*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (163)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484326/all0282*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(162)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488559/all4519*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (104)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484594/asl0550*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (72)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488570/alr4530<sup>2</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (388)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484607/all0563</p>
                     </c>
                     <c ca="left">
                        <p>general secretion pathway protein (207)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488633/all4593<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (434)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484635/all0591*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (123)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488729/all4689*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (169)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484674/all0630*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (128)</p>
                     </c>
                     <c ca="left">
                        <p>NP_489127/alr5087*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (124)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(b) Proteins Specific for clade B Cyanobacteria and also <it>Synechococcus elongates</it></b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440371/ssl1918</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (97)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485176/alr1133*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (160)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_440821/slr1218</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (158)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485590/alr1550*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (119)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441017/sll1757<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (292)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486755/all2715*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (214)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441155/sll1155</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (67)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486776/all2736*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (186)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441519/slr1970*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (173)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487697/asr3657*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (120)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441527/sll1884<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (374)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488054/asl4014*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (98)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441857/ssr0657</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (103)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488538/asr4498*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (86)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_442144/slr0217<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical 140)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488628/asr4588*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (68)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_442174/ssl0788<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (97)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488797/all4757*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (116)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_442462/slr0845*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (190)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488854/alr4814*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (162)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484393/all0349*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(138)</p>
                     </c>
                     <c ca="left">
                        <p>NP_489314/all5274*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (247)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* - Missing in 1-2 species</p>
                  <p><sup>+ </sup>Also present in <it>Synechococcus sp. PCC 7335</it></p>
                  <p><sup>a </sup>A homolog showing significant similarity is also found in <it>Sorganum cellulosum</it></p>
               </tblfn>
            </tbl>
            <p>Within Clade B, the cyanobacterial species/strains belonging to the orders <it><ul>N</ul>ostocales, <ul>O</ul>scillatoriales and <ul>C</ul>hroococcales </it>form a distinct clade (NOC clade) in phylogenetic trees (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>). This clade has been referred to as the SPM clade in earlier work <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B47">47</abbr></abbrgrp>. We have recently described a number of conserved indels in important proteins (viz. a 19 aa insert in DnaE protein, a 13 aa deletion in GDP-mannose pyrophosphorylase and a 22 aa insert in NAD(P)H-quinone oxidoreductase subunit D) that are distinctive characteristics of this clade of cyanobacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. In the present work, we have identified 9 proteins (Table <tblr tid="T4">4a</tblr>) that are also uniquely present in all of the species/strains from the NOC clade of cyanobacteria. In addition, 33 other proteins listed in the additional file <supplr sid="S4">4</supplr> are also specific for the NOC clade, but they are missing in some species/strains. Within the NOC clade, species/strains belonging to the orders <it>Nostocales </it>and <it>Oscillatoriales </it>exhibit a closer relationship in phylogenetic trees (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>). A 4 aa deletion in the translation initiation factor IF-2 is also uniquely shared by various sequenced cyanobacterial species/strains from these two orders <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. In this study, we have come across 22 proteins that are specifically present in various sequenced species/strains from these two orders of cyanobacteria (Table <tblr tid="T4">4b</tblr>), providing further support that these two groups are more closely related.</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Proteins specific for the <it>Nostocales, Oscillatoriales </it>and <it>Chroococcales </it>orders.</b>. As above</p>
               </text>
               <file name="1471-2148-10-24-S4.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Proteins Specific for Different Groups within Clade B Cyanobacteria</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(a) Proteins Specific for Nostocales, Oscillatoriales and Chroococcales (NOC) Orders</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_441847/sll0360<sup>#</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (277)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486936/asr2896</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (63)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484828/asr0785</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (60)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488368/asl4328</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (68)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485335/all1292</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (142)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488902/asl4862</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (77)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485350/asr1307</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (78)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488971/all4931</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (225)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485586/alr1546</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (170)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(b) Proteins Specific for Nostocales and Oscillatoriales Orders</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484145/alr0101</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (258)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485811/all1771</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (238)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484259/all0215</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (212)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486433/alr2393</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (343)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484503/all0459*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (119)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486508/asr2468*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (76)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484625/asr0581*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (76)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486828/all2788*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (146)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484724/asr0680*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (94)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487523/asr3483*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (64)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484725/alr0681*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (115)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488294/all4254<sup>&#215;</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (398)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485091/asr1048*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (65)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488340/all4300*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (227)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485092/asr1049*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (88)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488754/alr4714</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (232)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485286/asl1243*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (72)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488903/alr4863</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (999)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485748/all1708*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (200)</p>
                     </c>
                     <c ca="left">
                        <p>NP_489130/all5090</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (162)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_486432/alr2392*</p>
                     </c>
                     <c ca="left">
                        <p>filament integrity protein (179)</p>
                     </c>
                     <c ca="left">
                        <p>NP_489162/all5122</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (119)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(c) Proteins specific for Chroococcales</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BAA10649/slr0111</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (173)</p>
                     </c>
                     <c ca="left">
                        <p>BAA17589/sll1268</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical(517)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BAA10763</p>
                     </c>
                     <c ca="left">
                        <p>cytochrome b6-f complex subunit (36)</p>
                     </c>
                     <c ca="left">
                        <p>BAA17704/sll1755</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical(407)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BAA16770/slr1107</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical(444)</p>
                     </c>
                     <c ca="left">
                        <p>BAA18427/slr0960</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical(146)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BAA17546/ssr2406</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical(74)</p>
                     </c>
                     <c ca="left">
                        <p>BAA18451/sll1531</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical(608)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(d) Proteins specific for Nostocales</b>
                           <sup>+</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_48404/all0002</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (245)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485976/asl1936</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (81)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484071/asl0027</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (81)</p>
                     </c>
                     <c ca="left">
                        <p>NP_485977/asl1937</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (83)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484141/asl0097</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (51)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486406/alr2366</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (118)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484220/asl0176</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (87)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486414/alr2374</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (129)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484351/all0307</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (114)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486562/alr2522</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (141)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484421/alr0377</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (153)</p>
                     </c>
                     <c ca="left">
                        <p>NP_486815/alr2775</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (249)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484504/asr0460</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (81)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487185/all3145</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (122)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484505/asr0461</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (96)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487215/alr3175</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (264)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484526/asr0482</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (64)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487290/asr3250</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (69)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484616/asl0572</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (75)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487319/asr3279</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (64)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484758/asl0715</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (56)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487408/asr3368</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (75)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484822/asl0779</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (67)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487429/asr3389</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (75)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484885/asl0842</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (80)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487760/alr3720</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (129)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484898/asr0855</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (83)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487950/alr3910</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (252)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_484966/asr0923</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (67)</p>
                     </c>
                     <c ca="left">
                        <p>NP_487957/alr3917</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (447)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485022/all0979</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (220)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488113/all4073</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (121)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485048/asr1005</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (80)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488149/all4109</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (235)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485180/alr1137</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (107)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488157/all4117</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (411)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_485189/alr1146</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (847)</p>
                     </c>
                     <c ca="left">
                        <p>NP_488392/asr4352</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (65)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup># </sup>also found in one of the clade A cyanobacteria</p>
                  <p>* missing in 1-2 species/strains</p>
                  <p><sup>+</sup>Additional proteins that are specific for Nostocales are listed in the Additional file <supplr sid="S5">5</supplr>.</p>
               </tblfn>
            </tbl>
            <p>Within Clade B, the heterocyst-forming cyanobacteria form a monophyletic group (subclass Nostocophycidae) <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B10">10</abbr><abbr bid="B47">47</abbr><abbr bid="B50">50</abbr></abbrgrp>. We recently described two conserved indels (a 4 aa insert in the PetA protein, a precursor of the apocytochrome f, and a 5 aa insert in the ribosomal protein S3) that are specific for these bacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. In the present work, blast searches on the genome of <it>Nostoc sp. PCC7120 </it>have identified 65 proteins that are uniquely shared by all of the sequenced <it>Nostocales </it>species/strains (<it>Nostoc, Anabaena and Nodularia</it>) (Table <tblr tid="T4">4d</tblr> and additional file <supplr sid="S5">5</supplr>). Fifty-eight additional protein listed in the additional file <supplr sid="S5">5</supplr> are also specific for this order, but they are missing in 1-2 species/strains. These proteins provide potential molecular signatures for the <it>Nostocales </it>order (Nostocophycidae subclass).</p>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>Proteins specific for the <it>Nostocales </it>order.</b>. As above</p>
               </text>
               <file name="1471-2148-10-24-S5.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The cyanobacteria such as <it>Synechocystis, Microcystis</it>, <it>Crocosphaera </it>and <it>Cyanothece</it>, belonging to the order <it>Chroococcales</it>, form another well-defined clade in phylogenetic trees (see Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B37">37</abbr><abbr bid="B47">47</abbr></abbrgrp>. A 1 aa insert in a highly conserved region of the RecA protein is also specific for these cyanobacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. This insert is also present in <it>Synechococcus sp. PCC7002</it>, which branches with this clade in the phylogenetic trees (see Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B47">47</abbr></abbrgrp>. In this work, we have identified 8 proteins that are uniquely present in various sequenced <it>Chroococcales </it>species/strains (Table <tblr tid="T4">4c</tblr>). The evolutionary stages where the genes for these proteins have likely evolved are indicated in the interpretive diagram (Fig. <figr fid="F2">2</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Signature proteins for the Clade C Cyanobacteria</p>
            </st>
            <p>The Clade C is comprised of different strains/isolates of marine <it>Prochlorococcus </it>and <it>Synechococcus </it><abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. We have recently described a number of conserved indels in widely distributed proteins that are specific for all of the species/strains from Clade C <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. These signatures include a 3 aa insert in the RNA polymerase beta subunit, a 2 aa insert the proteins KsgA, a 6 aa insert in tyrosyl-tRNA synthetase, a 2 aa insert in the tRNA (guanine-N1-)-methyltransferase, a 1 aa insert in the RNA polymerase &#946;' subunit and a 12 aa insert in the DNA polymerase I <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. These signature indels are not found in the Clades A or B cyanobacteria or other phyla of bacteria. Additionally, they are also absent in <it>Syn. elongatus </it>as well as <it>Synechococcus sps</it>. PCC7002 and PCC7335. Another example of a signature insert that is specific for Clade C species/strains is presented in Fig. <figr fid="F3">3</figr>. In this case, a 6 aa insert in a flavoprotein is commonly present in all Clade C species/strains, but absent from all other cyanobacteria as well as other bacteria. This latter observation indicates that this indels is an insert in the Clade C species/strains. Interestingly, this insert and also several of the other Clade C signature indels are also present in <it>Cyanobium </it>sp. PCC7001 (Fig. <figr fid="F3">3</figr>), supporting its placement within the Clade C (Fig. <figr fid="F2">2</figr>) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Partial sequence alignment of flavoprotein showing a 6 aa conserved insert (boxed) that is specific for the Clade C cyanobacteria</p>
               </caption>
               <text>
                  <p><b>Partial sequence alignment of flavoprotein showing a 6 aa conserved insert (boxed) that is specific for the Clade C cyanobacteria</b>. Dashes (-) in this and all other sequence alignments indicate identity with the amino acid on the top line. The numbers on the top indicate the position of the sequence in the species on the first line. The absence of this insert in all other cyanobacteria and other phyla of bacteria provide evidence that this indel is an insert in the Clade C.</p>
               </text>
               <graphic file="1471-2148-10-24-3" hint_layout="double"/>
            </fig>
            <p>Our blast analyses on proteins from the genomes of <it>Synechococcus sp. WH8102, Prochlorococcus sp. MIT9215 </it>and <it>Pro. marinus subsp. marinus str. CCMP1375 </it>have identified 60 proteins that are uniquely shared by virtually all of the species/strains from Clade C cyanobacteria (Table <tblr tid="T5">5a</tblr>). These signature proteins provide further evidence and molecular markers indicating the distinctness of Clade C. Eight additional proteins in Table <tblr tid="T5">5b</tblr> are also specific for Clade C cyanobacteria, but they are absent in all of the low B/A ecotype <it>Prochlorococcus </it>strains, indicating that the genes for these proteins were lost from a common ancestor of the low B/A clade.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Proteins Specific for the Clade C Cyanobacteria (<it>Synechococcus/Prochlorococcus</it>)</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874427/Pro0033</p>
                     </c>
                     <c ca="left">
                        <p>predicted membrane protein (87)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483584</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (114)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874433/Pro0039</p>
                     </c>
                     <c ca="left">
                        <p>predicted membrane protein (203)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483784</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (60)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874460/Pro0066</p>
                     </c>
                     <c ca="left">
                        <p>predicted membrane protein (128)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483792<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (116)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874461/Pro0067</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (154)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483839</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(75)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874496/Pro0102</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (121)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484024</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (67)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874497/Pro0103</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (76)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484070</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (96)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874503/Pro0109</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (127)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484558</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(70)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874769/Pro0375</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (128)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484735</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(136)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874827/Pro0433</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (148)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484929</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (89)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874971/Pro0578</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (104)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484936</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (237)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875238/Pro0846</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (135)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001485057</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(88)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875250/Pro0858</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (116)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001485093</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (172)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875290/Pro0898</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (75)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001485151<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (139)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875352/Pro0960</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (76)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875191/Pro0799*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (234)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875454/Pro1062</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (189)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875240/Pro0848*</p>
                     </c>
                     <c ca="left">
                        <p>membrane protein/(99)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875462/Pro1070</p>
                     </c>
                     <c ca="left">
                        <p>dihydroneopterin aldolase (127)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875270/Pro0878*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (62)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875555/Pro1163</p>
                     </c>
                     <c ca="left">
                        <p>predicted protein family PM-1 (67)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483575*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(71)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875594/Pro1202</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (81)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483809*<sup>+</sup></p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(116)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875635/Pro1243</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (193)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483828*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(122)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_876135/Pro1744</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (206)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483924*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(502)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_876152/Pro1761</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (98)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875468/Pro1076*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (88)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_876219/Pro1828</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (100)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875511/Pro1119*</p>
                     </c>
                     <c ca="left">
                        <p>Predicted protein with signal (144)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001010165</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(121)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875732/Pro1341*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (88)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483235</p>
                     </c>
                     <c ca="left">
                        <p>type II secretion system (149)</p>
                     </c>
                     <c ca="left">
                        <p>NP_876151/Pro1760*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (152)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483304</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (100)</p>
                     </c>
                     <c ca="left">
                        <p>NP_876229/Pro1838*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (171)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483312</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (87)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483988*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(70)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483445</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(72)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484266*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(195)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483588</p>
                     </c>
                     <c ca="left">
                        <p>TIR domain-containing protein (82)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483537*</p>
                     </c>
                     <c ca="left">
                        <p>possible Pollen allergen (139)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483568</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (102)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483448</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (42)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001484489</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (85)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484000</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (80)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(b) Proteins Specific for Clade C which are missing in Low B/A ecotype Prochlorococcus strains</b>
                           <sup>#</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875075/Pro0683*</p>
                     </c>
                     <c ca="left">
                        <p>Predicted protein family PM-3 (178)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875154/Pro0762</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (127)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874434/Pro0040*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (119)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875509/Pro1117</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (181)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_874631/Pro0237</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (102)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875611/Pro1219*</p>
                     </c>
                     <c ca="left">
                        <p>Predicted protein family PM-3 (195)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_875013/Pro0621*</p>
                     </c>
                     <c ca="left">
                        <p>predicted protein family PM-3 (167)</p>
                     </c>
                     <c ca="left">
                        <p>NP_876129/Pro1738*</p>
                     </c>
                     <c ca="left">
                        <p>Predicted dehydrogenase (273)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* - Missing in 1-2 species</p>
                  <p><sup>+</sup>Also present in <it>Synechococcous elongatus</it></p>
                  <p>Several of these proteins are also present in <it>Cyanobium sp. PCC7001 </it>and <it>Paulinella chromatophora</it></p>
                  <p><sup>#</sup>Low B/A ecotype clade is comprised of the following Prochlorococcus strains: Pro. marinus AS9601, Pro. marinus MIT9215, Pro. marinus MIT9301, Pro. marinus MIT9312, Pro. marinus MIT9515, Pro. marinus CCMP1986</p>
               </tblfn>
            </tbl>
            <p>As noted earlier, in phylogenetic trees, the branching position of <it>Syn. elongatus </it>is not resolved. In our analyses, we have come across only 3 proteins (marked with <sup>+ </sup>in Table <tblr tid="T5">5a</tblr>) that are uniquely found in Clade C species/strains as well as <it>Syn. elongatus</it>. This is in contrast to 22 proteins that are uniquely shared by Clade B cyanobacteria and <it>Syn. elongatus </it>(Table <tblr tid="T3">3b</tblr>). These observations in conjunction with the unique presence of split DnaE genes in Clade B cyanobacteria and <it>Syn. elongatus </it>make a strong case that <it>Syn. elongatus </it>is more closely related to the Clade B cyanobacteria than to the Clade C species/strains.</p>
            <p>The two genera, <it>Prochlorococcus </it>and <it>Synechococcus</it>, which make up most of the Clade C cyanobacteria, differ from each other in important respects, particularly with regard to the main pigments in their light harvesting systems <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. In contrast to various <it>Synechococcus </it>strains/isolates and most other cyanobacteria, which contain chlorophyll <b>a </b>and phycobiliproteins as the major pigments in their photosynthetic systems, all <it>Prochlorococcus </it>strains/isolates utilize divinyl chlorophyll <b>a </b>and both mono and divinyl chlrophyll <b>b </b>as the main pigments in their light-harvesting systems <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr></abbrgrp>. Further, while <it>Synechococcus </it>isolates are ubiquitous in different aquatic environments including estuarine, coastal and offshore waters <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, <it>Prochlorococcus </it>strains are mainly found in warm oligotrophic oceanic settings <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Among the sequenced cyanobacteria, <it>Prochlorococcus </it>strains/isolates have the smallest genomes (see Table <tblr tid="T1">1</tblr>). Although <it>Prochlorococcus </it>are indicated to be polyphyletic in phylogenetic analyses (with strains MIT 9303 and MIT 9313 branching within the <it>Synechococcus </it>strains/isolates; see Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>) <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B23">23</abbr><abbr bid="B33">33</abbr></abbrgrp>, our blast searches have identified 19 proteins that are uniquely shared by all or most of the <it>Prochlorococcus </it>strains (Table <tblr tid="T6">6b</tblr>). These results indicate that despite their polyphyletic branching in phylogenetic trees, all <it>Prochlorococcus </it>strains/isolates form a monophyletic clade, which is in accordance with their distinctive photosynthetic pigments composition. In this work, we also describe a 2 aa conserved insert in the protein heme oxygenase that is also exclusively present in various <it>Prochlorococcus </it>strains (Fig. <figr fid="F4">4</figr>). The unique presence of this insert in various <it>Prochlorococcus </it>strains provides further evidence that this group is monophyletic. The enzyme heme oxygenase, which contains this conserved insert, plays an important role in the biosynthesis of photosynthetic pigments phyto-chromobilin and phycobilins <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. Because <it>Prochlorococcus </it>are unique in terms of their photosynthetic pigment composition, it is of much interest to determine the functional significance of this conserved indel.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Partial sequence alignment of heme oxygenase showing a 2 aa insert (boxed) that is uniquely present in all sequenced <it>Prochlorococcus </it>strains, but not found in any other cyanobacteria</p>
               </caption>
               <text>
                  <p><b>Partial sequence alignment of heme oxygenase showing a 2 aa insert (boxed) that is uniquely present in all sequenced <it>Prochlorococcus </it>strains, but not found in any other cyanobacteria</b>. This insert provides evidence that <it>Prochlorococcus </it>strains are monophyletic and shared a common ancestor.</p>
               </text>
               <graphic file="1471-2148-10-24-4" hint_layout="double"/>
            </fig>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Proteins specific for the Main Groups of Clade C Cyanobacteria</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(a) Proteins Specific for the Clade C cyanobacteria<sup>+ </sup>except <it>Prochlorococcus</it></b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Protein</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Function (length)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_896793/SYNW0700</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (76)</p>
                     </c>
                     <c ca="left">
                        <p>NP_897761/SYNW1668</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (181)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_896942/SYNW0849*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (120)</p>
                     </c>
                     <c ca="left">
                        <p>NP_898450/SYNW2361*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (129)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_897039/SYNW0946</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (139)</p>
                     </c>
                     <c ca="left">
                        <p>NP_896879/SYNW0786*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (107)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_896623/SYNW0528*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(94)</p>
                     </c>
                     <c ca="left">
                        <p>NP_896904/SYNW0811*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical (81)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_896827/SYNW0734</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(152)</p>
                     </c>
                     <c ca="left">
                        <p>NP_897398/SYNW1305*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(78)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_897338/SYNW1245*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(95)</p>
                     </c>
                     <c ca="left">
                        <p>NP_897599/SYNW1506</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(221)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NP_897228/SYNW1135*</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(139)</p>
                     </c>
                     <c ca="left">
                        <p>NP_897875/SYNW1784</p>
                     </c>
                     <c ca="left">
                        <p>Hypothetical(150)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center" cspan="4">
                        <p>
                           <b>(b) Proteins Specific for <it>Prochlorococcus</it></b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483307</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (58)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484319*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (94)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483938*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (109)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484350</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (104)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483942</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (75)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484353*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (68)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483946</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (88)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484529*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (99)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483975*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (99)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484536*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (42)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483996*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (51)</p>
                     </c>
                     <c ca="left">
                        <p>NP_875788*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (81)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001484105*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (64)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001483983*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (96)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001484131</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (61)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484474*</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (79)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001484828</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (55)</p>
                     </c>
                     <c ca="left">
                        <p>YP_001484870</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (142)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YP_001483822</p>
                     </c>
                     <c ca="left">
                        <p>hypothetical (44)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* Missing in 1-2 strains/isolates</p>
                  <p><sup>+ </sup>These proteins are primarily present in various Synechococcus species/strains that are part of Clade C (see Figs. 1 and 2). However, <it>Synechococcus </it>genus is not monophyletic and many <it>Synechococcus </it>strains group with Clade and B (viz. <it>Synechococcus sp. PCC7002</it>, Synechococcus sp. PCC7335, Synechococcus sp. JA-3-3Ab and JA-2-3B'a) and these proteins are absent in those strains. Besides <it>Synechococcus</it>, homologs of many of these proteins are also found in <it>Cyanobium sp. PCC7001 </it>as well as in <it>Paulinella chromatophora</it>, indicating that these species may also belong to the Clade C cyanobacteria.</p>
               </tblfn>
            </tbl>
            <p>If <it>Prochlorococcus </it>strains/isolates form a monophyletic lineage, then one expect that other cyanobacteria that are part of Clade C might also share many unique proteins in common. Indeed, our blast searches have identified 14 proteins that are uniquely present in various other cyanobacteria (mostly <it>Synechococcus </it>strains) that are part of Clade C (Table <tblr tid="T6">6a</tblr>). It should be mentioned that for several of these proteins, blast hits indicating significant similarity are also found for <it>Cyanobium sp. PCC7001 </it>and <it>Paulinellla chromatophora</it>, indicating that these cyanobacteria are also part of the Clade C. The grouping of <it>Cyanobium sp. PCC7001 </it>with Clade C is also supported by the conserved indel in the flavoprotein (see Fig. <figr fid="F3">3</figr>).</p>
            <p>As noted above, in phylogenetic trees based on concatenated protein sequences <it>Prochlorococcus </it>str. MIT9303 and MIT9313 branch within the various <it>Synechococcus </it>strains/isolates (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>). Earlier phylogenetic studies by Rocap et al. <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> based on the 16S-23S rDNA spacer region indicate that these two strains (high B/A clade IV) form the deepest branching isolates of this genus. Further, in contrast to other sequenced <it>Prochlorococcus </it>strains, whose G+C content range from 30-39%, the strains MIT9303 and MIT9313 have much higher G+C content (~50%) (see Table <tblr tid="T1">1</tblr>). Our blast analyses, in addition to identifying many proteins that are unique to various <it>Synechococcus </it>strains/isolates, have also identified 22 proteins that are specifically present in all of the Clade C <it>Synechococcus </it>strains as well as in <it>Prochlorococcus </it>MIT9303 and MIT9313 (additional file <supplr sid="S6">6a</supplr>). At the same time, we have come across 37 proteins that are uniquely found in all other sequenced <it>Prochlorococcus </it>strains, but which are missing in MIT9303 and MIT9313 (additional file <supplr sid="S6">6b</supplr>). In addition, we have also identified a 1 aa deletion in a conserved region of the protein protochlorophyllide oxidoreductase (POR) that is uniquely shared by all other <it>Prochlorococcus </it>strains except MIT9303 and MIT9313 (Fig. <figr fid="F5">5</figr>). The enzyme POR is responsible for catalyzing light driven reduction of protochlorophyllide to chlorophyllide - a key regulatory reaction in the chlorophyll biosynthetic pathway <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. Hence, it is again of much interest to understand the functional significance of this conserved indel. The rare genetic change leading to this indel likely occurred in a common ancestor of various <it>Prochlorococcus </it>strains after the branching of MIT9303 and MIT9313 (Fig. <figr fid="F2">2</figr>). These observations, in conjunction with the branching pattern of these strains in phylogenetic trees, provide evidence that these two <it>Prochlorococcus </it>strains comprise the deepest branching group (high B/A clade IV) <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> within the <it>Prochlorococcus </it>genus, exhibiting closest relationship to the <it>Synechococcus </it>strains/isolates.</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p><b>Clade C proteins showing anomalous behavior of <it>Pro. marinus MIT9303 </it>and <it>MIT9313</it>.</b>. This table describes two sets of proteins: (a) Proteins that are specific for Clade C <it>Synechococcus </it>strains/isolates that are also found in <it>Pro. marinus MIT9303 </it>and <it>MIT9313 </it>and (b) Proteins specific for various other Prochlorococcus marinus strains/isolates, but which are missing in <it>Pro. marinus MIT9303 </it>and <it>MIT9313</it>.</p>
               </text>
               <file name="1471-2148-10-24-S6.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Partial sequence alignment of the protein protochlorophyllide oxidoreductase showing a 1 aa deletion that is commonly shared by all <it>Prochlorococcus </it>strains except MIT 9303 and MIT 9313</p>
               </caption>
               <text>
                  <p><b>Partial sequence alignment of the protein protochlorophyllide oxidoreductase showing a 1 aa deletion that is commonly shared by all <it>Prochlorococcus </it>strains except MIT 9303 and MIT 9313</b>. This indel provides evidence for the deep branching of these <it>Prochlorococcus </it>strains relative to all other strains.</p>
               </text>
               <graphic file="1471-2148-10-24-5" hint_layout="double"/>
            </fig>
            <p>Earlier studies have led to the division of <it>Prochlorococcus </it>strains/isolates into two physiologically distinct groups (high B/A and low B/A ecotypes), based upon the ratios of chlorophyll <b>b </b>and <b>a2 </b>in their light-harvesting systems and their ability to grow at different light intensities <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B56">56</abbr></abbrgrp>. Of these two groups, strains from the high B/A ecotype, which have larger ratio of chlorophyll <b>b/a</b><sub>2 </sub>are able to grow at extremely low irradiance, whereas those from the low- B/A ecotype containing lower ratio of chlorophyll <b>b/a</b><sub>2 </sub>are unable to grow under these conditions. The low- B/A ecotype strains instead are adapted to growth at high light intensities, where the growth of high B/A ecotype strains is inhibited. The strains from these two ecotypes also differ in terms of their sensitivity to copper and their ability to use nitrite or nitrate as nitrogen sources <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B57">57</abbr></abbrgrp>. In phylogenetic trees, the low B/A ecotype <it>Prochlorococcus </it>isolates (viz. MIT9515, CCMP1986, MIT9312, MIT9215, MIT9301 and AS9601) formed a distinct subclade that was well separated from all other Clade C species/strains by a long-branch and 100% bootstrap score (Fig. <figr fid="F1">1</figr> and additional file <supplr sid="S2">2</supplr>)<abbrgrp><abbr bid="B23">23</abbr><abbr bid="B41">41</abbr></abbrgrp>. We have also described two conserved indels (viz. a 5 aa deletion in leucyl-tRNA synthetase and 1 aa insert in the Ffh protein) that are uniquely shared by all of the low B/A ecotype <it>Prochlorococcus </it>strains <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. In the present work, we have identified 67 proteins that are exclusively found in all of the sequenced strains from the low B/A ecotype clade (additional file <supplr sid="S7">7a</supplr>). Seventy-two proteins listed in the additional file <supplr sid="S7">7b</supplr> are also specific for this clade, but they are missing in 1-2 of the strains/isolates. These signature proteins and indels together with the distinct branching of the low B/A strains in phylogenetic trees provide strong evidence that this group of <it>Prochlorococcus </it>strains are phylogenetically, physiologically and molecularly distinct from all other <it>Prochlorococcus </it>strains. Based upon species distribution patterns of various cyanobacteria-specific proteins, evolutionary stages where the genes for these proteins likely evolved are indicated in the interpretive diagram in Fig. <figr fid="F2">2</figr>.</p>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p>Proteins specific for the Low B/A ecotype <it>Pro. marinus </it>strains/isolates.</p>
               </text>
               <file name="1471-2148-10-24-S7.PDF">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion and Conclusions</p>
         </st>
         <p>In this work, we have used a combination of phylogenomic and signature proteins based approaches to elucidate the evolutionary relationships among cyanobacteria. Phylogenetic trees were initially constructed for 44 cyanobacteria based on concatenated sequences for 44 widely distributed proteins present in various cyanobacteria. The branching pattern of cyanobacteria in these trees was very similar to that observed in other recent studies based on different large sets of proteins for smaller numbers of cyanobacteria <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. In all of these trees a number of distinct clades of cyanobacteria are consistently observed. However, the main focus of the present work was on comparative analyses of cyanobacterial genomes to identify unique sets of genes/proteins that are limited to particular groups of cyanobacteria, corresponding to various phylogenetically identified clades. This work complement our recent studies, where a comparative genomic approach was employed to identify >40 conserved indels in widely distributed proteins that are also specific for the same groups/clades of cyanobacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>.</p>
         <p>Recent analyses of genomic sequences have revealed that whole proteins that are limited to different monophyletic clades are present at different phylogenetic depths <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>. Unlike ORFan proteins, which are unique to a given species or a strain and are subject to rapid gene loss <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>, these lineage-specific proteins are retained in a conserved state by all or most species/strains from a given clade, indicating that they are conferring selective advantage to species from these clades <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B58">58</abbr><abbr bid="B62">62</abbr></abbrgrp>. Although the mechanism responsible for the evolution or acquisition of genes for these proteins is unclear <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B61">61</abbr></abbrgrp>, their specific presence in different clades indicates that the genes for these proteins first evolved (or introduced) in a common ancestor of these clades followed by their retention by various descendents of these clades. Because of their clade specificity, these lineage specific-proteins or conserved signature proteins (CSPs) provide valuable molecular markers for these clades <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B43">43</abbr><abbr bid="B59">59</abbr></abbrgrp>. Our recent analyses of CSPs from several major groups of bacteria (viz. alpha proteobacteria, epsilon proteobacteria, gamma proteobacteria, chlamydiae, <it>Bacteroidetes-Chlorobi </it>and <it>Actinobacteria</it>) provide evidence that the species distribution of most of these CSPs show high degree of concordance with different clades in the phylogenetic trees <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B42">42</abbr><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>. This inference is strongly reinforced by the results of present study, where most of the identified CSPs correspond to well-defined clades in the phylogenetic trees.</p>
         <p>It should be mentioned that in our analyses we have not come across significant numbers of CSPs that support alternate groupings i.e. where the proteins are commonly shared by various species/strains from clades that are phylogenetically unrelated (e.g. <it>Nostocales </it>and Clade C, or <it>Oscillatoriales </it>and Clade C). However, one commonly observed pattern is that if two clades are close to each other in phylogenetic trees, but their branching is not clearly resolved (i.e. weakly supported by bootstrap scores), then in addition to observing many proteins that are unique to each of these two clades, several proteins that are commonly shared by them are also observed. This could be due to either that genes for many of these proteins probably evolved in a common ancestor of these clades prior to their becoming phylogenetically distinct or due to lateral gene transfers among closely related taxa <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B65">65</abbr></abbrgrp>. Nevertheless, our results that most of these proteins are distinctive characteristics of phylogenetically well-defined monophyletic clades strongly suggest that their species distribution has not been significantly affected by lateral gene transfers, which is indicated to be very common in cyanobacteria <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B66">66</abbr></abbrgrp>.</p>
         <p>When a protein is confined to only a certain group of species/strains, then based upon this information alone, it is difficult to determine whether the group of species containing this protein form a clade in the phylogenetic sense or not. To properly evaluate the results of such studies, it is necessary to carry out these studies in conjunction with phylogenetic as well as other forms of analyses (e.g. studies based on conserved indels), where it is possible to establish a rooted relationship among different groups or taxa under consideration <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B26">26</abbr><abbr bid="B59">59</abbr></abbrgrp>. Based on these studies, if a given protein is uniquely found in all or most of the species from a well-defined monophyletic clade, and generally no where else, then the simplest and most parsimonious explanation for this is that the gene for this protein first appeared in a common ancestor of this group and then passed on vertically to its various descendants <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B20">20</abbr><abbr bid="B67">67</abbr></abbrgrp>. We have interpreted the results of species distribution of various unique proteins based on this minimal assumption. Based on this interpretation, various identified signature proteins or CSPs could be regarded as molecular synapomorphies that are specific for different clades of cyanobacteria.</p>
         <p>The branching order and interrelationships among cyanobacteria that emerges based upon all of these different approaches is shown in Fig. <figr fid="F2">2</figr>. All of these approaches indicate that a clade consisting of <it>Gloebacter </it>and the <it>Synechococcus </it>strains JA-3-3Ab and JA2-3-B'a (Clade A) forms the deepest branching lineage within cyanobacteria. A large number of sequenced cyanobacteria correspond to marine unicellular <it>Synechococcus </it>and <it>Prochlorococcus </it>strains (Clade C). We have identified numerous proteins and conserved indels that are specific for this clade. Although <it>Synechococcus </it>and <it>Prochlorococcus </it>strains do not form monophyletic clusters in phylogenetic trees, the shared presence of many novel proteins as well as some conserved indels by various <it>Prochlorococcus </it>strains provide evidence that this group is monophyletic. The unique pigments that are found in the light harvesting system of <it>Prochlorococcus </it>also support their distinctness from other cyanobacteria. The monophyletic grouping of marine unicellular <it>Synechococcus </it>strains/isolates based upon these molecular and biochemical characteristics is at variance with their polyphyletic branching in different phylogenetic trees (see Fig. <figr fid="F1">1</figr>, additional file <supplr sid="S2">2</supplr>) <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr><abbr bid="B23">23</abbr></abbrgrp>. This discordance could be explained by either lateral migration of genes responsible for these characteristics <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B13">13</abbr><abbr bid="B33">33</abbr><abbr bid="B68">68</abbr></abbrgrp>, or due to inability of the phylogenetic trees to resolve the branching order among closely related species/strains. Among the <it>Prochlorococcus </it>strains, our analyses confirm that the strains corresponding to low B/A ecotype are distinct not only in physiological and phylogenetic terms <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B56">56</abbr></abbrgrp>, but that they also share large numbers of proteins that are unique to them. Several conserved indels that are specific for the low B/A ecotype clade have also been identified <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Recent study by Zhaxybayeva et al. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> also provides evidence that the high-light adapted low B/A ecotype <it>Prochlorococcus </it>strains form a monophyletic clade, in contrast to the paraphyletic grouping of the low-light adapted (i.e. high B/A ecotype) <it>Prochlorococcus </it>spp. <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. All of these observations make a strong case for the recognition of low B/A ecotype <it>Prochlorococcus </it>strains as a distinct taxonomic entity.</p>
         <p>Within Clade B, many CSPs were identified that are specific for the <it>Nostocales </it>and <it>Chroococcales </it>orders. In addition, several other CSPs are uniquely present in the <it>Nostocales </it>and <it>Oscillatoriales </it>orders, or by the <it>Nostocales</it>, <it>Oscillatoriales </it>and <it>Chroococcales</it>. In recent work, a number of conserved indels that are unique to these orders of cyanobacteria have also been identified <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Although, the clade comprising of these cyanobacterial orders is not clearly resolved in phylogenetic trees <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B11">11</abbr></abbrgrp>, the shared presence of large numbers of novel CSPs as well as some conserved indels by these cyanobacteria strongly suggests that species/strains from these groups shared a common ancestor exclusive of other cyanobacteria and that this clade represents a deeper branching grouping within cyanobacteria. The results presented here also suggest that <it>Syn. elongatus </it>is more closely related to Clade B in comparison to either clade A or C of cyanobacteria.</p>
         <p>The signature proteins and conserved indels for different cyanobacterial clades that are described in this work and in our recent studies <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> provide novel and powerful means for understanding cyanobacterial phylogeny and taxonomy. Based on these molecular markers, all of the main clades of cyanobacteria can now be identified and circumscribed in molecular terms. These signature proteins and indels should also prove useful for the identification and assignment of cyanobacterial species/strains to specific clades based upon the presence or absence of various signature indels or CSPs. Because many of these CSPs, or proteins containing the conserved indels, are highly conserved, degenerate PCR primers could be readily designed to sequence the corresponding genes/proteins from any given cyanobacteria. The assignment of any species/strains into a given clade by this approach is based upon several independent signatures that provide complementary information. Some of these signatures serve to exclude a given species/strains from particular groups or clades, whereas others point to its inclusion in more and more specific clades. Blast searches with these cyanobacteria-specific CSPs should also prove useful in determining the presence or absence of different groups of cyanobacteria in metagenomic sequences <abbrgrp><abbr bid="B69">69</abbr></abbrgrp></p>
         <p>Most of the cyanobacterial signature proteins identified in this work are of unknown functions. However, the retention of these genes by all cyanobacteria from the indicated clades strongly suggests that these proteins perform important functions in these groups of cyanobacteria <abbrgrp><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr></abbrgrp>. Likewise, our recent work shows that the conserved indels in protein sequences are also essential for the group or clade of species where they are found <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. Hence, further work on understanding the cellular functions of these cyanobacterial signature proteins and signature indels should be of great interest. These studies should provide valuable insights regarding biochemical and physiological characteristics that are unique to different clades of cyanobacteria <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Phylogenetic/phylogenomic analyses</p>
            </st>
            <p>Phylogenetic analyses were carried out on a set of 44 proteins involved in important housekeeping functions that are present in most organisms (see Additional file <supplr sid="S1">1</supplr>) <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Blast searches with these proteins revealed that their homologs were present in all 34 sequenced cyanobacterial genomes (listed in Table <tblr tid="T1">1</tblr>), the two outgroup species (<it>Bacillus subtilis </it>and <it>Staphylococcus aureus</it>), as well as 10 other cyanobacteria (viz. <it>Crocosphaera watsonii WH8501, Cyanothece sp. CCY0110, Lyngbya sp. PCC8106, Microcystis aeruginosa PCC7806, Nodularia spumigena CCY9414, Syenchococcus sp. WH5701, Syenchococcus sp. BL107, Syenchococcus sp. RS9917, Syenchococcus sp. RS9916 </it>and <it>Syenchococcus sp. WH7805</it>). Hence, sequence information for all of these cyanobacteria was included in our analyses. The multiple sequence alignments for these proteins were created using the ClustalX 1.83 program <abbrgrp><abbr bid="B77">77</abbr></abbrgrp> and they were concatenated into a single large file. This unedited sequence alignment was imported into the Gblocks 0.91b program to remove poorly aligned regions <abbrgrp><abbr bid="B78">78</abbr></abbrgrp>. This program was used with default settings except that allowed gap position parameter was changed to half. The resulting final alignment of 16834 amino acid sites was used for phylogenetic analyses. A neighbour-joining (NJ) tree based on 1000 bootstrap replicates was constructed by the Kimura model <abbrgrp><abbr bid="B79">79</abbr></abbrgrp> using the TREECON 1.3b program <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>. The maximum-likelihood (ML) analysis was carried out using the WAG+F model with gamma distribution of evolutionary rates with four categories using the TREE-PUZZLE program with 10000 puzzling steps <abbrgrp><abbr bid="B81">81</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of proteins and conserved indels that are specific for Cyanobacteria</p>
            </st>
            <p>The Blastp searches were carried out on each ORF in the genomes of <it>Synechococcus sp. WH8102, Synechocystis sp. PCC6803, Nostoc sp. PCC7120, Synechococcus sp. JA-3-3Ab, Prochlorococcus sp. MIT9215 </it>and <it>Prochlorococcus marinus subsp. marinus str. CCMP1375 </it>to identify proteins that are uniquely present in various clades of cyanobacteria seen in the phylogenetic trees (Fig. <figr fid="F1">1</figr>). The blast searches were performed against all organisms (i.e. non-redundant (nr) database) using the default parameters, without the low complexity filter <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>. The proteins that were of interest were those where either all significant hits were from the indicated groups of cyanobacteria, or which involved a large increase in E values from the last hit belonging to a particular clade to the first hit from any other bacteria/cyanobacteria and the E values for the latter hits were >1e<sup>-04</sup>, indicating weak similarity that could occur by chance. Higher E values are often significant for smaller proteins as the magnitude of the E value depends upon the length of the query sequence <abbrgrp><abbr bid="B82">82</abbr></abbrgrp>. Hence, the lengths of the query proteins and those of various hits were also taken into consideration when analyzing the results of these studies. In most cases, the lengths of various significant hits were very similar to those of the query proteins. Some proteins, which in addition to cyanobacteria were also found in the plants/plastids, or in an isolated species from some other groups (noted appropriately), were also retained. The proteins, which were uniquely found in a given species or strain were not examined in this work. For all cyanobacterial proteins that are specific for various clades or subgroups, their accession numbers, any information regarding cellular functions, and protein lengths, were tabulated and are presented. Identification of new conserved indels that are specific for cyanobacterial clades was carried out as described in our earlier work <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>The initial Blastp searches on various cyanobacterial genomes were carried out by RSG with the computer assistance provided by Venus Wong. DWM analyzed the results of these searches to identify various group-specific proteins. All of these results were checked by RSG. DWM also generated a concatenated alignment of various cyanobacteria. RSG was responsible for carrying out the phylogenetic studies and for identification of conserved indels that are reported here. RSG also directed this study and wrote the manuscript, which was read and approved by all authors.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by a research grant from the Natural Science and Engineering Research Council of Canada. We thank Kenneth Ng and Amy Mok for assistance in carrying out some earlier blast searches on the cyanobacterial genomes.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Generic assignments, strain histories and properties of pure cultures of cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Rippka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Deruelles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Waterbury</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Herdman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stanier</snm>
                  <fnm>RY</fnm>
               </au>
            </aug>
            <source>J Gen Microbiol</source>
            <pubdate>1979</pubdate>
            <volume>111</volume>
            <fpage>1</fpage>
            <lpage>61</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The Phototrophic Prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Kondratieva</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Pfennig</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Truper</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>The Prokaryotes</source>
            <publisher>New York: Springer-Verlag</publisher>
            <editor>Balows A, Truper HG, Dworkin M, Harder W, Schleifer KH</editor>
            <pubdate>1992</pubdate>
            <fpage>312</fpage>
            <lpage>330</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Phylum BX. Cyanobacteria: Oxygenic Photosynthetic Bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Castenholz</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Bergey's Manual of Systematic Bacteriology</source>
            <publisher>New York: Springer</publisher>
            <editor>Boone DR, Castenholz RW</editor>
            <pubdate>2001</pubdate>
            <fpage>474</fpage>
            <lpage>487</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Morphological and habitat evolution in the Cyanobacteria using a compartmentalization approach</p>
            </title>
            <aug>
               <au>
                  <snm>Sanchez-Baracaldo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hayes</snm>
                  <fnm>PK</fnm>
               </au>
               <au>
                  <snm>Blank</snm>
                  <fnm>CE</fnm>
               </au>
            </aug>
            <source>Geobiology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>145</fpage>
            <lpage>165</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1472-4669.2005.00050.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Morphological and genetic criteria in the taxonomy of Cyanophyta/Cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Wilmotte</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Golubic</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Arhciv fur Hydrobiologie</source>
            <pubdate>1991</pubdate>
            <volume>64</volume>
            <fpage>1</fpage>
            <lpage>24</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Phylogenetic Relationships among the Cyanobacteria Based on 16S rRNA Sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Wilmotte</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Herdman</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bergey's Manual of Systematic Bacteriology</source>
            <publisher>New York: Springer</publisher>
            <editor>Boone DR, Castenholz RW</editor>
            <pubdate>2001</pubdate>
            <fpage>487</fpage>
            <lpage>493</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The RDP-II (Ribosomal Database Project)</p>
            </title>
            <aug>
               <au>
                  <snm>Maidak</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lilburn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>CT</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Saxman</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Farris</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Garrity</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Tiedje</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>173</fpage>
            <lpage>174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/29.1.173</pubid>
                  <pubid idtype="pmcid">29785</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The Revised Road Map to the Manual</p>
            </title>
            <aug>
               <au>
                  <snm>Garrity</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Lilburn</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>Bergey's Manual of Systematic Bacteriology, Part A, Introductory Essays</source>
            <publisher>New York: Springer</publisher>
            <editor>Brenner DJ, Krieg NR, Staley JT</editor>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>159</fpage>
            <lpage>220</lpage>
            <xrefbib>
               <pubid idtype="doi">full_text</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>A proposal for further integration of the cyanobacteria under the Bacteriological Code</p>
            </title>
            <aug>
               <au>
                  <snm>Oren</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>54</volume>
            <fpage>1895</fpage>
            <lpage>1902</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.03008-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15388760</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Nomenclature of Cyanophyta/Cyanobacteria: roundtable on the unification of the nomenclature under the Botanical and Bacteriological Codes</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Algological Studies</source>
            <pubdate>2005</pubdate>
            <volume>117</volume>
            <fpage>13</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1127/1864-1318/2005/0117-0013</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Integrating Markov clustering and molecular phylogenetics to reconstruct the cyanobacterial species tree from conserved protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Swingley</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Blankenship</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Raymond</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2008</pubdate>
            <volume>25</volume>
            <fpage>643</fpage>
            <lpage>654</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msn034</pubid>
                  <pubid idtype="pmpid" link="fulltext">18296704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genome evolution in cyanobacteria: the stable core and the variable shell</p>
            </title>
            <aug>
               <au>
                  <snm>Shi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Falkowski</snm>
                  <fnm>PG</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2008</pubdate>
            <volume>105</volume>
            <fpage>2510</fpage>
            <lpage>2515</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0711165105</pubid>
                  <pubid idtype="pmcid">2268167</pubid>
                  <pubid idtype="pmpid" link="fulltext">18268351</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Phylogenetic analyses of cyanobacterial genomes: quantification of horizontal gene transfer events</p>
            </title>
            <aug>
               <au>
                  <snm>Zhaxybayeva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Papke</snm>
                  <fnm>RT</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>1099</fpage>
            <lpage>1108</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.5322306</pubid>
                  <pubid idtype="pmcid">1557764</pubid>
                  <pubid idtype="pmpid" link="fulltext">16899658</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Prokaryote taxonomy online: challenges ahead</p>
            </title>
            <aug>
               <au>
                  <snm>Oren</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stackebrandt</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <fpage>15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/419015c</pubid>
                  <pubid idtype="pmpid" link="fulltext">12214210</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>System of Cyanoprokaryotes (Cyanobacteria) - State in 2004</p>
            </title>
            <aug>
               <au>
                  <snm>Hoffmann</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Komarek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>kastovsky</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Algological Studies</source>
            <pubdate>2005</pubdate>
            <fpage>95</fpage>
            <lpage>1155</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1127/1864-1318/2005/0117-0095</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Critical Issues in Bacterial Phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Theor Popul Biol</source>
            <pubdate>2002</pubdate>
            <volume>61</volume>
            <fpage>423</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/tpbi.2002.1589</pubid>
                  <pubid idtype="pmpid" link="fulltext">12167362</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Protein Phylogenies and Signature Sequences: A Reappraisal of Evolutionary Relationships Among Archaebacteria, Eubacteria, and Eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <fpage>1435</fpage>
            <lpage>1491</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">98952</pubid>
                  <pubid idtype="pmpid" link="fulltext">9841678</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The phylogeny of Proteobacteria: relationships to other eubacterial phyla and eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>2000</pubdate>
            <volume>24</volume>
            <fpage>367</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1574-6976.2000.tb00547.x</pubid>
                  <pubid idtype="pmpid">10978543</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phylogenetic analysis of tufA sequences indicates a cyanobacterial origin of all plastids</p>
            </title>
            <aug>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Kuhsel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>1995</pubdate>
            <volume>4</volume>
            <fpage>110</fpage>
            <lpage>128</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/mpev.1995.1012</pubid>
                  <pubid idtype="pmpid" link="fulltext">7663757</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Evidence that eukaryotes and eocyte prokaryotes are immediate relatives</p>
            </title>
            <aug>
               <au>
                  <snm>Rivera</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Lake</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1992</pubdate>
            <volume>257</volume>
            <fpage>74</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1621096</pubid>
                  <pubid idtype="pmpid" link="fulltext">1621096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Phylogeny and shared conserved inserts in proteins provide evidence that Verrucomicrobia are the closest known free-living relatives of chlamydiae</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2007</pubdate>
            <volume>153</volume>
            <fpage>2648</fpage>
            <lpage>2654</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/mic.0.2007/009118-0</pubid>
                  <pubid idtype="pmpid">17660429</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Molecular signatures in protein sequences that are characteristic of Cyanobacteria and plastid homologues</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Pereira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chandrasekera</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Johari</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <fpage>1833</fpage>
            <lpage>1842</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.02720-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">14657112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Protein signatures (molecular synapomorphies) that are distinctive characteristics of the major cyanobacterial clades</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2009</pubdate>
            <volume>59</volume>
            <fpage>2510</fpage>
            <lpage>2526</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.005678-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">19622649</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The origin and evolution of plastids and their genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Molecular Systematics of Plants II DNA Sequencing</source>
            <publisher>Norwell, MA, USA. Kluwer Academic Publishers</publisher>
            <editor>Sotis DE, Soltis PE, Doyle JJ</editor>
            <pubdate>1998</pubdate>
            <fpage>375</fpage>
            <lpage>409</lpage>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Signature proteins that are distinctive characteristics of Actinobacteria and their subgroups</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parmanathan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Antonie van Leeuwenhoek</source>
            <pubdate>2006</pubdate>
            <volume>90</volume>
            <fpage>69</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s10482-006-9061-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">16670965</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Phylogeny and molecular signatures (conserved proteins and indels) that are specific for the Bacteroidetes and Chlorobi species</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Lorenzini</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>71</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2148-7-71</pubid>
                  <pubid idtype="pmcid">1887533</pubid>
                  <pubid idtype="pmpid" link="fulltext">17488508</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phylogenomics and signature proteins for the alpha Proteobacteria and its main groups</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Mok</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Microbiol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>106</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2180-7-106</pubid>
                  <pubid idtype="pmcid">2241609</pubid>
                  <pubid idtype="pmpid" link="fulltext">18045498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Signature genes as a phylogenomic tool</p>
            </title>
            <aug>
               <au>
                  <snm>Dutilh</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ettema</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2008</pubdate>
            <volume>25</volume>
            <fpage>1659</fpage>
            <lpage>1667</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msn115</pubid>
                  <pubid idtype="pmcid">2464742</pubid>
                  <pubid idtype="pmpid" link="fulltext">18492663</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Cyanobacterial signatures genes</p>
            </title>
            <aug>
               <au>
                  <snm>Martin</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Siefert</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Yerrapragada</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>McNeill</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Moreno</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Widger</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>GE</fnm>
               </au>
            </aug>
            <source>Photosynth Res</source>
            <pubdate>2003</pubdate>
            <volume>75</volume>
            <fpage>211</fpage>
            <lpage>221</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1023990402346</pubid>
                  <pubid idtype="pmpid">16228602</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The cyanobacterial genome core and the origin of photosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Mulkidjanian</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Sorokin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Dufresne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Burd</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kaznadzey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Haselkorn</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>13126</fpage>
            <lpage>13131</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0605709103</pubid>
                  <pubid idtype="pmcid">1551899,1551899</pubid>
                  <pubid idtype="pmpid" link="fulltext">16924101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Toward automatic reconstruction of a highly resolved tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Ciccarelli</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Creevey</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <fpage>1283</fpage>
            <lpage>1287</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1123061</pubid>
                  <pubid idtype="pmpid" link="fulltext">16513982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Horizontal gene transfer, genome innovation and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Townsend</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Nat Rev Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>679</fpage>
            <lpage>687</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1204</pubid>
                  <pubid idtype="pmpid" link="fulltext">16138096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Intertwined evolutionary histories of marine Synechococcus and Prochlorococcus marinus</p>
            </title>
            <aug>
               <au>
                  <snm>Zhaxybayeva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Papke</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Genome Biology and Evolution</source>
            <pubdate>2009</pubdate>
            <volume>1</volume>
            <fpage>325</fpage>
            <lpage>339</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/gbe/evp032</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Nonhomogeneous model of sequence evolution indicates independent origins of primary endosymbionts within the enterobacteriales (gamma-Proteobacteria)</p>
            </title>
            <aug>
               <au>
                  <snm>Herbeck</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Degnan</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>520</fpage>
            <lpage>532</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi036</pubid>
                  <pubid idtype="pmpid" link="fulltext">15525700</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The genetic core of the universal ancestor</p>
            </title>
            <aug>
               <au>
                  <snm>Harris</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Spiegelman</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Pace</snm>
                  <fnm>NR</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>407</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.652803</pubid>
                  <pubid idtype="pmcid">430263</pubid>
                  <pubid idtype="pmpid" link="fulltext">12618371</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Complete genome structure of Gloeobacter violaceus PCC a cyanobacterium that lacks thylakoids</p>
            </title>
            <aug>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mimuro</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miyashita</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tsuchiya</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sasamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kishida</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kiyokawa</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsumoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsuno</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nakazaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shimpo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Takeuchi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>7421</pubdate>
            <volume>10</volume>
            <fpage>137</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/dnares/10.4.137</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Detection of seven major evolutionary lineages in cyanobacteria based on the 16S rRNA gene sequence analysis with new sequences of five marine <it>Synechococcus </it>strains</p>
            </title>
            <aug>
               <au>
                  <snm>Honda</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yokota</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sugiyama</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>48</volume>
            <fpage>723</fpage>
            <lpage>739</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006517</pubid>
                  <pubid idtype="pmpid" link="fulltext">10229577</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Evolutionary relationships among cyanobacteria and green chloroplasts</p>
            </title>
            <aug>
               <au>
                  <snm>Giovannoni</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Barns</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Pace</snm>
                  <fnm>NR</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1988</pubdate>
            <volume>170</volume>
            <fpage>3584</fpage>
            <lpage>3592</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">211332</pubid>
                  <pubid idtype="pmpid" link="fulltext">3136142</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The phylogenetic relationships of cyanobacteria inferred from 16S rRNA, gyrB, rpoC1 and rpoD1 gene sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Seo</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Yokota</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Gen Appl Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>49</volume>
            <fpage>191</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2323/jgam.49.191</pubid>
                  <pubid idtype="pmpid" link="fulltext">12949700</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Genome divergence in two Prochlorococcus ecotypes reflects oceanic niche differentiation</p>
            </title>
            <aug>
               <au>
                  <snm>Rocap</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Larimer</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Lamerdin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Malfatti</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chain</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ahlgren</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Arellano</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coleman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hauser</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>ZI</fnm>
               </au>
               <au>
                  <snm>Land</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lindell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Post</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Regala</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Shah</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Steglich</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Ting</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Tolonen</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Zinser</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Chisholm</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>424</volume>
            <fpage>1042</fpage>
            <lpage>1047</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01947</pubid>
                  <pubid idtype="pmpid" link="fulltext">12917642</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Resolution of Prochlorococcus and Synechococcus ecotypes by using 16S-23S ribosomal DNA internal transcribed spacer sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Rocap</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Distel</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Waterbury</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Chisholm</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>68</volume>
            <fpage>1180</fpage>
            <lpage>1191</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/AEM.68.3.1180-1191.2002</pubid>
                  <pubid idtype="pmcid">123739</pubid>
                  <pubid idtype="pmpid" link="fulltext">11872466</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Phylogenomics and protein signatures elucidating the evolutionary relationships among the Gammaproteobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mohan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2009</pubdate>
            <volume>59</volume>
            <fpage>234</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.002741-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">19196760</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Phylogenomic analysis of proteins that are distinctive of <it>Archaea </it>and its main subgroups and the origin of methanogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>86</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-8-86</pubid>
                  <pubid idtype="pmcid">1852104</pubid>
                  <pubid idtype="pmpid" link="fulltext">17394648</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Evolutionary Origins of Genomic Repertoires in Bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Daubin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e130</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0030130</pubid>
                  <pubid idtype="pmcid">1073693,1073693</pubid>
                  <pubid idtype="pmpid" link="fulltext">15799709</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Niche adaptation and genome expansion in the chlorophyll d-producing cyanobacterium Acaryochloris marina</p>
            </title>
            <aug>
               <au>
                  <snm>Swingley</snm>
                  <fnm>WD</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Conrad</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Dejesa</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Honchak</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Karbach</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Kurdoglu</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lahiri</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mastrian</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Miyashita</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Page</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ramakrishna</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Satoh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sattley</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Shimada</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>HL</fnm>
               </au>
               <au>
                  <snm>Tomo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tsuchiya</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>ZT</fnm>
               </au>
               <au>
                  <snm>Raymond</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mimuro</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Blankenship</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Touchman</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2008</pubdate>
            <volume>105</volume>
            <fpage>2005</fpage>
            <lpage>2010</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0709772105</pubid>
                  <pubid idtype="pmcid">2538872</pubid>
                  <pubid idtype="pmpid" link="fulltext">18252824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Complete genome structure of the thermophilic cyanobacterium <it>Thermosynechococcus elongatus BP-1</it></p>
            </title>
            <aug>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ikeuchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Katoh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Sasamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Iriguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kimura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kishida</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kiyokawa</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsumoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsuno</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nakazaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shimpo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sugimoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Takeuchi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>DNA Research</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <fpage>123</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/9.4.123</pubid>
                  <pubid idtype="pmpid" link="fulltext">12240834</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Investigating deep phylogenetic relationships among cyanobacteria and plastids by small subunit rRNA sequence analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Turner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pryer</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Miao</snm>
                  <fnm>VP</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>46</volume>
            <fpage>327</fpage>
            <lpage>338</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.1999.tb04612.x</pubid>
                  <pubid idtype="pmpid">10461381</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Complete genomic sequence of the filamentous nitrogen-fixing cyanobacterium <it>Anabaena sp</it>. strain PCC 7120</p>
            </title>
            <aug>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wolk</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Kuritz</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sasamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Iriguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kimura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kishida</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsumoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Matsuno</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Muraki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nakazaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shimpo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sugimoto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Takazawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yasuda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>205</fpage>
            <lpage>213</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/8.5.205</pubid>
                  <pubid idtype="pmpid" link="fulltext">11759840</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Distribution of split DnaE inteins in cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Caspi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Amitai</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Belenkiy</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pietrokovski</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>50</volume>
            <fpage>1569</fpage>
            <lpage>1577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2003.03825.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14651639</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Heterocyst formation in cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Adams</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>3</volume>
            <fpage>618</fpage>
            <lpage>624</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(00)00150-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">11121783</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Genome sequence of the cyanobacterium <it>Prochlorococcus marinus </it>SS120, a nearly minimal oxyphototrophic genome</p>
            </title>
            <aug>
               <au>
                  <snm>Dufresne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Salanoubat</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Artiguenave</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Axmann</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Duprat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Le Gall</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Ostrowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oztas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Scanlan</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>De Marsac</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>10020</fpage>
            <lpage>10025</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.1733211100</pubid>
                  <pubid idtype="pmcid">187748</pubid>
                  <pubid idtype="pmpid" link="fulltext">12917486</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>The genome of a motile marine Synechococcus</p>
            </title>
            <aug>
               <au>
                  <snm>Palenik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brahamsha</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Larimer</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Land</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hauser</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chain</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lamerdin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Regala</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>McCarren</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Dufresne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Waterbury</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>424</volume>
            <fpage>1037</fpage>
            <lpage>1042</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01943</pubid>
                  <pubid idtype="pmpid" link="fulltext">12917641</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Genome sequence of Synechococcus CC9311: Insights into adaptation to a coastal environment</p>
            </title>
            <aug>
               <au>
                  <snm>Palenik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Dupont</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Badger</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Sullivan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Mohamoud</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Halpin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>13555</fpage>
            <lpage>13559</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0602963103</pubid>
                  <pubid idtype="pmcid">1569201</pubid>
                  <pubid idtype="pmpid" link="fulltext">16938853</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Crystal structure of heme oxygenase-1 from cyanobacterium Synechocystis sp. PCC 6803 in complex with heme</p>
            </title>
            <aug>
               <au>
                  <snm>Sugishima</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Migita</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Yoshida</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fukuyama</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Eur J Biochem</source>
            <pubdate>2004</pubdate>
            <volume>271</volume>
            <fpage>4517</fpage>
            <lpage>4525</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1432-1033.2004.04411.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15560792</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Conformational changes in the catalytic cycle of protochlorophyllide oxidoreductase: what lessons can be learnt from dihydrofolate reductase?</p>
            </title>
            <aug>
               <au>
                  <snm>Heyes</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Scrutton</snm>
                  <fnm>NS</fnm>
               </au>
            </aug>
            <source>Biochem Soc Trans</source>
            <pubdate>2009</pubdate>
            <volume>37</volume>
            <fpage>354</fpage>
            <lpage>357</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1042/BST0370354</pubid>
                  <pubid idtype="pmpid" link="fulltext">19290861</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Physiology and molecular phylogeny of coexisting Prochlorococcus ecotypes</p>
            </title>
            <aug>
               <au>
                  <snm>Moore</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Rocap</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chisholm</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>393</volume>
            <fpage>464</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/30965</pubid>
                  <pubid idtype="pmpid" link="fulltext">9624000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Niche adaptation in ocean cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Ferris</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Palenik</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>396</volume>
            <fpage>226</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/24297</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Structural features and the persistence of acquired proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Narra</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Cordes</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proteomics</source>
            <pubdate>2008</pubdate>
            <volume>8</volume>
            <fpage>4772</fpage>
            <lpage>4781</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/pmic.200800061</pubid>
                  <pubid idtype="pmpid" link="fulltext">18924109</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Phylogenomics and protein signatures elucidating the evolutionary relationships among the <it>Gammaproteobacteria</it></p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mohan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2009</pubdate>
            <volume>59</volume>
            <fpage>234</fpage>
            <lpage>247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.002741-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">19196760</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Analysis of singleton ORFans in fully sequenced microbial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Siew</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <fpage>241</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.10423</pubid>
                  <pubid idtype="pmpid" link="fulltext">14517975</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>The fate of new bacterial genes</p>
            </title>
            <aug>
               <au>
                  <snm>Kuo</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>2009</pubdate>
            <volume>33</volume>
            <fpage>38</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1574-6976.2008.00140.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">19054121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Persistence drives gene clustering in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Fang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rocha</snm>
                  <fnm>EP</fnm>
               </au>
               <au>
                  <snm>Danchin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>4</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-9-4</pubid>
                  <pubid idtype="pmcid">2234087</pubid>
                  <pubid idtype="pmpid" link="fulltext">18179692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Molecular signatures (unique proteins and conserved Indels) that are specific for the epsilon proteobacteria (Campylobacterales)</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>167</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-7-167</pubid>
                  <pubid idtype="pmcid">1557499</pubid>
                  <pubid idtype="pmpid" link="fulltext">16817973</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Chlamydiae-specific proteins and indels: novel tools for studies</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <fpage>527</fpage>
            <lpage>535</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tim.2006.10.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">17049238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Prokaryotic evolution in light of gene transfer</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>2226</fpage>
            <lpage>2238</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12446813</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Whole-genome analysis of photosynthetic prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Raymond</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhaxybayeva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Gerdes</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Blankenship</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <fpage>1616</fpage>
            <lpage>1620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075558</pubid>
                  <pubid idtype="pmpid" link="fulltext">12446909</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Rare genomic changes as a tool for phylogenetics</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2000</pubdate>
            <volume>15</volume>
            <fpage>454</fpage>
            <lpage>459</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0169-5347(00)01967-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11050348</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Ancient gene transfer as a tool in phylogenetic reconstruction</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>2009</pubdate>
            <volume>532</volume>
            <fpage>127</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">full_text</pubid>
                  <pubid idtype="pmpid" link="fulltext">19271182</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>Quantitative phylogenetic assessment of microbial communities in diverse environments</p>
            </title>
            <aug>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hugenholtz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Raes</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tringe</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2007</pubdate>
            <volume>315</volume>
            <fpage>1126</fpage>
            <lpage>1130</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1133420</pubid>
                  <pubid idtype="pmpid" link="fulltext">17272687</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Functional clues for hypothetical proteins based on genomic context analysis in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Doerks</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>von Mering</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>6321</fpage>
            <lpage>6326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh973</pubid>
                  <pubid idtype="pmcid">535681</pubid>
                  <pubid idtype="pmpid" link="fulltext">15576358</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>How essential are nonessential genes?</p>
            </title>
            <aug>
               <au>
                  <snm>Fang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rocha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Danchin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <fpage>2147</fpage>
            <lpage>2156</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi211</pubid>
                  <pubid idtype="pmpid" link="fulltext">16014871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>The power of phylogenetic comparison in revealing protein function</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>3179</fpage>
            <lpage>3180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0500371102</pubid>
                  <pubid idtype="pmcid">552944</pubid>
                  <pubid idtype="pmpid" link="fulltext">15728394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Conserved inserts in the Hsp60 (GroEL) and Hsp70 (DnaK) proteins are essential for cellular growth</p>
            </title>
            <aug>
               <au>
                  <snm>Singh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2009</pubdate>
            <volume>281</volume>
            <fpage>361</fpage>
            <lpage>373</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00438-008-0417-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">19127371</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Identifying protein function--a call for community action</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>RJ</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <fpage>E42</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0020042</pubid>
                  <pubid idtype="pmcid">368155</pubid>
                  <pubid idtype="pmpid" link="fulltext">15024411</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>'Conserved hypothetical' proteins: prioritization of targets for experimental study</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>5452</fpage>
            <lpage>5463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh885</pubid>
                  <pubid idtype="pmcid">524295</pubid>
                  <pubid idtype="pmpid" link="fulltext">15479782</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>From protein sequence to function</p>
            </title>
            <aug>
               <au>
                  <snm>Danchin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>363</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(99)80049-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10408894</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Multiple sequence alignment with Clustal x</p>
            </title>
            <aug>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gouy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <fpage>403</fpage>
            <lpage>405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01285-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">9810230</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Castresana</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>540</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10742046</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The Neutral Theory of Molecular Evolution</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>1983</pubdate>
         </bibl>
         <bibl id="B80">
            <title>
               <p>TREECON for Windows: a software package for the construction and drawing of evolutionary trees for the Microsoft Windows environment</p>
            </title>
            <aug>
               <au>
                  <snm>Peer</snm>
                  <mnm>Van de</mnm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>De Wachter</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1994</pubdate>
            <volume>10</volume>
            <fpage>569</fpage>
            <lpage>570</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7828077</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing</p>
            </title>
            <aug>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>von Haeseler</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>502</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/18.3.502</pubid>
                  <pubid idtype="pmpid" link="fulltext">11934758</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein databases search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>Complete nucleotide sequence of the freshwater unicellular cyanobacterium Synechococcus elongatus PCC 6301 chromosome: gene content and organization</p>
            </title>
            <aug>
               <au>
                  <snm>Sugita</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ogata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shikata</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jikuya</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Takano</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Furumichi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Omata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sugiura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sugita</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Photosynth Res</source>
            <pubdate>2007</pubdate>
            <volume>93</volume>
            <fpage>55</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11120-006-9122-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">17211581</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Unraveling the genomic mosaic of a ubiquitous genus of marine cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Dufresne</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ostrowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Scanlan</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Garczarek</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mazard</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Palenik</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>De Marsac</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dossat</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ferriera</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Post</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2008</pubdate>
            <volume>9</volume>
            <fpage>R90</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2008-9-5-r90</pubid>
                  <pubid idtype="pmcid">2441476</pubid>
                  <pubid idtype="pmpid" link="fulltext">18507822</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Sequence analysis of the genome of the unicellular cyanobacterium <it>Synechocystis sp</it>. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions</p>
            </title>
            <aug>
               <au>
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kotani</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Asamizu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Miyajima</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hirosawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sugiura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sasamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kimura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hosouchi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Matsuno</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Muraki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nakazaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Naruo</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Okumura</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shimpo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Takeuchi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wada</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yasuda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tabata</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>DNA Research</source>
            <pubdate>1996</pubdate>
            <volume>3</volume>
            <fpage>109</fpage>
            <lpage>136</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/3.3.109</pubid>
                  <pubid idtype="pmpid" link="fulltext">8905231</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

