<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-7-71</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Phylogeny and molecular signatures (conserved proteins and indels) that are specific for the Bacteroidetes and Chlorobi species</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Gupta</snm>
               <mi>S</mi>
               <fnm>Radhey</fnm>
               <insr iid="I1"/>
               <email>gupta@mcmaster.ca</email>
            </au>
            <au id="A2">
               <snm>Lorenzini</snm>
               <fnm>Emily</fnm>
               <insr iid="I1"/>
               <email>lorenze@muss.cis.mcmaster.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Biochemistry and Biomedical Science, McMaster University, Hamilton, L8N3Z5, Canada</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2007</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>71</fpage>
         <url>http://www.biomedcentral.com/1471-2148/7/71</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17488508</pubid>
               <pubid idtype="doi">10.1186/1471-2148-7-71</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>21</day>
               <month>12</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>08</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>08</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Gupta and Lorenzini; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The <it>Bacteroidetes </it>and <it>Chlorobi </it>species constitute two main groups of the <it>Bacteria </it>that are closely related in phylogenetic trees. The <it>Bacteroidetes </it>species are widely distributed and include many important periodontal pathogens. In contrast, all <it>Chlorobi </it>are anoxygenic obligate photoautotrophs. Very few (or no) biochemical or molecular characteristics are known that are distinctive characteristics of these bacteria, or are commonly shared by them.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Systematic blast searches were performed on each open reading frame in the genomes of <it>Porphyromonas gingivalis </it>W83, <it>Bacteroides fragilis </it>YCH46, <it>B. thetaiotaomicron </it>VPI-5482, <it>Gramella forsetii KT0803, Chlorobium luteolum </it>(formerly <it>Pelodictyon luteolum</it>) DSM 273 and <it>Chlorobaculum tepidum </it>(formerly <it>Chlorobium tepidum</it>) TLS to search for proteins that are uniquely present in either all or certain subgroups of <it>Bacteroidetes </it>and <it>Chlorobi</it>. These studies have identified > 600 proteins for which homologues are not found in other organisms. This includes 27 and 51 proteins that are specific for most of the sequenced <it>Bacteroidetes </it>and <it>Chlorobi </it>genomes, respectively; 52 and 38 proteins that are limited to species from the <it>Bacteroidales </it>and <it>Flavobacteriales </it>orders, respectively, and 5 proteins that are common to species from these two orders; 185 proteins that are specific for the <it>Bacteroides </it>genus. Additionally, 6 proteins that are uniquely shared by species from the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla (one of them also present in the <it>Fibrobacteres</it>) have also been identified. This work also describes two large conserved inserts in DNA polymerase III (DnaE) and alanyl-tRNA synthetase that are distinctive characteristics of the <it>Chlorobi </it>species and a 3 aa deletion in ClpB chaperone that is mainly found in various <it>Bacteroidales, Flavobacteriales </it>and <it>Flexebacteraceae</it>, but generally not found in the homologs from other organisms. Phylogenetic analyses of the <it>Bacteroidetes </it>and <it>Chlorobi </it>species is also reported based on concatenated sequences for 12 conserved proteins by different methods including the character compatibility (or clique) approach. The placement of <it>Salinibacter ruber </it>with other <it>Bacteroidetes </it>species was not resolved by other phylogenetic methods, but this affiliation was strongly supported by the character compatibility approach.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The molecular signatures described here provide novel tools for identifying and circumscribing species from the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla as well as some of their main groups in clear terms. These results also provide strong evidence that species from these two phyla (and also possibly <it>Fibrobacteres</it>) are specifically related to each other and they form a single superphylum. Functional studies on these proteins and indels should aid in the discovery of novel biochemical and physiological characteristics that are unique to these groups of bacteria.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="refman"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The <it>Bacteroidetes </it>and <it>Chlorobi </it>presently comprise two of the main phyla within the <it>Bacteria </it><abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. The bacteria from the <it>Bacteroidetes </it>phylum (previously known as the Cytophaga-Flavobacteria-Bacteroides (CFB) group) exhibit a <it>potpourri </it>of phenotypes including gliding behavior and their ability to digest and grow on a variety of complex substrates such as cellulose, chitin and agar <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. They inhabit diverse habitats including the oral cavity of humans, the gastrointestinal tracts of mammals, saturated thalassic brines, soil and fresh water <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. The <it>Bacteroides </it>species such as <it>B. thetaiotaomicron </it>and <it>B. fragilis </it>are among the dominant microbes in the large intestine of human and other animals <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. These bacteria in the human colon are also important opportunistic pathogens and they are involved in causing abscesses and soft tissue infections of the gastrointestinal tract, as well as diarrheal diseases <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>. Other bacteroidetes species, such as <it>Porphyromonas gingivalis </it>and <it>Prevotella intermedia</it>, are major causative agents in the initiation and progression of periodontal disease in humans <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>.</p>
         <p>In contrast to wide distribution of <it>Bacteroidetes </it>species in diverse habitats, bacteria from the phylum <it>Chlorobi </it>occupy a narrow environmental niche mainly consisting of anoxic aquatic settings in stratified lakes (chemocline regions), where sunlight is able to penetrate <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. Some of these bacteria also exist as epibionts in phototrophic consortiums with other bacteria, particularly &#946;-proteobacteria <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B25">25</abbr></abbrgrp>. The <it>Chlorobi</it>, which are also commonly known as Green Sulfur bacteria, are all anoxygenic obligate photoautotrophs, which obtain electrons for anaerobic photosynthesis from hydrogen sulfide <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B26">26</abbr></abbrgrp>. Although the <it>Bacteroidetes </it>and <it>Chlorobi </it>are presently recognized as two distinct phyla <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr></abbrgrp>, these two groups are closely related in phylogenetic trees based on 16S rRNA as well other gene sequences <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Conserved indels (i.e. inserts or deletions) in a number of widely distributed proteins (viz. FtsK, UvrB and ATP synthase &#945; subunit), that are uniquely present in species from these two groups, also strongly indicate that these two groups of species shared a common ancestor exclusive of all other bacteria <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
         <p>The species from the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla are presently distinguished from other bacteria primarily on the basis of their branching in phylogenetic trees <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B27">27</abbr></abbrgrp>. We have previously described a 4 aa conserved insert in DNA Gyrase B as well as a 45 aa conserved insert in SecA protein that were specific for the <it>Bacteroidetes </it>species <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. In <it>Chlorobi </it>as well as <it>Chloroflexi </it>species, their light harvesting pigments are located in unique membrane-attached sac-like structures referred to as 'chlorosomes' <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B24">24</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. A number of genes involved in the synthesis of chlorosomes components in <it>Chlorobi </it>have been identified by genomic and mutational analysis <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and a few of them, viz. Fenna-Matthew-Olson (FMO) protein <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, are unique for this group <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B34">34</abbr></abbrgrp>. However, the number of characteristics that are either unique to species from these two phyla, or are commonly shared by members of these phyla, are very limited. In the past few years, complete genomes of several <it>Bacteroidetes </it>and <it>Chlorobi </it>species have become available (see Table <tblr tid="T1">1</tblr>). Additionally, sequencing of genomes for many other <it>Bacteroidetes/Chlorobi </it>species is at different stages of completion (see Table <tblr tid="T1">1</tblr>), but considerable sequence information for these genomes is available in the NCBI database.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>General Characteristics of <it>Bacteroidetes/Chlorobi </it>Genomes</p>
            </caption>
            <tblbdy cols="7">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Strain Name</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Taxonomic Order</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p><b>Genome Size (Mb)</b>**</p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>GC Content (%)</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Protein Number</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>Reference#</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>Bacteroidetes</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p><it>Porphyromonas gingivalis </it>W83<sup>#</sup></p>
                  </c>
                  <c ca="center">
                     <p>Bacteroidales</p>
                  </c>
                  <c ca="center">
                     <p>2.34</p>
                  </c>
                  <c ca="center">
                     <p>48.3</p>
                  </c>
                  <c ca="center">
                     <p>1909</p>
                  </c>
                  <c ca="center">
                     <p>[58]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Bacteroides fragilis </it>NCTC 9343<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5.24</p>
                  </c>
                  <c ca="center">
                     <p>44</p>
                  </c>
                  <c ca="center">
                     <p>4184</p>
                  </c>
                  <c ca="center">
                     <p>[16]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Bacteroides fragilis </it>YCH46<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5.31</p>
                  </c>
                  <c ca="center">
                     <p>33.5</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>[35]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Bacteroides thetaiotaomicron </it>VPI-5482<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>6.29</p>
                  </c>
                  <c ca="center">
                     <p>42</p>
                  </c>
                  <c ca="center">
                     <p>4778</p>
                  </c>
                  <c ca="center">
                     <p>[15]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Gramella forsetii </it>KT0803<sup>#</sup></p>
                  </c>
                  <c ca="center">
                     <p>Flavobacteriales</p>
                  </c>
                  <c ca="center">
                     <p>3.8</p>
                  </c>
                  <c ca="center">
                     <p>36.6</p>
                  </c>
                  <c ca="center">
                     <p>3559</p>
                  </c>
                  <c ca="center">
                     <p>[59]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Flavobacteria bacterium <it>BBFL7</it>*</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>35.0</p>
                  </c>
                  <c ca="center">
                     <p>2592</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Flavobacteriales bacterium </it>HTCC2170*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>37.0</p>
                  </c>
                  <c ca="center">
                     <p>3478</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Flavobacterium johnsoniae </it>UW101*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>35.2</p>
                  </c>
                  <c ca="center">
                     <p>4985</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Flavobacterium sp</it>. MED217*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>39.8</p>
                  </c>
                  <c ca="center">
                     <p>3735</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Cellulophaga sp</it>. MED134*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>38.2</p>
                  </c>
                  <c ca="center">
                     <p>2944</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Croceibacter atlanticus </it>HTCC2559*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>33.9</p>
                  </c>
                  <c ca="center">
                     <p>2719</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Polaribacter irgensii </it>23-P*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>31.0</p>
                  </c>
                  <c ca="center">
                     <p>2557</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Psychroflexus torquis </it>ATCC 700755*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>32&#8211;33.0</p>
                  </c>
                  <c ca="center">
                     <p>6751</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Robiginitalea biformata </it>HTCC2501*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>56.4</p>
                  </c>
                  <c ca="center">
                     <p>3228</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Tenacibaculum sp</it>. MED152*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>5</p>
                  </c>
                  <c ca="center">
                     <p>30.6</p>
                  </c>
                  <c ca="center">
                     <p>2679</p>
                  </c>
                  <c ca="center">
                     <p>a</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Cytophaga hutchinsonii </it>ATCC 33406<sup>#</sup></p>
                  </c>
                  <c ca="center">
                     <p>Sphingobacteriales</p>
                  </c>
                  <c ca="center">
                     <p>4.43</p>
                  </c>
                  <c ca="center">
                     <p>38.8</p>
                  </c>
                  <c ca="center">
                     <p>3785</p>
                  </c>
                  <c ca="center">
                     <p>CP000383</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Salinibacter ruber </it>DSM 13855<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>3.59</p>
                  </c>
                  <c ca="center">
                     <p>66.5</p>
                  </c>
                  <c ca="center">
                     <p>2801</p>
                  </c>
                  <c ca="center">
                     <p>[36]</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>Chlorobi</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobium chlorochromatii </it>CaD3<sup>#</sup></p>
                  </c>
                  <c ca="center">
                     <p>Chlorobia</p>
                  </c>
                  <c ca="center">
                     <p>2.57</p>
                  </c>
                  <c ca="center">
                     <p>44.3</p>
                  </c>
                  <c ca="center">
                     <p>2002</p>
                  </c>
                  <c ca="center">
                     <p>CP000108</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobium limicola </it>DSM 245*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>2.4</p>
                  </c>
                  <c ca="center">
                     <p>51.3</p>
                  </c>
                  <c ca="center">
                     <p>2435</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Chlorobium phaeobacteroides <it>BS1</it>*</b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>2.</p>
                  </c>
                  <c ca="center">
                     <p>45.5</p>
                  </c>
                  <c ca="center">
                     <p>3791</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobium phaeobacteroides </it>DSM 266*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>2.4</p>
                  </c>
                  <c ca="center">
                     <p>48.3</p>
                  </c>
                  <c ca="center">
                     <p>2789</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobaculum tepidum </it>formerly <it>Chlorobium tepidum </it>TLS<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>2.15</p>
                  </c>
                  <c ca="center">
                     <p>56</p>
                  </c>
                  <c ca="center">
                     <p>2252</p>
                  </c>
                  <c ca="center">
                     <p>[24]</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobium luteolum </it>formerly <it>Pelodictyon luteolum </it>DSM 273<sup>#</sup></p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>2.36</p>
                  </c>
                  <c ca="center">
                     <p>57.3</p>
                  </c>
                  <c ca="center">
                     <p>2083</p>
                  </c>
                  <c ca="center">
                     <p>CP000096</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobium clathratiforme </it>formerly <it>Pelodictyon phaeoclathratiforme </it>BU-1*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>48.1</p>
                  </c>
                  <c ca="center">
                     <p>2762</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Prosthecochloris aestuarii </it>DSM 271*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>50.1</p>
                  </c>
                  <c ca="center">
                     <p>2313</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p><it>Chlorobium phaeovibrioides </it>formerly <it>Prosthecochloris vibrioformis </it>DSM 265*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>53.0</p>
                  </c>
                  <c ca="center">
                     <p>1747</p>
                  </c>
                  <c ca="center">
                     <p>DOE-JGI</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p><sup>#</sup>Indicates a completely sequenced genome. The references to the published genomes are provided. For others that are fully sequenced but not published, accession numbers for the genomes are given.</p>
               <p>* Indicates that these genomes are at draft assembly stages. The information regarding genome sizes etc. in these cases have been obtained from the NCBI microbial sequence database.</p>
               <p>The revised names for a number of <it>Chlorobi </it>species are as proposed in Ref. 45.</p>
               <p><b># </b>The sequences marked 'a' are being sequenced by Gordon and Betty Moore Foundation Marine Biotechnology Intiative; DOE-JGE, indicates Department of Energy Joint Genome Institute. For some of the completed genomes, which are not published, their accession numbers are provided.</p>
            </tblfn>
         </tbl>
         <p>The availability of genomic sequences provide an opportunity to carry out in depth studies to identify novel molecular characteristics that are unique to these groups of bacteria and can be used for their diagnostics as well as biochemical and functional studies. Earlier comparative genomic studies on <it>Bacteroidetes/Chlorobi </it>species have been limited to only a few species and they have focused on specific aspects. The studies by Kuwahara <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> and Cerde&#241;o-T&#225;rraga et al. <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, who sequenced the genomes of <it>B. fragilis </it>strains, revealed that these genomes contained extensive DNA inversions in comparison to <it>B. thetaiotaomicron</it>. These inversion events are indicated to be important in terms of generating cell surface variability in these bacteria to avoid their recognition by the immune system. Large expansion of genes involved in the biosynthesis of cell surface polysaccharides and other antigens was also noted in these genomes <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B35">35</abbr></abbrgrp>. A comparative analysis by Eisen et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> of <it>C. tepidum </it>TLS genome identified many probable cases of lateral gene transfers (LGTs) between this species and <it>Archaea</it>; in all about 12% of <it>C. tepidum</it>'s proteins were indicated to be most similar to those from the archaea. Similarly, the analysis of <it>S. ruber </it>genome by Mongodin et al. <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> has identified many cases of potential LGT between <it>S. ruber </it>and haloarchaea, particularly involving the rhodopsin genes.</p>
         <p>In our recent work, we have used comparative genomics to systematically identify various proteins that are uniquely found in either all members, or particular subgroups, of a number of important groups of prokaryotes. These studies have identified large number of proteins that are specific for alpha proteobacteria <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, chlamydiae <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, Actinobacteria <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, epsilon proteobacteria <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> and Archaea <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Such genes and proteins, because of their specificity for different phylogenetic or taxonomic groups, provide novel means for diagnostics and evolutionary studies <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> and for the discovery of important biochemical and physiological characteristics that are unique to these groups of prokaryotes. However, thus far no comparative study has examined different genes/proteins that are uniquely present in species from the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla or are commonly shared by species from these two groups. In order to identify proteins that are uniquely found in the <it>Bacteroidetes </it>and/or <it>Chlorobi </it>groups of species, we have carried out systematic blast searches on all open reading frames in the genomes of <it>P. gingivalis </it>W83, <it>B. fragilis </it>YCH46, <it>B. thetaiotaomicron </it>VPI-5482, <it>G. forsetii KT0803</it>, <it>C. luteolum </it>DSM 273 and <it>C. tepidum </it>TLS against all available sequences in the NCBI non-redundant (nr) database. This has led to identification of large numbers of proteins that are distinctive characteristics of species from different taxonomic groups within the <it>Bacteroidetes </it>phylum (e.g. specific for the <it>Bacteroides </it>genus, specific for the <it>Bacteroidales </it>and <it>Flavobacteriales </it>orders, or specific for the entire <it>Bacteroidetes </it>phylum). Additionally, large numbers of proteins that are specific for the <it>Chlorobi </it>species as well as some proteins that are uniquely shared by the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla have also been identified. This work also describes three conserved indels in important housekeeping proteins (viz. alanyl-tRNA synthetase, DNA polymerase subunit III and ClpB) that are distinctive characteristics of either the  <it>Chlorobi </it>or the<it/><it> Bacteroidales-Flavobacteriales</it>-<it>Flexebacteraceae </it>groups. Phylogenetic analyses of the <it>Bacteroidetes </it>and <it>Chlorobi </it>were also carried out based on a concatenated sequence alignment for 12 highly conserved proteins and the results of these analyses support the inferences derived from the species distribution of various molecular signatures.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Phylogenetic analyses of Bacteroidetes and Chlorobi species</p>
            </st>
            <p>Table <tblr tid="T1">1</tblr> lists some characteristics of various <it>Bacteroidetes </it>and <it>Chlorobi </it>genomes that have been completely sequenced as well as for many others that are currently being sequenced. The taxonomy of the <it>Chlorobi </it>species has undergone significant revision in the past few years leading to name changes and new taxonomic groupings for a number of species <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. The newly proposed and former names for some of the species that will be discussed in the present work are as follows: <it>Chlorobium tepidum </it>changed to <it>Chlorobaculum tepidum</it>; <it>Pelodictyon luteolum </it>changed to <it>Chlorobium luteolum</it>; <it>Prosthecochloris vibrioformis </it>changed to <it>Chlorobium phaeovibrioides</it>; <it>Pelodictyon phaeoclathratiforme </it>changed to <it>Chlorobium clathratiforme</it>. Although many of these species in the databases are still referred to by their former names, we have used the revised nomenclature in our work to interpret the results of evolutionary and comparative genomic studies.</p>
            <p>Prior to undertaking comparative analyses of <it>Bacteroidetes </it>and <it>Chlorobi </it>genomes, phylogenetic analyses on these species were carried out to get an overview of their evolutionary relationships, which can serve as a reference point for comparative genomic analyses. Phylogenetic analysis of <it>Bacteroidetes </it>and <it>Chlorobi </it>species has been carried out previously using 16S rRNA sequences and a few isolated protein sequences <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. However, recent studies show that analyses based on larger dataset derived from multiple genes/proteins sequences provide a more reliable phylogenetic inference <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Hence, phylogenetic analyses for these species was carried out based on concatenated sequences for 12 highly conserved proteins involved in a broad range of functions (see Methods section). The final sequence alignment for phylogenetic analysis contained a total of 6998 aligned positions.</p>
            <p>Phylogenetic trees were constructed using the neighbour-joining (NJ), maximum-likelihood (ML) and maximum-parsimony (MP) methods <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. The results of these analyses for the NJ and ML methods are presented in Fig. <figr fid="F1">1</figr>. The trees were rooted using the sequences for <it>Deinococcus-Thermus </it>species. The tree in both cases consisted of two well-resolved clades (100% bootstrap scores by both treeing methods), one comprising of various <it>Chlorobi </it>species and the other containing various Cytophaga-Flavobacteria-Bacteroides (CFB) species. In these trees, <it>S. ruber </it>appeared as a deep branching outgroup of the <it>Chlorobi </it>clade, but in view of the low bootstrap score of the node indicating this relationship and the long branch that separated them, this relationship was not reliable. The topology of various species in the MP tree was very similar, except that in this tree <it>S. ruber </it>formed the outgroup of <it>Chlorobi </it>as well as various other CFB group of bacteria (results not shown). In addition to the uncertain position of <it>S. ruber</it>, different species belonging to the genus <it>Flavobacterium </it>did not form a coherent phylogenetic group (Fig. <figr fid="F1">1</figr>). For most other <it>Bacteroidetes </it>species, sequence information for multiple species from the same genera was not available.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Neighbour-joining tree based on concatenated sequences for 12 highly conserved proteins</p>
               </caption>
               <text>
                  <p>Neighbour-joining tree based on concatenated sequences for 12 highly conserved proteins. The tree was rooted using sequences for <it>Deinococcus-Thermus </it>species and numbers on the nodes indicate bootstrap scores in the NJ and maximum-likelihood analyses (NJ/MP). The branching position of <it>G. forsetii</it>, which became available after this analysis was completed, is not shown. However, our analysis of a smaller dataset indicates that it exhibits closest affinity for the flavobacteria <it>Psychrobacter torquis </it>(results not shown).</p>
               </text>
               <graphic file="1471-2148-7-71-1"/>
            </fig>
            <p>Phylogenetic analysis on the concatenated dataset was also performed employing the character compatibility approach <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. In this approach, all sites in the sequence alignment where only two amino acid states are found, with each state present in at least two species, are examined for mutual compatibility to find the largest clique of mutually compatible characters <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. By removing all homoplasic as well as fast-evolving characters from dataset, this approach provides a powerful means for obtaining correct topology in difficult to resolve cases <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. Our concatenated dataset for the 12 proteins contained 832 positions where only two amino acids were found, with each amino acid present in a minimum of two species. The mutual compatibility of these binary state sites was determined as described in the Methods section.</p>
            <p>The compatibility analysis identified two largest sets of compatible characters (referred to as cliques) each containing 410 characters. These cliques were identical in all respects except that the relative branching positions of <it>Chlorobaculum tepidum </it>and <it>Chlorobium chlorochromatii</it>, which differed by a single character, were interchanged. A composite of these cliques, in which the branching positions of these two species are not resolved, is shown in Fig. <figr fid="F2">2</figr>. A large number of characters (i.e. 200) in this clique distinguished the <it>Chlorobi-Bacteroidetes </it>species from the two <it>Deinococcus-Thermus </it>species, which were included to serve as outgroup. The clique is comprised of two main clades, one consisting of various species belonging to the <it>Bacteroidetes </it>group and the other of different <it>Chlorobi </it>species. The species from each of these clades were distinguished by a large numbers of characters. In the <it>Bacteroidetes </it>clade, <it>S. ruber </it>was found to be the deepest branching lineage and its specific association with other <it>Bacteroidetes </it>species was supported by 21 uniquely shared characters, which is a highly significant result <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Additionally, the two main orders within the <it>Bacteroidetes </it>viz. <it>Bacteriodales </it>and <it>Flavobacteriales</it>, for which sequence information is available from multiple species, were clearly distinguished based upon multiple characters. However, these analyses detected no uniquely shared character between the <it>C. hutchinsonii </it>and <it>S. ruber</it>. Although, these two species are currently placed in the order <it>Sphingobacteriales</it>, phylogenetic trees do not support a specific grouping of them (see Fig. <figr fid="F1">1</figr>). Different <it>Flavobacterium </it>species again did not group together indicating that this genus does not constitute a phylogenetically coherent taxon. Within the <it>Chlorobi </it>clade, <it>Prosthecochloris aesturaii </it>was found to be the deepest branching lineage, but branching order of other <it>Chlorobi </it>species was not resolved.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Character compatibility tree (or the largest clique of mutually compatible characters) based on two states sites in the concatenated sequence alignment for the 12 proteins</p>
               </caption>
               <text>
                  <p>Character compatibility tree (or the largest clique of mutually compatible characters) based on two states sites in the concatenated sequence alignment for the 12 proteins. The clique consisted of 410 mutually compatible characters. The numbers of characters that distinguished different clades are indicated on the nodes. Rooting was done using the sequences for <it>Deinococcus-Thermus </it>species.</p>
               </text>
               <graphic file="1471-2148-7-71-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Comparative Genomic Studies on <it>Bacteroidetes </it>and <it>Chlorobi </it>Species</p>
            </st>
            <p>To identify proteins that are uniquely present in species from the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla at various taxonomic levels (genus and above), systematic Blastp searches were performed on each protein or ORF in the genomes of the following species: <it>Bacteroides </it>(<it>P. gingivalis </it>W83 <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, <it>B. fragilis </it>YCH46 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, <it>B. thetaiotaomicron </it>VPI-5482 <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>); <it>Flavobacteria </it>(<it>G. forsetii str</it>. KT0803 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>); <it>Chlorobi </it>(<it>C. luteolum </it>DSM 273 and <it>C. tepidum </it>TLS <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>). These analyses have identified a large number of proteins that are specific for different taxonomic groups. A brief description of these signature proteins and their evolutionary significances are discussed below.</p>
            <sec>
               <st>
                  <p>Proteins (or ORFs) that are Specific for the Species within the Bacteroidetes Phylum</p>
               </st>
               <p>The blast searches on various ORFs from <it>P. gingivalis</it>, <it>B. fragilis </it>YCH46 and <it>B. thetaiotaomicron </it>genomes have identified 27 proteins that are present in most of the species from the <it>Bacteroidetes </it>phylum (Table <tblr tid="T2">2</tblr>). In addition to most fully sequenced <it>Bacteroidetes </it>genomes, the homologs of these proteins are also present in most <it>Bacteroidetes </it>species whose genomes have been partially sequenced. One of these proteins, PG0448, is present in all of the <it>Bacteroidetes </it>species, whose genomes have been either partially or fully sequenced. Two other proteins, PG1850 and PG2092, are present in all fully as well as partially sequenced genomes, except those from the <it>Bacteroides </it>genus. The absence of these proteins in only the species from this genus is very likely due to selective gene loss from this lineage and their genes also likely originated in a common ancestor of the <it>Bacteroidetes </it>phylum. Similarly, four proteins viz. PG0449, PG0779, PG1679 and PG2066, are present in virtually all other <it>Bacteroidetes </it>genomes, but they are missing in <it>C. hutchinsonii</it>. Their absence again is very likely due to selective gene loss from <it>C. hutchinsonii</it>. Eight additional proteins viz. PG0202, PG0362, PG0399, PG0482, PG0621, PG01281, PG1367 and PG1394, are present in all other fully as well as partially sequenced <it>Bacteroidetes </it>genomes, but they are only missing in <it>S. ruber</it>. The absence of these proteins in <it>S. ruber </it>can also be explained by selective gene loss. However, in view of the fact that <it>S. ruber </it>branches very deeply in comparison to all other <it>Bacteroidetes </it>species (Figs. <figr fid="F1">1</figr> and <figr fid="F2">2</figr>), it is also possible that their genes evolved in a common ancestor of the other <it>Bacteroidetes </it>species after the divergence of <it>S. ruber</it>.</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Proteins that are Specific for the Phylum Bacteroidetes</p>
                  </caption>
                  <tblbdy cols="5">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Protein Name</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Accession No.</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <it>Length</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <it>Comments</it>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="5">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0202</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904537">NP_904537</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>165</p>
                        </c>
                        <c ca="left">
                           <p>Uroporphyrinogen-III synthase HemD, putative; COG1587, HemD; pfam02602, HEM4</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it><sup>+</sup></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0362</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904673">NP_904673</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>722</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0399</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904705">NP_904705</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>156</p>
                        </c>
                        <c ca="left">
                           <p>Putative lipoprotein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0448</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904748">NP_904748</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>434</p>
                        </c>
                        <c ca="left">
                           <p>Toluene &#215; outer membrane/transport protein (OMPP1/FadL/TodX); pfam03349</p>
                        </c>
                        <c ca="left">
                           <p>All species present</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0449</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904749">NP_904749</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>441</p>
                        </c>
                        <c ca="left">
                           <p>TPR domain protein; cd00189, TPR; COG3071, HemY; COG4783, Putative Zn-dependent protease</p>
                        </c>
                        <c ca="left">
                           <p>Not found in <it>F. johnsoniae </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0482</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904777">NP_904777</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>143</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0621</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_904906">NP_904906</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>182</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG0779</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905041">NP_905041</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>157</p>
                        </c>
                        <c ca="left">
                           <p>ExbD, Biopolymer transport protein; COG0848</p>
                        </c>
                        <c ca="left">
                           <p>Not found in <it>F. bacterium HTC</it>, <it>F. johnsoniae </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG1281</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905462">NP_905462</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>387</p>
                        </c>
                        <c ca="left">
                           <p>Putative DNA mismatch repair protein; pfam01713, Smr; COG1193, Mismatch repair ATPase</p>
                        </c>
                        <c ca="left">
                           <p>Not found in <it>C. atlanticus </it>and <it>S. ruber</it><sup>+</sup></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG1367</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905532">NP_905532</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>200</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG1394</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905555">NP_905555</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>165</p>
                        </c>
                        <c ca="left">
                           <p>Putative trans-membrane</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG1626</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905755">NP_905755</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>554</p>
                        </c>
                        <c ca="left">
                           <p>Putative hemin receptor</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>C. hutchinsonii</it>; also present in <it>C. phaeobacteriodes</it>.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG1679</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905797">NP_905797</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>464</p>
                        </c>
                        <c ca="left">
                           <p>Putative trans-membrane</p>
                        </c>
                        <c ca="left">
                           <p>Not found in <it>P. ruminicola </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG1850</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_905940">NP_905940</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>302</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>Bacteroides </it>species<sup>+</sup></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG2066</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_906128">NP_906128</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>351</p>
                        </c>
                        <c ca="left">
                           <p>Putative lipoprotein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. ruminicola </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>PG2092</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="NP_906153">NP_906153</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>419</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>Bacteroides </it>species</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF0296</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_097579">YP_097579</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>988</p>
                        </c>
                        <c ca="left">
                           <p>Outer membrane assembly protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis </it>and <it>S. ruber</it><sup>+</sup></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF0439</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_097722">YP_097722</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>565</p>
                        </c>
                        <c ca="left">
                           <p>Putative outer membrane protein probably involved in nutrient binding</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis</it>, and <it>Tenacibaculum</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF0534</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_097817">YP_097817</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>192</p>
                        </c>
                        <c ca="left">
                           <p>Putative acetyl-transferase</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis, C. atlanticus </it>and <it>S. ruber</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF0665</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_097947">YP_097947</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>531</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis</it>, and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF0751</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_098036">YP_098036</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>577</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis, P. torquis, R. biformata </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF1057</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_098341">YP_098341</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>506</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis</it>, and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF1254</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_098538">YP_098538</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>507</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis</it>, and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF1327</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_098610">YP_098610</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>514</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis</it>, and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF3185</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_100464">YP_100464</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>490</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis</it>, and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF3612</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_100889">YP_100889</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>542</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>BF4330</p>
                        </c>
                        <c ca="center">
                           <p>
                              <ext-link ext-link-type="gen" ext-link-id="YP_101602">YP_101602</ext-link>
                           </p>
                        </c>
                        <c ca="center">
                           <p>538</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>Missing in <it>P. gingivalis,, R. biformata </it>and <it>C. hutchinsonii</it></p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>For the proteins listed here, all significant blast hits are from various <it>Bacteroidetes </it>species, except as noted below. These proteins are present in all of the <it>Bacteroidetes </it>species listed in Table 1, except as noted in the comments.</p>
                     <p><sup>+</sup>For the protein PG0202, a significant hit is also observed from <it>C. phaeobacteroides</it>; For BF0296 significant hits also observed from <it>P. aestuarii </it>and <it>C. phaeobacteroides</it>.</p>
                     <p>Of the proteins listed here, the following proteins are homologous to each other: BF0751, BF1057, BF1327, BF3185; BF1254, BF3612, BF4330.</p>
                  </tblfn>
               </tbl>
               <p>We have also come identified a 3 aa deletion in a conserved region of ClpB protease that is present in all other <it>Bacteroidetes </it>species, except <it>S. ruber </it>(Additional file <supplr sid="S1">1</supplr>). Similar to the genes for the above 8 proteins, this deletion likely occurred in a common ancestor of the other <it>Bacteroidetes </it>species after the branching of <it>S. ruber</it>. Besides <it>Bacteroidetes </it>species, this indel is also present in the ClpB homologs from <it>C. phaebacteroidetes </it>(only <it>Chlorobi </it>species containing this protein) and the archaeum <it>Methanospirillum hungatei</it>, which is likely due to LGT. The remaining proteins in Table <tblr tid="T2">2</tblr>, of which 7 (BF0751, BF1057, BF1327, BF3185; BF1254, BF3612, BF4330) are homologous to each other, are missing in 1 or 2 sequenced species (e.g. <it>P. gingivalis, B. fragilis</it>, <it>C. hutchinsonii </it>or <it>S. ruber</it>) and their distribution pattern can also be explained by similar mechanisms as discussed above. Except for a few proteins that show limited similarity to conserved domains (CDs) found in other proteins <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, most of the proteins in Table <tblr tid="T2">2</tblr> are of unknown function.</p>
               <suppl id="S1">
                  <title>
                     <p>Additional File 1</p>
                  </title>
                  <text>
                     <p>A conserved indel (3 aa deletion) in ClpB protein that is mainly specific for the <it>Bacteroidales</it>, <it>Flavobacteriales </it>and <it>Flexibacteraceae </it>species. Partial sequence alignment of the ClpB protein containing this indel region is shown. The boxed region is missing in the <it>Bacteroidales</it>, <it>Flavobacteriales </it>and <it>Flexibacteraceae </it>species. Dashes in the alignment show identity with the amino acid on the top line. The ClpB homologs from <it>C. phaebacteroidetes </it>and <it>Methanospirillum hungatei </it>also lack the boxed region, which could be due to LGT. The beta and gamma proteobacteria contain a larger insert in this region, which has likely occurred independently.</p>
                  </text>
                  <file name="1471-2148-7-71-S1.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <p>These searches have also identified several proteins that at present appear unique for the species from the <it>Bacteroidales </it>and <it>Flavobacteriales </it>orders. These proteins are listed in Table <tblr tid="T3">3</tblr>. Of these, the first 4 proteins viz. PG0336, PG1302, PG1537, PG2030 are present in nearly all complete as well as partially sequenced species from the above two orders, but they are not found in <it>S. ruber </it>as well as <it>C. hutchinsonii</it>. The latter two species, which show much deeper branching than all other <it>Bacteroidales </it>species (Figs. <figr fid="F1">1</figr> and <figr fid="F2">2</figr>), are currently placed in the order <it>Sphingobacteriales</it>. The genes for the proteins listed in Table <tblr tid="T3">3</tblr> have thus likely originated in a common ancestor of the <it>Bacteroidales </it>and <it>Flavobacteriales </it>after the divergence of <it>Sphingobacteria</it>. Thirty-seven additional proteins in Table <tblr tid="T3">3</tblr> are also uniquely present in either all or many of the sequenced <it>Bacteroidales </it>species and a small number of flavobacteria species including <it>G. forsetii</it>. A large number of these proteins are only missing in <it>P. gingivalis</it>, which is likely due to gene loss. Of the proteins listed in Table <tblr tid="T3">3</tblr>, seven are indicated to be conjugative transposon proteins: TraJ, TraN, TraK, TraF, TraE and TraB (PG1251 and PG1479, PG1475, PG1478, PG1482, PG1483 and BF0127, respectively). Four of them are present in two clusters very close to each other (PG1478-PG1479, PG1482-PG1483), supporting their involvement in related functions <abbrgrp><abbr bid="B61">61</abbr><abbr bid="B62">62</abbr></abbrgrp>. The genes for these proteins have also likely evolved in a common ancestor of the <it>Bacteroidales </it>and <it>Flavobacteriales</it>, followed by gene losses in various species.</p>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Proteins that are Specific for the Bacteroidales and Flavobacteriales Orders</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0336 [<ext-link ext-link-type="gen" ext-link-id="NP_904650">NP_904650</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1276 [<ext-link ext-link-type="gen" ext-link-id="NP_905457">NP_905457</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>DNA-binding protein, histone-like family</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1302 [<ext-link ext-link-type="gen" ext-link-id="NP_905476">NP_905476</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1389 [<ext-link ext-link-type="gen" ext-link-id="NP_905551">NP_905551</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>DNA-binding protein, histone-like family</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1537 [<ext-link ext-link-type="gen" ext-link-id="NP_905677">NP_905677</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1444 [<ext-link ext-link-type="gen" ext-link-id="NP_905595">NP_905595</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1251 [<ext-link ext-link-type="gen" ext-link-id="NP_905435">NP_905435</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Conjugative transposon protein TraJ</p>
                        </c>
                        <c ca="left">
                           <p>PG1475 [<ext-link ext-link-type="gen" ext-link-id="NP_905621">NP_905621</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Conjugative transposon protein TraN</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG2030 [<ext-link ext-link-type="gen" ext-link-id="NP_906097">NP_906097</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1478 [<ext-link ext-link-type="gen" ext-link-id="NP_905624">NP_905624</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Conjugative transposon protein TraK</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1492 [<ext-link ext-link-type="gen" ext-link-id="NP_905638">NP_905638</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1479 [<ext-link ext-link-type="gen" ext-link-id="NP_905625">NP_905625</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Conjugative transposon protein TraJ</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0302 [<ext-link ext-link-type="gen" ext-link-id="NP_904618">NP_904618</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1482 [<ext-link ext-link-type="gen" ext-link-id="NP_905628">NP_905628</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Conjugative transposon protein TraF</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0330 [<ext-link ext-link-type="gen" ext-link-id="NP_904645">NP_904645</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>DNA-binding protein, histone-like family; smart00411, BHL</p>
                        </c>
                        <c ca="left">
                           <p>PG1483 [<ext-link ext-link-type="gen" ext-link-id="NP_905629">NP_905629</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Conjugative transposon protein TraE</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0555 [<ext-link ext-link-type="gen" ext-link-id="NP_904845">NP_904845</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>DNA-binding protein, histone-like family</p>
                        </c>
                        <c ca="left">
                           <p>PG1488 [<ext-link ext-link-type="gen" ext-link-id="NP_905634">NP_905634</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0829 [<ext-link ext-link-type="gen" ext-link-id="NP_905084">NP_905084</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1494 [<ext-link ext-link-type="gen" ext-link-id="NP_905640">NP_905640</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0870 [<ext-link ext-link-type="gen" ext-link-id="NP_905118">NP_905118</ext-link>]*</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2040 [<ext-link ext-link-type="gen" ext-link-id="NP_906106">NP_906106</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>DNA-binding protein, histone-like family</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0875 [<ext-link ext-link-type="gen" ext-link-id="NP_905123">NP_905123</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>TnpA; DNA replication, recombination &amp; repair, COG2452</p>
                        </c>
                        <c ca="left">
                           <p>PG2127 [<ext-link ext-link-type="gen" ext-link-id="NP_906183">NP_906183</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1206 [<ext-link ext-link-type="gen" ext-link-id="NP_905397">NP_905397</ext-link>]<sup>c</sup></p>
                        </c>
                        <c ca="left">
                           <p>Mobilizable transposon, tnpC protein</p>
                        </c>
                        <c ca="left">
                           <p>BF1773 [<ext-link ext-link-type="gen" ext-link-id="YP_099054">YP_099054</ext-link>]*</p>
                        </c>
                        <c ca="left">
                           <p>Probable truncated integrase; cd01185, INT_Tn4399</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c cspan="4" ca="center">
                           <p>
                              <b>Missing in <it>Porphyromonas gingivalis </it>W83</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF0127 [<ext-link ext-link-type="gen" ext-link-id="YP_097410">YP_097410</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>TraB</p>
                        </c>
                        <c ca="left">
                           <p>BF1727 [<ext-link ext-link-type="gen" ext-link-id="YP_099008">YP_099008</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Putative outer membrane protein maybe involved in nutrient binding</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF0136 [<ext-link ext-link-type="gen" ext-link-id="YP_097419">YP_097419</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Tetracycline resistance element mobilization: RteC</p>
                        </c>
                        <c ca="left">
                           <p>BF1860 [<ext-link ext-link-type="gen" ext-link-id="YP_099142">YP_099142</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF0146 [<ext-link ext-link-type="gen" ext-link-id="YP_097429">YP_097429</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>BF1926 [<ext-link ext-link-type="gen" ext-link-id="YP_211559">YP_211559</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF0319 [<ext-link ext-link-type="gen" ext-link-id="YP_097602">YP_097602</ext-link>]*</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>BF2130 [<ext-link ext-link-type="gen" ext-link-id="YP_099411">YP_099411</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF0342 [<ext-link ext-link-type="gen" ext-link-id="YP_097625">YP_097625</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>BF2214 [<ext-link ext-link-type="gen" ext-link-id="YP_099495">YP_099495</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF1067 [<ext-link ext-link-type="gen" ext-link-id="YP_098351">YP_098351</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>BF3164 [<ext-link ext-link-type="gen" ext-link-id="YP_100443">YP_100443</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Putative lipoprotein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF1422 [<ext-link ext-link-type="gen" ext-link-id="YP_098707">YP_098707</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>BF4258 [<ext-link ext-link-type="gen" ext-link-id="YP_101533">YP_101533</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>BF1567 [<ext-link ext-link-type="gen" ext-link-id="YP_098851">YP_098851</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The first five proteins in this table (viz. PG0336, PG1302, PG1537, PG1626, PG2030) are present in all <it>Bacteroidales </it>and <it>Flavobacteriales </it>species listed in Table 1; The other proteins are present in many of the <it>Bacteroidales </it>and <it>Flavobacteriales </it>species.</p>
                     <p>*Also present in <it>C. phaeobacteroides</it>.</p>
                     <p>The following proteins are homologous to each other: PG0330, PG0555, PG1276, PG1389, PG2040; PG0829, PG1444, PG1488; PG1251, PG1479; BF1422, BF1860; BF1926, BF4258.</p>
                  </tblfn>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>Proteins that are specific for the order Bacteroidales or the genus Bacteroides</p>
               </st>
               <p>There are 4 genomes for the <it>Bacteroidales </it>species (<it>P. gingivalis </it>W83, <it>B. thetaiotaomicron </it>VPI-5482 and <it>B. fragilis </it>strains: NCTC 9343 and YCH46) that have been fully sequenced. Additionally, sequence information for a large number of genes/proteins from <it>Prevotella intermedia </it>17 and <it>Prevotella ruminicola </it>23, which belong to this order is available in the in NCBI database (see Table <tblr tid="T1">1</tblr>). Our blast searches on proteins from <it>P. gingivalis </it>genome have identified 52 proteins that are uniquely shared by either all or most of the sequenced <it>Bacteriodales </it>species and whose homologs are not found in any other species, except where noted (Table <tblr tid="T4">4</tblr>). Thirty-nine of these 52 proteins are uniquely found in all 4 fully sequenced <it>Bacteroidales </it>species. These species also form a strongly supported clade in phylogenetic trees (Figs. <figr fid="F1">1</figr> and <figr fid="F2">2</figr>). Thus, it is likely that the genes for these proteins evolved in a common ancestor of this order. The remaining 13 proteins are lacking in at least one of the <it>Bacteroides </it>species (noted in Table <tblr tid="T4">4</tblr>), which is likely due to gene loss. In addition to the sequenced <it>Bacteroidales </it>species, high scoring homologs for many of the above proteins were also found in the two <it>Prevotella </it>species. These latter homologs were detected via genomic blasts against partially sequenced genomes from these species (see Methods).</p>
               <tbl id="T4">
                  <title>
                     <p>Table 4</p>
                  </title>
                  <caption>
                     <p>Proteins Unique to the Bacteroidales Order</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0018 [<ext-link ext-link-type="gen" ext-link-id="NP_904375">NP_904375</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1133 [<ext-link ext-link-type="gen" ext-link-id="NP_905341">NP_905341</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0082 [<ext-link ext-link-type="gen" ext-link-id="NP_904431">NP_904431</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1139 [<ext-link ext-link-type="gen" ext-link-id="NP_905347">NP_905347</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein; cd02966, Tlp_A_like_family</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0125 [<ext-link ext-link-type="gen" ext-link-id="NP_904468">NP_904468</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1214 [<ext-link ext-link-type="gen" ext-link-id="NP_905405">NP_905405</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0179 [<ext-link ext-link-type="gen" ext-link-id="NP_904515">NP_904515</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1301 [<ext-link ext-link-type="gen" ext-link-id="NP_905475">NP_905475</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0188 [<ext-link ext-link-type="gen" ext-link-id="NP_904523">NP_904523</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>lipoprotein, putative</p>
                        </c>
                        <c ca="left">
                           <p>PG1333 [<ext-link ext-link-type="gen" ext-link-id="NP_905502">NP_905502</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative exported protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0216 [<ext-link ext-link-type="gen" ext-link-id="NP_904548">NP_904548</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical exported protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1352 [<ext-link ext-link-type="gen" ext-link-id="NP_905517">NP_905517</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>putative conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0217 [<ext-link ext-link-type="gen" ext-link-id="NP_904549">NP_904549</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>cons. hypothetical exported protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1388 [<ext-link ext-link-type="gen" ext-link-id="NP_905550">NP_905550</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0218 [<ext-link ext-link-type="gen" ext-link-id="NP_904550">NP_904550</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical exported protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1441 [<ext-link ext-link-type="gen" ext-link-id="NP_905593">NP_905593</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>lysozyme-related protein; cd00737, endolysin_autolysin; COG3772, phage-related lysozyme</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0246 [<ext-link ext-link-type="gen" ext-link-id="NP_904573">NP_904573</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative DNA-binding protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1442 [<ext-link ext-link-type="gen" ext-link-id="NP_905594">NP_905594</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>TraB</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0312 [<ext-link ext-link-type="gen" ext-link-id="NP_904628">NP_904628</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative transmembrane protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1458 [<ext-link ext-link-type="gen" ext-link-id="NP_905606">NP_905606</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0326 [<ext-link ext-link-type="gen" ext-link-id="NP_904641">NP_904641</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypoth; COG3637, Opacity protein &amp; related surface antigens</p>
                        </c>
                        <c ca="left">
                           <p>PG1473 [<ext-link ext-link-type="gen" ext-link-id="NP_905619">NP_905619</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conjugative transposon protein TraQ</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0366 [<ext-link ext-link-type="gen" ext-link-id="NP_904677">NP_904677</ext-link>]<sup>+#</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1621 [<ext-link ext-link-type="gen" ext-link-id="NP_905750">NP_905750</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical exported protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0434 [<ext-link ext-link-type="gen" ext-link-id="NP_904735">NP_904735</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative transmembrane protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1757 [<ext-link ext-link-type="gen" ext-link-id="NP_905859">NP_905859</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0541 [<ext-link ext-link-type="gen" ext-link-id="NP_904834">NP_904834</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1881 [<ext-link ext-link-type="gen" ext-link-id="NP_905968">NP_905968</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative lipoprotein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0574 [<ext-link ext-link-type="gen" ext-link-id="NP_904862">NP_904862</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG1889 [<ext-link ext-link-type="gen" ext-link-id="NP_905974">NP_905974</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0717 [<ext-link ext-link-type="gen" ext-link-id="NP_904988">NP_904988</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>lipoprotein, putative</p>
                        </c>
                        <c ca="left">
                           <p>PG1945 [<ext-link ext-link-type="gen" ext-link-id="NP_906027">NP_906027</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0781 [<ext-link ext-link-type="gen" ext-link-id="NP_905043">NP_905043</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative membrane protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2006 [<ext-link ext-link-type="gen" ext-link-id="NP_906077">NP_906077</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical membrane protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0816 [<ext-link ext-link-type="gen" ext-link-id="NP_905074">NP_905074</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2079 [<ext-link ext-link-type="gen" ext-link-id="NP_906141">NP_906141</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0831 [<ext-link ext-link-type="gen" ext-link-id="NP_905085">NP_905085</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Cons. protein maybe related to TraB</p>
                        </c>
                        <c ca="left">
                           <p>PG2083 [<ext-link ext-link-type="gen" ext-link-id="NP_906145">NP_906145</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0843 [<ext-link ext-link-type="gen" ext-link-id="NP_905095">NP_905095</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2116 [<ext-link ext-link-type="gen" ext-link-id="NP_906174">NP_906174</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>transposase</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0851 [<ext-link ext-link-type="gen" ext-link-id="NP_905101">NP_905101</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Toprim domain protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2130 [<ext-link ext-link-type="gen" ext-link-id="NP_906186">NP_906186</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>FimX</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0910 [<ext-link ext-link-type="gen" ext-link-id="NP_905150">NP_905150</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>FHA domain protein, cd00060</p>
                        </c>
                        <c ca="left">
                           <p>PG2131 [<ext-link ext-link-type="gen" ext-link-id="NP_906187">NP_906187</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>60 kDa protein; OmpA, COG2885.2</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0937 [<ext-link ext-link-type="gen" ext-link-id="NP_905172">NP_905172</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>putative exported protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2133 [<ext-link ext-link-type="gen" ext-link-id="NP_906189">NP_906189</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>lipoprotein, putative</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG0961 [<ext-link ext-link-type="gen" ext-link-id="NP_905192">NP_905192</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2149 [<ext-link ext-link-type="gen" ext-link-id="NP_906203">NP_906203</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>putative conserved exported protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1050 [<ext-link ext-link-type="gen" ext-link-id="NP_905267">NP_905267</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>putative lipoprotein</p>
                        </c>
                        <c ca="left">
                           <p>PG2168 [<ext-link ext-link-type="gen" ext-link-id="NP_906219">NP_906219</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>FimX</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>PG1125 [<ext-link ext-link-type="gen" ext-link-id="NP_905334">NP_905334</ext-link>]<sup>+</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>PG2224 [<ext-link ext-link-type="gen" ext-link-id="NP_906265">NP_906265</ext-link>]<sup>x</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>All significant hits for these proteins are observed from the following <it>Bacteroidales </it>species, for which sequence information is available in the NCBI or TIGR microbial sequence database. <it>P. gingivalis </it>W83, <it>B. fragilis </it>NCTC 9343, <it>B. fragilis </it>YCH46 , <it>B. thetaiotaomicron </it>VPI-5482, <it>Prevotella intermedia </it>17, <it>P. ruminicola </it>23. Unless noted below, these proteins are present in all of the above-mentioned species. Of these proteins, the following are homologous to each other: PG0179, PG2133; PG0217, PG0218; PG0816, PG1458; PG0831, PG1442; PG2130, PG2168.</p>
                     <p><sup>+</sup>Not found at present in either 1 or both <it>Prevotella </it>species.</p>
                     <p><sup>x</sup>Missing in one of the <it>Bacteroides </it>species as well as <it>Prevotella </it>species.</p>
                     <p><sup>#</sup>Also present was <it>C. phaeobacteroides </it>BS1.</p>
                  </tblfn>
               </tbl>
               <p>The majority of the <it>Bacteroidales</it>-specific proteins are hypothetical and some of them are indicated as putative exported proteins (Table <tblr tid="T4">4</tblr>). Some interesting proteins in this list include the FimX proteins (PG2130, PG2168) that are involved in fimbriae production, which is necessary for adhesion to host surfaces <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Also of interest is the conserved hypothetical protein, PG1139, which shows slight but significant similarity to the conserved domain in the TlpA-like family, responsible for cytochrome maturation. One of the proteins PG0366 also had a significant hit from <it>C. phaeobacteroides</it>, which could be due to LGT <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. There are seven proteins in Table <tblr tid="T4">4</tblr> (PG0216-PG0218, PG1441-PG1442, PG2130-PG2131) that are present in clusters of two or three, suggesting that they could form functional units. Further, a number of proteins in this table (viz. PG0179, PG2133; PG0217, PG0218; PG0816, PG1458; PG0831, PG1442; PG2130, PG2168) are homologous to each other, indicating that they resulted from gene duplication events.</p>
               <p>The <it>Bacteroides </it>genus contains three fully sequenced genomes corresponding to <it>B. thetaiotaomicron </it>VPI-5482 and <it>B. fragilis </it>strains NCTC 9343 and YCH46. The blast searches on <it>B. fragilis </it>YCH46 genome have identified 185 proteins that are mainly specific for these species (Additional file <supplr sid="S2">2</supplr>) and their homologs are not found in <it>P. gingivalis</it>. For 10 of these proteins, significant similarity was also observed for at least one of the two <it>Prevotella </it>species, suggesting that within the order <it>Bacteriodales</it>, species from the <it>Bacteroides </it>and <it>Prevotella </it>genera may be more closely related to each other in comparison to <it>P. gingivalis</it>. Most of these proteins are of unknown functions, however, some have been annotated as transmembrane or lipoproteins. Thirty-nine of these proteins are present in clusters of two to four in the genome indicating that they could be involved in related functions. A number of proteins in this table are homologous to each other indicating that they have likely resulted from gene duplication events.</p>
               <suppl id="S2">
                  <title>
                     <p>Additional File 2</p>
                  </title>
                  <text>
                     <p>Proteins that are specific for the Bacteroides Genus. All significant hits for these proteins are from the following sequenced <it>Bacteroides </it>species unless otherwise indicated: <it>B. thetaiotaomicron </it>VPI-5482, <it>B. fragilis </it>NCTC 9343 and YCH46.</p>
                  </text>
                  <file name="1471-2148-7-71-S2.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>Proteins that are specific for the order Flavobacteriales</p>
               </st>
               <p>The complete genome of the first flavobacteria species viz.<it>G. forsetii KT0803 </it>became available very recently <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. However, sequence information for a large number of other <it>Flavobacteriales </it>species, whose genomes are being sequenced (see Table <tblr tid="T1">1</tblr>), is available in the NCBI database. Our blastp and PSI-blast searches on different ORFs in the <it>G. forsetii </it>genome have identified 38 proteins that are uniquely present in virtually all of the <it>Flavobacteriales </it>species (Table <tblr tid="T5">5</tblr>). Twenty-six of these proteins are present in all <it>Flavobacteriales </it>species listed in Table <tblr tid="T1">1</tblr>, whereas the remaining 12 are missing in only one of the species. An additional group of 146 proteins are also specific for the <it>Flavobacteriales</it>, but they are missing in some flavobacteria species or limited to only a small numbers of flavobacteria (Additional file <supplr sid="S3">3</supplr>). Because the genomes for most of the <it>Flavobacteriales </it>species are not complete at the present time, these proteins were not separated into different groups.</p>
               <tbl id="T5">
                  <title>
                     <p>Table 5</p>
                  </title>
                  <caption>
                     <p>Proteins that are Specific for Species from the Flavobacteriales Order</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>orf No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf89 [<ext-link ext-link-type="gen" ext-link-id="CAL65078">CAL65078</ext-link>]<sup>b</sup></p>
                        </c>
                        <c ca="left">
                           <p>membrane protein</p>
                        </c>
                        <c ca="left">
                           <p>orf1826 [<ext-link ext-link-type="gen" ext-link-id="CAL66812">CAL66812</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf92 [<ext-link ext-link-type="gen" ext-link-id="CAL65081">CAL65081</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf1872 [<ext-link ext-link-type="gen" ext-link-id="CAL66858">CAL66858</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf107 [<ext-link ext-link-type="gen" ext-link-id="CAL65096">CAL65096</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>membrane protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2280 [<ext-link ext-link-type="gen" ext-link-id="CAL67264">CAL67264</ext-link>]<sup>c</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf110 [<ext-link ext-link-type="gen" ext-link-id="CAL65099">CAL65099</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2667 [<ext-link ext-link-type="gen" ext-link-id="CAL67651">CAL67651</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf191 [<ext-link ext-link-type="gen" ext-link-id="CAL65180">CAL65180</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>membrane or secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2698 [<ext-link ext-link-type="gen" ext-link-id="CAL67682">CAL67682</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf403 [<ext-link ext-link-type="gen" ext-link-id="CAL65392">CAL65392</ext-link>]<sup>b</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2700 [<ext-link ext-link-type="gen" ext-link-id="CAL67684">CAL67684</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf509 [<ext-link ext-link-type="gen" ext-link-id="CAL65498">CAL65498</ext-link>]<sup>c</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2718 [<ext-link ext-link-type="gen" ext-link-id="CAL67702">CAL67702</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf612 [<ext-link ext-link-type="gen" ext-link-id="CAL65599">CAL65599</ext-link>]<sup>b</sup></p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2731 [<ext-link ext-link-type="gen" ext-link-id="CAL67715">CAL67715</ext-link>]<sup>b</sup></p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf983 [<ext-link ext-link-type="gen" ext-link-id="CAL65970">CAL65970</ext-link>]<sup>a</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2756 [<ext-link ext-link-type="gen" ext-link-id="CAL67740">CAL67740</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf995 [<ext-link ext-link-type="gen" ext-link-id="CAL65982">CAL65982</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>phospholipid/glycerol acyltransferase; smart00563, PlsC</p>
                        </c>
                        <c ca="left">
                           <p>orf2825 [<ext-link ext-link-type="gen" ext-link-id="CAL67809">CAL67809</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf998 [<ext-link ext-link-type="gen" ext-link-id="CAL65985">CAL65985</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>membrane protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2844 [<ext-link ext-link-type="gen" ext-link-id="CAL67828">CAL67828</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>membrane protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1059 [<ext-link ext-link-type="gen" ext-link-id="CAL66046">CAL66046</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>membrane protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2917 [<ext-link ext-link-type="gen" ext-link-id="CAL67899">CAL67899</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1078 [<ext-link ext-link-type="gen" ext-link-id="CAL66065">CAL66065</ext-link>]<sup>a</sup></p>
                        </c>
                        <c ca="left">
                           <p>membrane or secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf2939 [<ext-link ext-link-type="gen" ext-link-id="CAL67921">CAL67921</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical; COG1577, ERG12</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1453 [<ext-link ext-link-type="gen" ext-link-id="CAL66440">CAL66440</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf3043 [<ext-link ext-link-type="gen" ext-link-id="CAL68025">CAL68025</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1469 [<ext-link ext-link-type="gen" ext-link-id="CAL66456">CAL66456</ext-link>]<sup>a</sup></p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf3076 [<ext-link ext-link-type="gen" ext-link-id="CAL68058">CAL68058</ext-link>]<sup>e</sup></p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1555 [<ext-link ext-link-type="gen" ext-link-id="CAL66542">CAL66542</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf3240 [<ext-link ext-link-type="gen" ext-link-id="CAL68223">CAL68223</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>membrane protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1618 [<ext-link ext-link-type="gen" ext-link-id="CAL66605">CAL66605</ext-link>]<sup>d</sup></p>
                        </c>
                        <c ca="left">
                           <p>secreted protein</p>
                        </c>
                        <c ca="left">
                           <p>orf3266 [<ext-link ext-link-type="gen" ext-link-id="CAL68249">CAL68249</ext-link>]<sup>a</sup></p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1766 [<ext-link ext-link-type="gen" ext-link-id="CAL66752">CAL66752</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>orf3313 [<ext-link ext-link-type="gen" ext-link-id="CAL68296">CAL68296</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>orf1776 [<ext-link ext-link-type="gen" ext-link-id="CAL66762">CAL66762</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>orf3501 [<ext-link ext-link-type="gen" ext-link-id="CAL68484">CAL68484</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>conserved hypothetical protein</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Unless otherwise indicated, all significant hits for the proteins listed in this table are from the following <it>Flavobacteriales </it>species, for which sequence information is presently available. <it>G. forestii KT0803, F. bacterium BBFL7, F. bacterium HTCC2170, F. johnsoniae UW101, Flavobacterium sp. MED217, Cellulophaga sp. MED134, C. atlanticus HTCC2559, P. irgensii 23-P, P. torquis ATCC 700755, R. biformata HTCC2501, Tenacibaculum sp. MED152</it>. The proteins are present in all of these species except as noted below:</p>
                     <p><sup>a</sup>Missing in <it>Polaribacter irgensii </it>23-P</p>
                     <p><sup>b</sup>Missing in <it>Flavobacterium johnsoniae </it>UW101</p>
                     <p><sup>c</sup>Missing in <it>Flavobacteria bacterium </it>BBFL7</p>
                     <p><sup>d</sup>Missing in <it>Psychroflexus torquis </it>ATCC 700755</p>
                     <p><sup>e</sup>Missing in <it>Robiginitalea biformata </it>HTCC2501</p>
                  </tblfn>
               </tbl>
               <suppl id="S3">
                  <title>
                     <p>Additional file 3</p>
                  </title>
                  <text>
                     <p>Proteins specific for Flavobacteria that are missing in several species. All significant hits for these proteins are also from <it>Flavobacteriale</it>s species. However, unlike the proteins listed in Table <tblr tid="T5">5</tblr>, these proteins are either present in only a small number of Flavobacteria or are missing from many species.</p>
                  </text>
                  <file name="1471-2148-7-71-S3.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
            <sec>
               <st>
                  <p>Proteins that are unique for the Chlorobi Phylum</p>
               </st>
               <p>The genomes of three <it>Chlorobi </it>species viz. <it>C. luteolum </it>DSM 273,<it>C. tepidum </it>TLS, and <it>C. chlorochromatii </it>CaD3, have been fully sequenced. Our blast analyses on the <it>C. tepidum </it>and <it>C. luteolum </it>genomes have identified 51 proteins that are uniquely shared by species from this phylum (Table <tblr tid="T6">6</tblr>). In addition to the 3 completely sequenced genomes, homologs of these proteins are also present in six others <it>Chlorobi </it>species (see Table <tblr tid="T1">1</tblr>), for which sequence information is available in the NCBI database. The genes for these proteins likely originated in a common ancestor of various <it>Chlorobi </it>species, which form a distinct, strongly supported, clade in phylogenetic trees (see Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr>). The vast majority of these proteins are of hypothetical or unknown functions. However, 5 of them are indicated to be involved in functions related to photosynthesis. Of these, Plut_0264 and Plut_0265 are clustered in the genome and they correspond to chlorosome envelope proteins C and A, respectively. The protein Plut_1500, which is indicated as bacteriochlorophyll A protein, corresponds to the FMO protein that is involved in the attachment of chlorosomes to the cytoplasmic membrane <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The other two photosynthesis-related proteins, Plut_0620 and Plut_1628, are annotated as photosystem P840 reaction centre protein PscD and the photosystem P840 reaction centre cytochrome c-551, respectively <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B32">32</abbr></abbrgrp>. Three additional chlorobi-specific proteins, Plut_1714-Plut_1716, are clustered together in the genome indicating that they may form a functional unit <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. There are 65 additional proteins that are also specific for the <it>Chlorobi </it>species (Additional file <supplr sid="S4">4</supplr>). However, unlike the proteins in Table <tblr tid="T6">6</tblr>, these proteins are missing in a number of the <it>Chlorobi </it>species and their species distribution does not show any clear pattern and these could involve gene loss or LGTs <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. However, the first 8 proteins listed in this additional file (viz. Plut_0107, Plut_0759, Plut_0762, Plut_0981, Plut_0985, Plut_1092, Plut_1145, Plut_1858) are only found in <it>C. luteolum </it>and <it>C. phaeovibrioides</it>. These two species form a strongly supported clade in various phylogenetic trees (Figs. <figr fid="F1">1</figr> and <figr fid="F2">2</figr>) and a specific relationship between them is further supported by the unique presence of these shared proteins. For one of the proteins in this table, Plut_1345, a significant hit is also observed from <it>C. hutchinsonii</it>, which could be due to LGT <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp>.</p>
               <tbl id="T6">
                  <title>
                     <p>Table 6</p>
                  </title>
                  <caption>
                     <p>Proteins that are Specific for the <it>Chlorobi </it>Species</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Genome ID No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>Possible/Predicted Function</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0059 [<ext-link ext-link-type="gen" ext-link-id="YP_373992">YP_373992</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1225 [<ext-link ext-link-type="gen" ext-link-id="YP_375130">YP_375130</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0074 [<ext-link ext-link-type="gen" ext-link-id="YP_374007">YP_374007</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>orfCR</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1238 [<ext-link ext-link-type="gen" ext-link-id="YP_375143">YP_375143</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0111 [<ext-link ext-link-type="gen" ext-link-id="YP_374044">YP_374044</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1332 [<ext-link ext-link-type="gen" ext-link-id="YP_375234">YP_375234</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>TPR repeat</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0145 [<ext-link ext-link-type="gen" ext-link-id="YP_374078">YP_374078</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1409 [<ext-link ext-link-type="gen" ext-link-id="YP_375311">YP_375311</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0160 [<ext-link ext-link-type="gen" ext-link-id="YP_374093">YP_374093</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1465 [<ext-link ext-link-type="gen" ext-link-id="YP_375367">YP_375367</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0264 [<ext-link ext-link-type="gen" ext-link-id="YP_374195">YP_374195</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>chlorosome envelope protein C</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1469 [<ext-link ext-link-type="gen" ext-link-id="YP_375371">YP_375371</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0265 [<ext-link ext-link-type="gen" ext-link-id="YP_374196">YP_374196</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>chlorosome envelope protein A; Bac_chlorC, pfam02043</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1491 [<ext-link ext-link-type="gen" ext-link-id="YP_375393">YP_375393</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0278 [<ext-link ext-link-type="gen" ext-link-id="YP_374209">YP_374209</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1500 [<ext-link ext-link-type="gen" ext-link-id="YP_375402">YP_375402</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>FMO, BchlA protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0282 [<ext-link ext-link-type="gen" ext-link-id="YP_374213">YP_374213</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1517 [<ext-link ext-link-type="gen" ext-link-id="YP_375417">YP_375417</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0295 [<ext-link ext-link-type="gen" ext-link-id="YP_374226">YP_374226</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1608 [<ext-link ext-link-type="gen" ext-link-id="YP_375505">YP_375505</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Srm</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0325 [<ext-link ext-link-type="gen" ext-link-id="YP_374256">YP_374256</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1625 [<ext-link ext-link-type="gen" ext-link-id="YP_375522">YP_375522</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0409 [<ext-link ext-link-type="gen" ext-link-id="YP_374340">YP_374340</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1628 [<ext-link ext-link-type="gen" ext-link-id="YP_375525">YP_375525</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Photosystem P840 reaction center cytochrome c-551</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0422 [<ext-link ext-link-type="gen" ext-link-id="YP_374353">YP_374353</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1682 [<ext-link ext-link-type="gen" ext-link-id="YP_375579">YP_375579</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0489 [<ext-link ext-link-type="gen" ext-link-id="YP_374420">YP_374420</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1714 [<ext-link ext-link-type="gen" ext-link-id="YP_375611">YP_375611</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0499 [<ext-link ext-link-type="gen" ext-link-id="YP_374430">YP_374430</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1715 [<ext-link ext-link-type="gen" ext-link-id="YP_375612">YP_375612</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0540 [<ext-link ext-link-type="gen" ext-link-id="YP_374467">YP_374467</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1716 [<ext-link ext-link-type="gen" ext-link-id="YP_375613">YP_375613</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0572 [<ext-link ext-link-type="gen" ext-link-id="YP_374498">YP_374498</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1725 [<ext-link ext-link-type="gen" ext-link-id="YP_375622">YP_375622</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0620 [<ext-link ext-link-type="gen" ext-link-id="YP_374546">YP_374546</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Photosystem P840 reaction center protein PscD</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1742 [<ext-link ext-link-type="gen" ext-link-id="YP_375639">YP_375639</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0666 [<ext-link ext-link-type="gen" ext-link-id="YP_374587">YP_374587</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1743 [<ext-link ext-link-type="gen" ext-link-id="YP_375640">YP_375640</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0713 [<ext-link ext-link-type="gen" ext-link-id="YP_374634">YP_374634</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1746 [<ext-link ext-link-type="gen" ext-link-id="YP_375643">YP_375643</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0779 [<ext-link ext-link-type="gen" ext-link-id="YP_374695">YP_374695</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_1933 [<ext-link ext-link-type="gen" ext-link-id="YP_375818">YP_375818</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_0950 [<ext-link ext-link-type="gen" ext-link-id="YP_374855">YP_374855</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_2003 [<ext-link ext-link-type="gen" ext-link-id="YP_375888">YP_375888</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>orfCR</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_1012 [<ext-link ext-link-type="gen" ext-link-id="YP_374917">YP_374917</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_2041 [<ext-link ext-link-type="gen" ext-link-id="YP_375926">YP_375926</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_1195 [<ext-link ext-link-type="gen" ext-link-id="YP_375100">YP_375100</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_2100 [<ext-link ext-link-type="gen" ext-link-id="YP_375985">YP_375985</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_1217 [<ext-link ext-link-type="gen" ext-link-id="YP_375122">YP_375122</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c ca="left">
                           <p>Plut_2117 [<ext-link ext-link-type="gen" ext-link-id="YP_376002">YP_376002</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Plut_1223 [<ext-link ext-link-type="gen" ext-link-id="YP_375128">YP_375128</ext-link>]</p>
                        </c>
                        <c ca="left">
                           <p>Hypothetical protein</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>All significant Blast hits for these proteins are observed only from the following <it>Chlorobi </it>species for which sequence information is presently available: <it>C. luteolum </it>DSM 273, <it>C. tepidum </it>TLS, <it>C. chlorochromatii </it>CaD3, <it>C. limicola </it>DSM 245, <it>C. phaeobacteroides </it>BS1, <it>C. phaeobacteroides </it>DSM 266, <it>C. clathratiforme </it>BU-1, <it>P. aestuarii </it>DSM 271 and <it>C. phaeovibrioides </it>DSM 265. Of these proteins, the following proteins are homologous to each other:Plut_0059, Plut_1491; Plut_0074, Plut_2003.</p>
                  </tblfn>
               </tbl>
               <suppl id="S4">
                  <title>
                     <p>Additional file 4</p>
                  </title>
                  <text>
                     <p>Chlorobi-specific proteins that are missing in some species. All significant hits for these proteins are also from <it>Chlorobi </it>species. However, unlike the proteins listed in Table <tblr tid="T6">6</tblr>, these proteins are not present in all <it>Chlorobi </it>species.</p>
                  </text>
                  <file name="1471-2148-7-71-S4.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <p>In addition to the proteins that are uniquely found in various <it>Chlorobi </it>species, we have also identified two large conserved inserts in two widely distributed proteins that are distinctive characteristics of the <it>Chlorobi </it>phylum. The first of these signatures is a 28 aa insert in the DNA polymerase III alpha subunit encoded by the <it>dnaE </it>gene that is required for chromosomal replication in bacteria <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. The large insert in DnaE is present in all of the <it>Chlorobi </it>homologs but it is not found in any other species (Fig. <figr fid="F3">3</figr>). A smaller insert of 3&#8211;4 aa is probably also present in this regions in some <it>Bacteroidales </it>species, but based on their different sizes and sequence characteristics, these inserts are of independent origin. The other <it>Chlorobi</it>-specific signature consists of a 12&#8211;14 aa insert in alanyl-tRNA synthetase (Fig. <figr fid="F4">4</figr>), which plays an essential role in protein synthesis. This insert is again present in all <it>Chlorobi </it>homologs but not found in any other species indicating that it provides a reliable molecular marker for this group.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Partial sequence alignments of the DnaE protein showing a large insert of about 28 aa that is uniquely present in <it>Chlorobi </it>homologs</p>
                  </caption>
                  <text>
                     <p>Partial sequence alignments of the DnaE protein showing a large insert of about 28 aa that is uniquely present in <it>Chlorobi </it>homologs. The dashes (-) denote identity with the amino acid on the top line. Except for the <it>Chlorobi </it>species, this insert is not found in any other organism. Sequence information for only representative species from other groups of bacteria is shown. Abbreviations in the species names are: <it>Bac., Bacteroides; Cb., Chlorobaculum; Chl., Chlorobium; Chrom., Chromobacterium; Clo., Clostridium; Pro., Prosthecochloris; Sym., Symbiobacterium; Flavo., Flavobacterium; Strep., Streptococcus</it>.</p>
                  </text>
                  <graphic file="1471-2148-7-71-3"/>
               </fig>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>Partial sequence alignments of alanyl-tRNA synthetase showing a conserved insert of about 12&#8211;14 aa that is a distinctive characteristic of <it>Chlorobi </it>homologs and not found in other bacteria</p>
                  </caption>
                  <text>
                     <p>Partial sequence alignments of alanyl-tRNA synthetase showing a conserved insert of about 12&#8211;14 aa that is a distinctive characteristic of <it>Chlorobi </it>homologs and not found in other bacteria. The dashes (-) denote identity with the amino acid on the top line. Additional abbreviations: <it>Geo., Geobacter; Sta., Staphylococcus; Thermo., Thermoanaerobacter</it>.</p>
                  </text>
                  <graphic file="1471-2148-7-71-4"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Proteins that are uniquely shared by the Bacteroidetes and Chlorobi species</p>
               </st>
               <p>The <it>Bacteroidetes </it>and <it>Chlorobi </it>species generally branch very close to each other in phylogenetic trees <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. However, there are very few characteristics known that are uniquely shared by species from these two phyla. Our analysis has identified 3 proteins (PG0081, PG0649 and PG2432 in Table <tblr tid="T7">7</tblr>) that are uniquely found in virtually all of the fully as well as partially sequenced <it>Bacteroidetes </it>and <it>Chlorobi </it>genomes. These results are significant because <it>Bacteroidetes </it>or <it>Chlorobi </it>species do not share any protein in common with different species from any other group of bacteria. Of these proteins, the protein PG0081 is also found in <it>Fibrobacter succinogenes</it>. A close and specific relationship of <it>F. succinogenes </it>to the <it>Bacteroidetes </it>and <it>Chlorobi </it>groups was strongly suggested by our earlier work based on conserved indels in different proteins <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. This inference is reinforced by the unique presence of this protein in these different groups. The absence of the protein PG0649, which is present in all other <it>Bacteroidetes </it>and <it>Chlorobi </it>species, in <it>S. ruber</it>, is probably due to gene loss. Three other proteins, PG1818, PG1977 and BF2465 although they appear unique to the <it>Bacteriodales </it>and <it>Chlorobi</it>, their homologs are not detected in most <it>Flavobacteria </it>including <it>G. forsetii</it>. It is likely that the genes for these proteins also evolved in a common ancestor of the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla followed by gene losses in particular <it>Bacteroidetes </it>lineages. All of these proteins are of unknown functions except PG1818, which is annotated as a putative transmembrane protein with significant similarity to the conserved domain of the ResB-like family.</p>
               <tbl id="T7">
                  <title>
                     <p>Table 7</p>
                  </title>
                  <caption>
                     <p>Proteins that are Uniquely Shared by the <it>Bacteroidetes </it>and <it>Chlorobi </it>Species</p>
                  </caption>
                  <tblbdy cols="7">
                     <r>
                        <c ca="center">
                           <p>
                              <b>Protein I.D. No. [Accession No.]</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>PG0081 [<ext-link ext-link-type="gen" ext-link-id="NP_904430">NP_904430</ext-link>]</p>
                        </c>
                        <c ca="center">
                           <p>PG0649 [<ext-link ext-link-type="gen" ext-link-id="NP_904929">NP_904929</ext-link>]</p>
                        </c>
                        <c ca="center">
                           <p>BF2432 [<ext-link ext-link-type="gen" ext-link-id="YP_099715">YP_099715</ext-link>]</p>
                        </c>
                        <c ca="center">
                           <p>PG1818 [<ext-link ext-link-type="gen" ext-link-id="NP_905917">NP_905917</ext-link>]</p>
                        </c>
                        <c ca="center">
                           <p>PG1977 [<ext-link ext-link-type="gen" ext-link-id="NP_906051">NP_906051</ext-link>]</p>
                        </c>
                        <c ca="center">
                           <p>BF2465 [<ext-link ext-link-type="gen" ext-link-id="YP_099748">YP_099748</ext-link>]</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="7">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>Length (aa)</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>725 aa</p>
                        </c>
                        <c ca="center">
                           <p>194 aa</p>
                        </c>
                        <c ca="center">
                           <p>1478</p>
                        </c>
                        <c ca="center">
                           <p>238 aa</p>
                        </c>
                        <c ca="center">
                           <p>668 aa</p>
                        </c>
                        <c ca="center">
                           <p>93 aa</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>Possible Function</b>
                           </p>
                        </c>
                        <c ca="center">
                           <p>Hypoth.</p>
                        </c>
                        <c ca="center">
                           <p>Hypoth.</p>
                        </c>
                        <c ca="center">
                           <p>Hypoth.</p>
                        </c>
                        <c ca="center">
                           <p>Putative transmembrane</p>
                        </c>
                        <c ca="center">
                           <p>Hypoth.</p>
                        </c>
                        <c ca="center">
                           <p>Hypoth.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>
                                 <it>Bacteroidetes </it>
                              </b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>P. gingivalis</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>B. fragilis </it>NCT</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>B. fragilis</it>YCH</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>B. thetaiotaomi</it>.</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Prev. intermedia</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Prev. ruminicola</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>G. forestii</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>F. bacterium </it>BBF</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>F. bacterium </it>HTC</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>F. johnsoniae</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Flavobacterium</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Cellulophaga</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>C. atlanticus</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>
                              <b>*</b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Polibacter irgensii</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Psychro. torquis</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Rob. biformata</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Tenacibaculum</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Cyto. hutchinsonii</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Salinibacter ruber</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <b>
                                 <it>Chlorobi</it>
                              </b>
                           </p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Cb. tepidum</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>C. chlorochrom</it>.</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>C. luteolum</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>C. limicola</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>C. phaeobac </it>BS1</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>C. phaeobac </it>DSM</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>C. clathratiforme</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Prosthec. aesturii</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>C. phaeovibrioides</it>
                           </p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c ca="center">
                           <p>*</p>
                        </c>
                        <c>
                           <p/>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>All significant blast hits for these proteins are from the indicated (marked by *) Bacteroidetes and Chlorobi species. For the protein PG0081, a homolog is also present in <it>Fibrobacter succinogenes subsp. succinogenes S85 </it>(identified by blast search against the partial sequence). The blank space indicates that no hit showing significant similarity was detected at the present time.</p>
                  </tblfn>
               </tbl>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>This work has identified a large number of proteins that are specific for <it>Bacteroidetes </it>and <it>Chlorobi </it>species at various taxonomic levels. Homologs exhibiting significant similarity to these proteins are not found in any other bacteria, except in a few isolated cases. Among the proteins that are specific for the <it>Bacteroidetes</it>, 27 proteins are specific for the entire phylum as their homologs are present in species from all three main orders within this phylum. Many other proteins are limited to various clades within the <it>Bacteroidetes </it>phylum. These include 41 proteins that are common to the <it>Flavobacteriales </it>and <it>Bacteroidales </it>orders; 53 and 38 proteins that are specific for the <it>Bacteriodales </it>and <it>Flavobacteriales </it>orders, respectively; and 185 proteins that are specific for the <it>Bacteriodes </it>genus. We have also identified 51 proteins that are specific for the <it>Chlorobi </it>species and 6 proteins that are uniquely shared by the <it>Bacteroidetes </it>and <it>Chlorobi phyla</it>. Two large conserved inserts in the DnaE and AlaRS proteins that are distinctive characteristics of the <it>Chlorobi </it>species were also discovered in this work. In addition, a deletion in ClpB protein that is mainly specific for the <it>Bacteriodales</it>, <it>Flavobacteriales </it>and <it>Flexibacteraceae </it>was also identified. In earlier work, a number of conserved inserts that are specific for either the <it>Bacteroidetes </it>phylum (viz. SecA and Gyrase B) or commonly shared by the <it>Bacteroidetes </it>and <it>Chlorobi </it>species were also described. Based upon their specificity for the <it>Bacteroidetes </it>and <it>Chlorobi </it>species, these molecular markers provide novel and more definitive means for identifying and circumscribing species from these groups.</p>
         <p>The species distribution patterns of these signature proteins and conserved indels strongly suggest that they or the genes for them have evolved at various stages in the evolution of these bacteria (Fig. <figr fid="F5">5</figr>). However, subsequent to their evolution or introduction in these genes, these genomic characteristics are stably retained in various descendents of these lineages with minimal gene loss or LGTs, as has also been found in other related studies <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B44">44</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr></abbrgrp>. The evolutionary relationship among the <it>Bacteroidetes </it>species as deduced from these signature proteins is in complete agreement with their branching pattern in phylogenetic trees (Figs. <figr fid="F1">1</figr> and <figr fid="F2">2</figr>). The unique presence of several signature proteins as well as conserved indels in a number of essential proteins (viz. FtsK, UvrB and ATP synthase alpha subunit) by different <it>Bacteroidetes </it>and <it>Chlorobi </it>species provides compelling evidence that species from these two groups shared a common ancestor exclusive of all other bacteria. In earlier studies, a close relationship of <it>Fiborbacteres </it>to the <it>Bacteroidetes </it>and <it>Chlorobi </it>was also observed <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B30">30</abbr></abbrgrp>. The species from all these three groups were found to contain large conserved indels in RNA polymerase &#946; ' subunit and serine hydroxymethyl transferase, that were not found in any other bacteria <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The species from these three groups also branched in the same position based on distribution profiles of signature sequences in a number of other proteins and in different phylogenetic trees <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B69">69</abbr></abbrgrp>. The unique shared presence of the protein PG0081 by all sequenced <it>Chlorobi </it>and <it>Bacteroidetes </it>species as well as <it>Fibrobacter succinogenes</it>, provides further evidence that species from these three groups form a single superphylum and that they shared a common ancestor exclusive of all other bacteria <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
         <fig id="F5">
            <title>
               <p>Figure 5</p>
            </title>
            <caption>
               <p>A summary diagram showing the evolutionary stages where different signature proteins and conserved indels that are specific for the <it>Bacteroidetes </it>and <it>Chlorobi </it>species have likely evolved or originated</p>
            </caption>
            <text>
               <p>A summary diagram showing the evolutionary stages where different signature proteins and conserved indels that are specific for the <it>Bacteroidetes </it>and <it>Chlorobi </it>species have likely evolved or originated. Some of the conserved inserts that are specific for these groups or indicate their branching position relative to other bacterial phyla have been described in earlier work [30,43,69].</p>
            </text>
            <graphic file="1471-2148-7-71-5"/>
         </fig>
         <p>This paper also reports phylogenetic analyses of <it>Bacteroidetes </it>and <it>Chlorobi </it>species based on a concatenated alignment of 12 highly conserved proteins. The branching order of various species in the trees obtained using different phylogenetic methods were in general very similar with the clades corresponding to the <it>Chlorobi </it>species and the <it>Cytophaga-Flavobacteria-Bacteroides </it>species, well resolved from each other with 100% bootstrap scores. The species corresponding to the two main groups within the <it>Bacteroidetes </it>phylum (viz. the <it>Bacteriodales </it>and <it>Flavobacteriales </it>orders) were also clearly resolved. However, in the trees constructed by traditional phylogenetic methods such as NJ, ML and MP, the phylogenetic placement of <it>S. ruber </it>was not resolved. In all of these trees, <it>S. ruber </it>appeared either as a very deep branch of the <it>Chlorobi </it>clade (i.e. in NJ and ML trees) or as outgroup of both the <it>Chlorobi </it>and the CFB clades (MP tree). In contrast to these trees, when the same dataset was analyzed by means of the character compatibility or clique approach, <it>S. ruber </it>formed the deepest branch of the <it>Bacteroidetes </it>species and its specific association with this group was supported by 21 uniquely shared characters, indicating strongly that this affiliation was reliable <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. These results provide evidence that the character compatibility approach, which removes all fast-evolving as well as homoplasic sites from a given dataset, provides a powerful means for obtaining correct topology in cases, such as that for <it>S. ruber</it>, whose phyletic affinity has proven difficult to establish by traditional phylogenetic methods <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B52">52</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>.</p>
         <p>The cellular functions of most of the <it>Bacteroidetes </it>or <it>Chlorobi</it>-specific proteins identified in the present work are not known. A few of the <it>Chlorobi-</it>specific proteins are involved in chlorosome- or photosynthesis-related functions, which is expected as <it>Chlorobi </it>is one of the few bacteria phyla that possesses photosynthetic ability <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B34">34</abbr><abbr bid="B70">70</abbr></abbrgrp>. A number of other proteins exhibit weak sequence similarity to conserved domains in certain other proteins, but considering that the overall sequence similarity is not significant, the actual functions of these proteins could be quite different. Therefore, an important task for the future is to determine the cellular functions of these <it>Bacteroidetes </it>or <it>Chlorobi </it>specific proteins. Likewise, it is also of much interest to determine the functional significance of the conserved indels in SecA, Gyrase B and ClpB proteins that are distinctive characteristics of the <it>Bacteroidetes </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and of the inserts in DnaE and AlaRS proteins that are specific for the <it>Chlorobi </it>species. The retention of these signature proteins and conserved indels by all species from these groups strongly suggests that they are functionally important for these bacteria. Hence, further studies on these molecular signatures should lead to the discovery of novel biochemical and physiological characteristics that are unique to these bacteria. The primary sequences of many of these genes/proteins that are specific for the <it>Bacteroidetes </it>or <it>Chlorobi </it>species are highly conserved and they provide novel means for identification of both known as well as novel species belonging to these groups by means of PCR-based and immunological methods. Several <it>Bacteroidetes </it>species play central role in the initiation and progression of periodontal diseases in humans <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B58">58</abbr></abbrgrp>. Hence, the proteins that are specific to these bacteria also provide important potential targets for development of therapeutics and vaccines for treatment and prevention of periodontal diseases.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Identification of Proteins that are Specific for Bacteroidetes and Chlorobi</p>
            </st>
            <p>The blastp searches were carried out on each ORF in the genomes of <it>P. gingivalis </it>W83 <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, <it>B. fragilis </it>YCH46 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, <it>B. thetaiotaomicron </it>VPI-5482 <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, <it>G. forsetii KT080 </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, <it>Chlorobium </it>(<it>Pelodictyon</it>)<it>luteolum </it>DSM 273 and <it>C. tepidum </it>TLS <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, to identify proteins that are specific for the <it>Bacteroidetes </it>and <it>Chlorobi </it>phyla at different taxonomic levels. The blastp searches were performed against all organisms (i.e. using the NCBI non-redundant (nr) database) with default settings except that the low complexity filter was not used <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. The proteins that were of interest were those where either all significant hits were from these groups of species or which involved a large increase in E values from the last <it>Bacteroidetes-Chlorobi </it>hit to the first hit from any other organism and the E values for the latter hits in most cases > 10<sup>-4</sup>, which indicates a weak similarity that could occur by chance. However, higher E values were sometimes acceptable particularly for smaller proteins as the magnitude of the E value depends upon the length of the query sequence. All promising proteins were further analyzed using the position-specific iterated (PSI)-blast program <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. This program creates a position-specific scoring matrix from statistically significant alignments produced by the blastp program and then searches the database using this matrix. The PSI-blast is more sensitive in identifying weak but biologically relevant sequence similarity as compared to the blastp program <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. In the present work, a protein was considered to be specific for a given group if all hits producing significant alignments were from that group of species. However, we have also retained a few proteins where 1 or 2 isolated species from other groups of bacteria also had acceptable E values, as they provide possible cases of lateral gene transfer. For all of the <it>Bacteroidetes </it>or <it>Chlorobi</it>-specific proteins identified in the present work, their protein ID's, accession numbers and any information regarding cellular functions (such as COG number or the presence of any conserved domain) are presented here. Preliminary sequence information regarding the presence of a homolog of a query protein in the partially sequenced genomes of <it>P. intermedia, P. ruminicola </it>and <it>F. succinogenes subsp. succinogenes S85 </it>were obtained via genomic blasts against The Institute for Genomic Research database for unfinished microbial genomes <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. In describing various proteins in the text, "PG," "BF," "BT," "orf", "Plut" and "CT" indicate the identification numbers of the proteins in the genomes of <it>P. gingivalis </it>W83, <it>B. fragilis </it>YCH46, <it>B. thetaiotaomicron </it>VPI-5482,<it>G. forsetii KT080, C. (Pelodictyon) luteolum </it>DSM and <it>C. tepidum </it>TLS, respectively.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic Analysis</p>
            </st>
            <p>The amino acid sequences for the 12 conserved proteins viz. RNA polymerase &#946; subunit, RNA polymerase &#946; ' subunit (RpoC), alanyl-tRNA synthetase (AlaRS), arginyl-tRNA synthetase, phenylalanyl-tRNA synthetase, elongation factor-Tu, elongation factor G, RecA protein, DNA gyrase subunit A, DNA gyrase subunit B, Hsp60 or GroEL protein and DnaK or Hsp70 protein, for different species were downloaded from the NCBI database and aligned using the ClustalX (1.83) program using the default settings <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>. The sequences for two deep-branching species, <it>D. radiodurans </it>and <it>T. aquaticus </it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, were included in this dataset for rooting purposes. The sequence alignments for all 12 proteins were concatenated into a single large alignment containing 8899 positions. Poorly aligned regions from this alignment were removed with the Gblocks 0.91b program <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>, using the default settings, except that allowable gap position was selected to half. This resulted in a final sequence alignment of 6998 sites, which was used for phylogenetic analyses. A NJ tree based on this alignment (bootstrapped 1000 time) was constructed based on Kimura's model <abbrgrp><abbr bid="B75">75</abbr></abbrgrp> using the TREECON programs <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>. Maximum-likelihood and MP trees were computed using the WAG+F model plus a gamma distribution with four categories <abbrgrp><abbr bid="B77">77</abbr></abbrgrp> using the TREE-PUZZLE <abbrgrp><abbr bid="B78">78</abbr></abbrgrp> and Mega 3.1 program <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>, respectively. All trees were bootstrapped 100 times <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>, unless otherwise indicated.</p>
            <p>The character compatibility analysis on the concatenated alignment was carried out as described recently <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. Using the program "DUALSITE" <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, all sites in the alignments where only two amino acid states were found, with each state present in at least two species, were selected. All columns with any gaps were omitted. The sites where one of the states is present in only a single species are not useful for compatibility analysis. All useful two state sites were converted into a binary file of "0, 1" characters using the DUALSITE program and this file was used for compatibility analysis <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>. The compatibility analysis was carried out using the CLIQUE program from the PHYLIP (ver. 3.5c) program package <abbrgrp><abbr bid="B81">81</abbr></abbrgrp> to identify the largest clique(s) of compatible characters. The cliques were drawn and the numbers of characters that distinguished different nodes were indicated. The sequence information for <it>G. forsetii</it>, which became available after these analyses were completed <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> is not included in these trees.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of conserved indels</p>
            </st>
            <p>Multiple sequence alignments for large numbers of proteins have been created in our earlier work <abbrgrp><abbr bid="B82">82</abbr><abbr bid="B83">83</abbr><abbr bid="B84">84</abbr></abbrgrp>. These alignments were visually inspected to search for any indels in a conserved region that was uniquely present in <it>C. tepidum </it>(the only <it>Chlorobi </it>species present in these groups). The specificity of any potential indel for these groups was evaluated by carrying out by blastp searches on a short segment of the sequence (between 80&#8211;120 aa) containing the indel and the flanking conserved regions against the nr database. The purpose of these blast searches was to obtain information from all available homologs to determine the specificities of the indels.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>CD, conserved domain; CFB, Cytophaga-Flavobacteria-Bacteroides; Indel, insert or deletion; ORF, open reading frame; ORFans, open reading frames of unknown functions; AlaRS, alanyl-tRNA synthetase; RGC, rare genetic change; RpoC, RNA polymerase &#946; '-subunit.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>The initial blastp searches on various genomes were carried out by RSG with the computer assistance provided by Venus Wong. EL analyzed the results of these blast searches to identify various group-specific proteins and confirmed their specificities by means of PSI-blast and genomic blasts. RSG carried out the phylogenetic analyses and identified the conserved indels described here. RSG was also responsible for conceiving and directing this study and for the final evaluation of results. RSG was responsible for the preparation of the final manuscript. All authors have read and approved the submitted manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Venus Wong and Yan Li for providing computer support for blast analyses and Beile Gao for help with some phylogenetic analysis. We thank the investigators at US DOE Joint Genome Institute and Gordon and Betty Moore Foundation Marine Biotechnology Initiative for making the genome data for various Bacteroidetes and Chlorobi species available in public databases prior to publication, which were of central importance in these studies. Blast searches against preliminary sequence data for the <it>P. intermedia, P. ruminicola </it>and <it>F. succinogenes </it>were performed at The Institute for Genomic Research website <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. This work was supported by a research grant from the Canadian Institute of Health Research.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The Revised Road Map to the Manual</p>
            </title>
            <aug>
               <au>
                  <snm>Garrity</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Lilburn</snm>
                  <fnm>TG</fnm>
               </au>
            </aug>
            <source>Bergey's Manual of Systematic Bacteriology, Volume 2, Part A, Introductory Essays</source>
            <publisher>New York, Springer</publisher>
            <editor>Brenner DJ, Krieg NR and Staley JT</editor>
            <pubdate>2005</pubdate>
            <fpage>159</fpage>
            <lpage>220</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Bacterial evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Microbiol Rev</source>
            <pubdate>1987</pubdate>
            <volume>51</volume>
            <fpage>221</fpage>
            <lpage>271</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">373105</pubid>
                  <pubid idtype="pmpid" link="fulltext">2439888</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Overview: A phylogenetic backbone and taxonomic framework for prokaryotic systamatics</p>
            </title>
            <aug>
               <au>
                  <snm>Ludwig</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
            </aug>
            <source>Bergey's Manual of Systematic Bacteriology</source>
            <publisher>Berlin, Springer-Verlag</publisher>
            <editor>Boone DR and Castenholz RW</editor>
            <edition>2nd</edition>
            <pubdate>2001</pubdate>
            <fpage>49</fpage>
            <lpage>65</lpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The Genus Bacteroides and Related taxa</p>
            </title>
            <aug>
               <au>
                  <snm>Shah</snm>
                  <fnm>HN</fnm>
               </au>
            </aug>
            <source>The Prokaryotes</source>
            <publisher>New York, Springer-Verlag</publisher>
            <editor>Balows A, Truper HG, Dworkin M, Harder W and Schleifer KH</editor>
            <edition>2</edition>
            <pubdate>1992</pubdate>
            <volume>196</volume>
            <fpage>3593</fpage>
            <lpage>3607</lpage>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The Order Cytophagales</p>
            </title>
            <aug>
               <au>
                  <snm>Reichenbach</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>The Prokaryotes</source>
            <publisher>New York, Springer-Verlag</publisher>
            <editor>Balows A, Truper HG, Dworkin M, Harder W and Schleifer KH</editor>
            <edition>2</edition>
            <pubdate>1992</pubdate>
            <volume>199</volume>
            <fpage>3631</fpage>
            <lpage>3675</lpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Phylogeny of bacteroides, prevotella, and porphyromonas spp.and related bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Paster</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Dewhirst</snm>
                  <fnm>FE</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1994</pubdate>
            <volume>176</volume>
            <fpage>725</fpage>
            <lpage>732</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">205110</pubid>
                  <pubid idtype="pmpid" link="fulltext">8300528</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Family I. Bacteroidaceae Pribam 1933.</p>
            </title>
            <aug>
               <au>
                  <snm>Holdeman</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>WEC</fnm>
               </au>
            </aug>
            <source>Bergey's Manual of Systematic Bacteriology</source>
            <publisher>Baltimore, Williams and Wilkins</publisher>
            <editor>Krieg NR and Holt JG</editor>
            <edition>Ist</edition>
            <pubdate>1984</pubdate>
            <fpage>602</fpage>
            <lpage>662</lpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Bacteroides, Prevotella, and Porphyromonas</p>
            </title>
            <aug>
               <au>
                  <snm>Shah</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Gharbia</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Topley &amp; Wilson's Microbiology and Microbial Infections,</source>
            <publisher>London, Hodder Arnold</publisher>
            <editor>Borrelio SP, Murray PR and Funke G</editor>
            <edition>10th</edition>
            <pubdate>2005</pubdate>
            <volume>80</volume>
            <fpage>1913</fpage>
            <lpage>1944</lpage>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Diverse bacteria related to the bacteroides subgroup of the CFB phylum within the gut symbiotic communities of various termites</p>
            </title>
            <aug>
               <au>
                  <snm>Ohkuma</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Noda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hongoh</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kudo</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Biosci Biotechnol Biochem</source>
            <pubdate>2002</pubdate>
            <volume>66</volume>
            <fpage>78</fpage>
            <lpage>84</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11866123</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>New degenerate Cytophaga-Flexibacter-Bacteroides-specific 16S ribosomal DNA-targeted oligonucleotide probes reveal high bacterial diversity in River Taff epilithon</p>
            </title>
            <aug>
               <au>
                  <snm>O'Sullivan</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Weightman</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>68</volume>
            <fpage>201</fpage>
            <lpage>210</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">126579</pubid>
                  <pubid idtype="pmpid" link="fulltext">11772628</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Salinibacter ruber gen. nov., sp. nov., a novel, extremely halophilic member of the Bacteria from saltern crystallizer ponds</p>
            </title>
            <aug>
               <au>
                  <snm>Anton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Oren</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Benlloch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Valera</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Amann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rossello-Mora</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>52</volume>
            <fpage>485</fpage>
            <lpage>491</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11931160</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genomics of oral bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Duncan</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Crit Rev Oral Biol Med</source>
            <pubdate>2003</pubdate>
            <volume>14</volume>
            <fpage>175</fpage>
            <lpage>187</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12799321</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The Genera Rhodothermus, Thermonema, Hymenobacter and Salinibacter</p>
            </title>
            <aug>
               <au>
                  <snm>Oren</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>The Prokaryotes: An Evolving Electronic Resource for the Microbiological Community</source>
            <publisher>New York, Springer-Verlag</publisher>
            <editor>Dworkin M and al </editor>
            <edition>3rd, Release 3.3</edition>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Bacteroides of the human lower intestinal tract</p>
            </title>
            <aug>
               <au>
                  <snm>Salyers</snm>
                  <fnm>AA</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>1984</pubdate>
            <volume>38</volume>
            <fpage>293</fpage>
            <lpage>313</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">6388494</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A genomic view of the human-Bacteroides thetaiotaomicron symbiosis</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bjursell</snm>
                  <fnm>MK</fnm>
               </au>
               <au>
                  <snm>Himrod</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Carmichael</snm>
                  <fnm>LK</fnm>
               </au>
               <au>
                  <snm>Chiang</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Hooper</snm>
                  <fnm>LV</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>JI</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>299</volume>
            <fpage>2074</fpage>
            <lpage>2076</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12663928</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Extensive DNA inversions in the B. fragilis genome control variable gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Cerdeno-Tarraga</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Patrick</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Crossman</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Blakely</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Abratt</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lennard</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Poxton</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Duerden</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Barron</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Corton</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Doggett</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holden</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Larke</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Line</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lord</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Norbertczak</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ormond</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rabbinowitsch</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Woodward</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <fpage>1463</fpage>
            <lpage>1465</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15746427</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Prevalence of Enterotoxigenic Bacteroides fragilis in patients with diarrhea: A controlled study</p>
            </title>
            <aug>
               <au>
                  <snm>Durmaz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dalgalar</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Durmaz</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Anaerobe</source>
            <pubdate>2005</pubdate>
            <volume>11</volume>
            <fpage>318</fpage>
            <lpage>321</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16701593</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Bacteroides, Prevotella and Porphyromonas</p>
            </title>
            <aug>
               <au>
                  <snm>Shah</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Gharbia</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Duerden</snm>
                  <fnm>BI</fnm>
               </au>
            </aug>
            <source>Topley &amp;Wilson's Microbiology and Microbial Infections vol. 2 Systematic Bacteriology</source>
            <publisher>London, Arnold</publisher>
            <editor>Balows A and Duerden BI</editor>
            <edition>9th</edition>
            <pubdate>1998</pubdate>
            <volume>58</volume>
            <fpage>1305</fpage>
            <lpage>1330</lpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Bacterial diversity in human subgingival plaque</p>
            </title>
            <aug>
               <au>
                  <snm>Paster</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Boches</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Galvin</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Ericson</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>CN</fnm>
               </au>
               <au>
                  <snm>Levanos</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Sahasrabudhe</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dewhirst</snm>
                  <fnm>FE</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <fpage>3770</fpage>
            <lpage>3783</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">95255</pubid>
                  <pubid idtype="pmpid" link="fulltext">11371542</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Microbial composition of supra- and subgingival plaque in subjects with adult periodontitis</p>
            </title>
            <aug>
               <au>
                  <snm>Ximenez-Fyvie</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Haffajee</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Socransky</snm>
                  <fnm>SS</fnm>
               </au>
            </aug>
            <source>J Clin Periodontol</source>
            <pubdate>2000</pubdate>
            <volume>27</volume>
            <fpage>722</fpage>
            <lpage>732</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11034118</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The Phototrophic way of Life</p>
            </title>
            <aug>
               <au>
                  <snm>Overmann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Garcia-Pichel</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>The Prokaryotes: An Evolving Electronic Resource for the Microbiological Community</source>
            <publisher>New York, Springer-Verlag</publisher>
            <editor>Dworkin M and al </editor>
            <edition>3rd, Release 3.3</edition>
            <pubdate>2000</pubdate>
            <url>http://141.150.157.117:8080/prokPUB/chaprender/jsp/showchap.jsp?chapnum=239</url>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The Family Chlorobiaceae</p>
            </title>
            <aug>
               <au>
                  <snm>Overmann</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>The Prokaryotes</source>
            <edition>3rd edition</edition>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The Family Chlorobiaceae</p>
            </title>
            <aug>
               <au>
                  <snm>Truper</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>Pfennig</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>The Prokaryotes</source>
            <publisher>New York, Springer-Verlag</publisher>
            <editor>Balows A, Truper HG, Dworkin M, Harder W and Schleifer KH</editor>
            <edition>2</edition>
            <pubdate>1992</pubdate>
            <volume>195</volume>
            <fpage>3583</fpage>
            <lpage>3592</lpage>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The complete genome sequence of Chlorobium tepidum TLS, a photosynthetic, anaerobic, green-sulfur bacterium</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>DeBoy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Kolonay</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shea</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>Parksey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Feldblyum</snm>
                  <fnm>TV</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Craven</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Radune</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vamathevan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gruber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>9509</fpage>
            <lpage>9514</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123171</pubid>
                  <pubid idtype="pmpid" link="fulltext">12093901</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Phylogenetic affiliation of the bacteria that constitute phototrophic consortia</p>
            </title>
            <aug>
               <au>
                  <snm>Frostl</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Overmann</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Arch Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>174</volume>
            <fpage>50</fpage>
            <lpage>58</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10985742</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Prokaryotic photosynthesis and phototrophy illuminated</p>
            </title>
            <aug>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Frigaard</snm>
                  <fnm>NU</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <fpage>488</fpage>
            <lpage>496</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16997562</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The winds of (evolutionary) change: breathing new life into microbiology.</p>
            </title>
            <aug>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1994</pubdate>
            <volume>176</volume>
            <fpage>1</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">205007</pubid>
                  <pubid idtype="pmpid" link="fulltext">8282683</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The phylogenetic relationships of Chlorobium tepidum and Chloroflexus auranticus based upon their RecA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Gruber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>1998</pubdate>
            <volume>162</volume>
            <fpage>53</fpage>
            <lpage>60</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9595663</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Evolutionary relationships among photosynthetic prokaryotes (Heliobacterium chlorum, Chloroflexus aurantiacus, cyanobacteria, Chlorobium tepidum and proteobacteria): implications regarding the origin of photosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Mukhtar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>32</volume>
            <fpage>893</fpage>
            <lpage>906</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10361294</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The Phylogeny and Signature Sequences characteristics of Fibrobacters, Chlorobi and Bacteroidetes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Crit Rev Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>30</volume>
            <fpage>123</fpage>
            <lpage>143</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15239383</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Origin and early evolution of photosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Blankenship</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Photosynthesis Research</source>
            <pubdate>1992</pubdate>
            <volume>33</volume>
            <fpage>91</fpage>
            <lpage>111</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11538390</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Chlorobium tepidum: insights into the structure, physiology, and metabolism of a green sulfur bacterium derived from the complete genome sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Frigaard</snm>
                  <fnm>NU</fnm>
               </au>
               <au>
                  <snm>Chew</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Maresca</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Photosynth Res</source>
            <pubdate>2003</pubdate>
            <volume>78</volume>
            <fpage>93</fpage>
            <lpage>117</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16245042</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Structure of a bacteriochlorophyll-protein from the green photosynthetic bacterium Chlorobium limicola: crystallographic evidence for a trimer</p>
            </title>
            <aug>
               <au>
                  <snm>Fenna</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>BW</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Shaw</snm>
                  <fnm>EK</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1974</pubdate>
            <volume>84</volume>
            <fpage>231</fpage>
            <lpage>240</lpage>
            <xrefbib>
               <pubid idtype="pmpid">4830866</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Antenna complexes from green photosynthetic bacateria</p>
            </title>
            <aug>
               <au>
                  <snm>Blankenship</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Anoxygenic photosynthetic bacteria</source>
            <publisher>Dordrecht, Kluwer</publisher>
            <editor>Blankenship RE, Madigan MT and Bauer CE</editor>
            <pubdate>1995</pubdate>
            <fpage>399</fpage>
            <lpage>435</lpage>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Genomic analysis of Bacteroides fragilis reveals extensive DNA inversions regulating cell surface adaptation</p>
            </title>
            <aug>
               <au>
                  <snm>Kuwahara</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hirakawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nakayama</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Toh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ohnishi</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>14919</fpage>
            <lpage>14924</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">522005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15466707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The genome of Salinibacter ruber: Convergence and gene exchange among hyperhalophilic bacteria and archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Mongodin</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>DeBoy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Wister</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Weidman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Papke</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Sharma</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Nesbo</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>MacLeod</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bapteste</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Legault</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rodriguez-Valera</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>18147</fpage>
            <lpage>18152</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1312414</pubid>
                  <pubid idtype="pmpid" link="fulltext">16330755</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Signature Proteins that are Distinctive of Alpha Proteobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Kainth</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>94</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182365</pubid>
                  <pubid idtype="pmpid" link="fulltext">15960851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>BLAST screening of chlamydial genomes to identify signature proteins that are unique for the Chlamydiales, Chlamydiaceae, Chlamydophila and Chlamydia groups of species</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ventresca</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>14</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1403754</pubid>
                  <pubid idtype="pmpid" link="fulltext">16436211</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Signature proteins that are distinctive characteristics of Actinobacteria and their subgroups</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parmanathan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Antonie van Leeuwenhoek</source>
            <pubdate>2006</pubdate>
            <volume>(In press)</volume>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Molecular signatures (unique proteins and conserved Indels) that are specific for the epsilon proteobacteria (Campylobacterales)</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>167</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1557499</pubid>
                  <pubid idtype="pmpid" link="fulltext">16817973</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Signature Proteins for Archaea and Its Main Subgroups and the Origin of Methanogenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>86</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1852104</pubid>
                  <pubid idtype="pmpid" link="fulltext">17394648</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Prokaryote diversity and taxonomy: current status and future challenges</p>
            </title>
            <aug>
               <au>
                  <snm>Oren</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>2004</pubdate>
            <volume>359</volume>
            <fpage>623</fpage>
            <lpage>638</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1693353</pubid>
                  <pubid idtype="pmpid" link="fulltext">15253349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Critical Issues in Bacterial Phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Theor Popul Biol</source>
            <pubdate>2002</pubdate>
            <volume>61</volume>
            <fpage>423</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12167362</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Distributional profiles of homologous open reading frames among bacterial phyla: implications for vertical and lateral transmission</p>
            </title>
            <aug>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>52</volume>
            <fpage>777</fpage>
            <lpage>787</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12054238</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Phylogenetic taxonomy of the family Chlorobiaceae on the basis of 16S rRNA and fmo (Fenna-Matthews-Olson protein) gene sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Imhoff</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <fpage>941</fpage>
            <lpage>951</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12892110</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Phylogeny and molecular fingerprinting of green sulfur bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Overmann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tuschak</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Arch Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>167</volume>
            <fpage>302</fpage>
            <lpage>309</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9094228</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Phylogeny of green sulfur bacteria on the basis of gene sequences of 16S rRNA and of the Fenna-Matthews-Olson protein</p>
            </title>
            <aug>
               <au>
                  <snm>Alexander</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Imhoff</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Arch Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>178</volume>
            <fpage>131</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12115058</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Molecular systematic studies of eubacteria, using s70- type sigma factors of group 1 and group 2</p>
            </title>
            <aug>
               <au>
                  <snm>Gruber</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <fpage>1734</fpage>
            <lpage>1747</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">178889</pubid>
                  <pubid idtype="pmpid" link="fulltext">9045836</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Phylogenetic analysis and taxonomic study of marine Cytophaga-like bacteria: proposal for Tenacibaculum gen. nov with Tenacibaculum maritimum comb. nov and Tenacibaculum ovolyticum comb. nov., and description of Tenacibaculum mesophilum sp nov and Tenacibaculum amylolyticum sp nov</p>
            </title>
            <aug>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nakagawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Harayama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>51</volume>
            <fpage>1639</fpage>
            <lpage>1652</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11594591</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Genome-scale approaches to resolving incongruence in molecular phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>425</volume>
            <fpage>798</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14574403</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Methods in Enzymology</source>
            <pubdate>1996</pubdate>
            <volume>266:418-27</volume>
            <fpage>418</fpage>
            <lpage>427</lpage>
         </bibl>
         <bibl id="B52">
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Inferring Phylogenies</source>
            <publisher>Sunderland, Mass., Sinauer Associates, Inc.</publisher>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B53">
            <title>
               <p>The uniquely evolved character concept and its cladistic application</p>
            </title>
            <aug>
               <au>
                  <snm>Le Quesne</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Systematic Zoology</source>
            <pubdate>1975</pubdate>
            <volume>23</volume>
            <fpage>513</fpage>
            <lpage>517</lpage>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Detecting evolutionary incompatibilities from protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Sneath</snm>
                  <fnm>PHA</fnm>
               </au>
               <au>
                  <snm>Sackin</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Ambler</snm>
                  <fnm>RP</fnm>
               </au>
            </aug>
            <source>Systematic Zoology</source>
            <pubdate>1975</pubdate>
            <volume>24</volume>
            <fpage>311</fpage>
            <lpage>332</lpage>
         </bibl>
         <bibl id="B55">
            <title>
               <p>A mathematical foundation for the analysis of cladistic character compatibility</p>
            </title>
            <aug>
               <au>
                  <snm>Estabrook</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>CSJ</fnm>
               </au>
               <au>
                  <snm>McMorris</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>Mathematical Biosciences</source>
            <pubdate>1976</pubdate>
            <volume>29</volume>
            <fpage>181</fpage>
            <lpage>187</lpage>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Comaptibility methods in Systematics</p>
            </title>
            <aug>
               <au>
                  <snm>Meacham</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Estabrook</snm>
                  <fnm>GF</fnm>
               </au>
            </aug>
            <source>Ann Rev Ecol Syst</source>
            <pubdate>1985</pubdate>
            <volume>16</volume>
            <fpage>431</fpage>
            <lpage>446</lpage>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Application of the character compatibility approach to generalized molecular sequence data: Branching order of the proteobacterial subdivisions</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Sneath</snm>
                  <fnm>PHA</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2006</pubdate>
            <volume>64</volume>
            <fpage>90</fpage>
            <lpage>100</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17160641</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Complete genome sequence of the oral pathogenic bacterium Porphyromonas gingivalis strain W83</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>DeBoy</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Fouts</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Mason</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tallon</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Granger</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Galvin</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Duncan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Dewhirst</snm>
                  <fnm>FE</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2003</pubdate>
            <volume>185</volume>
            <fpage>5591</fpage>
            <lpage>5601</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193775</pubid>
                  <pubid idtype="pmpid" link="fulltext">12949112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Whole genome analysis of the marine Bacteroidetes'Gramella forsetii' reveals adaptations to degradation of polymeric organic matter</p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kube</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Teeling</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Richter</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lombardot</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Allers</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wurdemann</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Quast</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kuhl</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Knaust</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Woebken</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bischof</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mussmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Choudhuri</snm>
                  <fnm>JV</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Reinhardt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Amann</snm>
                  <fnm>RI</fnm>
               </au>
               <au>
                  <snm>Glockner</snm>
                  <fnm>FO</fnm>
               </au>
            </aug>
            <source>Environ Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>8</volume>
            <fpage>2201</fpage>
            <lpage>2213</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17107561</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>CD-Search: protein domain annotations on the fly</p>
            </title>
            <aug>
               <au>
                  <snm>Marchler-Bauer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>W327</fpage>
            <lpage>W331</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441592</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Who's your neighbor? New computational approaches for functional genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>18</volume>
            <fpage>609</fpage>
            <lpage>613</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10835597</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>From protein sequence to function</p>
            </title>
            <aug>
               <au>
                  <snm>Danchin</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>363</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10408894</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>A regulation cascade controls expression of Porphyromonas gingivalis fimbriae via the FimR response regulator</p>
            </title>
            <aug>
               <au>
                  <snm>Nishikawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yoshimura</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Duncan</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>54</volume>
            <fpage>546</fpage>
            <lpage>560</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15469523</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Prokaryotic evolution in light of gene transfer</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>2226</fpage>
            <lpage>2238</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12446813</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Horizontal gene transfer, genome innovation and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Townsend</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Nat Rev Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>679</fpage>
            <lpage>687</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16138096</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>The essential C family DnaE polymerase is error-prone and efficient at lesion bypass</p>
            </title>
            <aug>
               <au>
                  <snm>Bruck</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>O'Donnell</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <fpage>44361</fpage>
            <lpage>44368</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12949067</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Bacterial genomes as new gene homes: the genealogy of ORFans in E. coli</p>
            </title>
            <aug>
               <au>
                  <snm>Daubin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1036</fpage>
            <lpage>1042</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419781</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173110</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Highways of gene sharing in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Beiko</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Harlow</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>14332</fpage>
            <lpage>14337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1242295</pubid>
                  <pubid idtype="pmpid" link="fulltext">16176988</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>The use of signature sequences in different proteins to determine the relative branching order of bacterial divisions: evidence that Fibrobacter diverged at a similar time to Chlamydia and the Cytophaga- Flavobacterium-Bacteroides division</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2001</pubdate>
            <volume>147</volume>
            <fpage>2611</fpage>
            <lpage>2622</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11535801</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>Evolutionary Relationships Among Photosynthetic Bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Photosynth Res</source>
            <pubdate>2003</pubdate>
            <volume>76</volume>
            <fpage>173</fpage>
            <lpage>183</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16228576</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein databases search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>TIGR Unfinished Microbial Genome Database. </p>
            </title>
            <aug>
               <au>
                  <cnm>TIGR</cnm>
               </au>
            </aug>
            <pubdate>2007</pubdate>
            <url>http://www.tigr.org/tdb/ufmg/index.shtml</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>Multiple sequence alignment with Clustal x</p>
            </title>
            <aug>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gouy</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <fpage>403</fpage>
            <lpage>405</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9810230</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Castresana</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>540</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10742046</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The Neutral Theory of Molecular Evolution</source>
            <publisher>Cambridge, Cambridge University Press</publisher>
            <pubdate>1983</pubdate>
         </bibl>
         <bibl id="B76">
            <title>
               <p>TREECON for Windows: a software package for the construction and drawing of evolutionary trees for the Microsoft Windows environment</p>
            </title>
            <aug>
               <au>
                  <snm>Van de</snm>
                  <fnm>PY</fnm>
               </au>
               <au>
                  <snm>De Wachter</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1994</pubdate>
            <volume>10</volume>
            <fpage>569</fpage>
            <lpage>570</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7828077</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach</p>
            </title>
            <aug>
               <au>
                  <snm>Whelan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>691</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11319253</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B78">
            <title>
               <p>TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing</p>
            </title>
            <aug>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Strimmer</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Vingron</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>von Haeseler</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>502</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11934758</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B79">
            <title>
               <p>MEGA3: Integrated software for molecular evolutionary genetics analysis and sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>150</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15260895</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Confidence limits in phylogenies: an approach using the bootstap</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>1985</pubdate>
            <volume>39</volume>
            <fpage>783</fpage>
            <lpage>791</lpage>
         </bibl>
         <bibl id="B81">
            <title>
               <p>PHYLIP, version 3.5c</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <publisher>Seattle, WA, University of Washington</publisher>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B82">
            <title>
               <p>Protein Phylogenies and Signature Sequences: A Reappraisal of Evolutionary Relationships Among Archaebacteria, Eubacteria, and Eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <fpage>1435</fpage>
            <lpage>1491</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">98952</pubid>
                  <pubid idtype="pmpid" link="fulltext">9841678</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B83">
            <title>
               <p>The phylogeny of Proteobacteria: relationships to other eubacterial phyla and eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>2000</pubdate>
            <volume>24</volume>
            <fpage>367</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10978543</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Molecular signatures in protein sequences that are characteristic of Cyanobacteria and plastid homologues</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Pereira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chandrasekera</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Johari</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <fpage>1833</fpage>
            <lpage>1842</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14657112</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
