<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2003-4-9-r55</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Evolution of mosaic operons by horizontal gene transfer and gene displacement <it>in situ</it></p>
         </title>
         <aug>
            <au id="A1">
               <snm>Omelchenko</snm>
               <mi>V</mi>
               <fnm>Marina</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
            </au>
            <au id="A2">
               <snm>Makarova</snm>
               <mi>S</mi>
               <fnm>Kira</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A3">
               <snm>Wolf</snm>
               <mi>I</mi>
               <fnm>Yuri</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A4">
               <snm>Rogozin</snm>
               <mi>B</mi>
               <fnm>Igor</fnm>
               <insr iid="I2"/>
            </au>
            <au id="A5" ca="yes">
               <snm>Koonin</snm>
               <mi>V</mi>
               <fnm>Eugene</fnm>
               <insr iid="I2"/>
               <email>koonin@ncbi.nlm.nih.gov</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Pathology, FE Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA</p>
            </ins>
            <ins id="I2">
               <p>National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2003</pubdate>
         <volume>4</volume>
         <issue>9</issue>
         <fpage>R55</fpage>
         <url>http://genomebiology.com/2003/4/9/R55</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2003-4-9-r55</pubid>
               <pubid idtype="pmpid">12952534</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>22</day>
               <month>4</month>
               <year>2003</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>26</day>
               <month>6</month>
               <year>2003</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>17</day>
               <month>7</month>
               <year>2003</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>29</day>
               <month>8</month>
               <year>2003</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2003</year>
         <collab>Omelchenko et al.; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <shorttitle>
         <p>Evolution of mosaic operons by horizontal gene transfer and gene displacement <it>in situ</it></p>
      </shorttitle>
      <shortabs>
         <p>Comparative genomics and phylogenetic analysis have been used to examine horizontal transfer of entire operons versus displacement of individual genes within operons by horizontally acquired orthologs and independent assembly of the same or similar operons from genes with different phylogenetic affinities.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Shuffling and disruption of operons and horizontal gene transfer are major contributions to the new, dynamic view of prokaryotic evolution. Under the 'selfish operon' hypothesis, operons are viewed as mobile genetic entities that are constantly disseminated via horizontal gene transfer, although their retention could be favored by the advantage of coregulation of functionally linked genes. Here we apply comparative genomics and phylogenetic analysis to examine horizontal transfer of entire operons versus displacement of individual genes within operons by horizontally acquired orthologs and independent assembly of the same or similar operons from genes with different phylogenetic affinities.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Since a substantial number of operons have been identified experimentally in only a few model bacteria, evolutionarily conserved gene strings were analyzed as surrogates of operons. The phylogenetic affinities within these predicted operons were assessed first by sequence similarity analysis and then by phylogenetic analysis, including statistical tests of tree topology. Numerous cases of apparent horizontal transfer of entire operons were detected. However, it was shown that apparent horizontal transfer of individual genes or arrays of genes within operons is not uncommon either and results in xenologous gene displacement <it>in situ</it>, that is, displacement of an ancestral gene by a horizontally transferred ortholog from a taxonomically distant organism without change of the local gene organization. On rarer occasions, operons might have evolved via independent assembly, in part from horizontally acquired genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The discovery of <it>in situ </it>gene displacement shows that combination of rampant horizontal gene transfer with selection for preservation of operon structure provides for events in prokaryotic evolution that, <it>a priori</it>, seem improbable. These findings also emphasize that not all aspects of operon evolution are selfish, with operon integrity maintained by purifying selection at the organism level.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Operons, clusters of co-transcribed genes that often encode functionally linked proteins, are the principal form of gene organization and regulation in prokaryotes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Comparative analysis of bacterial and archaeal genomes has shown that only a few operons are conserved across large evolutionary distances. In general, gene order in prokaryotes is poorly conserved and prone to numerous rearrangements <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. A detailed analysis of gene order conservation has shown that only 5-25% of the genes in bacterial and archaeal genomes belongs to gene strings (probable operons) shared by at least two distantly related species <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The presence of identical or similarly organized operons and suboperons in phylogenetically distant bacterial or archaeal lineages may result from three distinct evolutionary processes. Firstly, inheritance from the respective common ancestor - the core of the ribosomal protein superoperon is a case in point, but such conservation of operon organization is relatively rare; secondly, independent origin of identical operons or suboperons in different lineages; and thirdly, emergence of operons in a single lineage with subsequent dissemination by horizontal transfer. The potential central role of horizontal transfer in the evolution of operon organization of prokaryotic genomes is embodied in the 'selfish operon model' (SOM) <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. This model posits that "the physical proximity of genes in an operon provides no selective benefit to the individual organism but does enhance the fitness of the gene cluster itself, as clusters can be efficiently inherited horizontally as well as vertically" <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Under SOM, operons are conceptually analogous to integrating viruses (phages), transposons and other mobile genetic elements, although coregulation of the genes in an operon could be an important selective factor that favors retention of operons during evolution.</p>
         <p>Horizontal gene transfer (HGT) events have been classified into distinct categories of acquisition of new genes, acquisition of paralogs of existing genes and xenologous gene displacement whereby a gene is displaced by a horizontally transferred ortholog from another lineage (xenolog <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>). Each of these types of horizontal transfer is common among prokaryotes, but their relative contributions differ in different lineages <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Comparative-genomic analyses by many groups have suggested that, on the whole, horizontal gene transfer had substantial effects, albeit uneven in different lineages, on the gene content of bacterial and archaeal genomes <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. However, in spite of the considerable popularity of the selfish operon theory, we are unaware of systematic studies of horizontal gene transfer events at the level of operons. In part, this is likely to have been caused by the scarcity of experimental data on operon organization in any prokaryote other than <it>Escherichia coli</it>.</p>
         <p>Recent phylogenetic analyses of ribosomal proteins revealed several instances of apparent xenologous gene displacement within a conserved operon, in which other genes have not been horizontally transferred; in other words, these operons appear to represent an evolutionary mosaic <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Another study demonstrated a complicated mosaic organization of the leukotoxin operon in bacteria of the genus <it>Mannheimia </it>(<it>Pasteurella</it>); the observed evolutionary pattern had to be explained through multiple gene transfer events, which led to the hypothesis that, in this case, frequent gene displacement conferred selective advantage onto the bacterium by maintaining antigenic variation <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. In earlier studies, evolution of operons from gene blocks with distinct evolutionary fates has been considered for rfb operons coding for lipopolysaccharide biosynthesis in enterobacteria <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>.</p>
         <p>To assess the role of horizontal gene transfer in the evolution of operons systematically, we undertook phylogenetic analysis of members of highly conserved gene neighborhoods that are predicted to constitute operons <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We focused primarily on mosaic operons in which one or more of the genes apparently have been transferred from distantly related species such that the phylogeny of the transferred genes is obviously incongruent with the phylogeny of the remaining genes in the respective operons.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Identification of horizontal gene transfer</p>
            </st>
            <p>Experimental data on operons in organisms other than <it>E. coli </it>and, to a lesser extent, <it>B. subtilis </it>are scarce. Therefore we used conserved gene pairs and connected gene neighborhoods associated with them as an approximation of operon organization of genes in other prokaryotic genomes. Several studies have suggested strongly that all gene pairs that are conserved in multiple genomes belong to the same operon <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>. Here we used an extremely conservative threshold (conservation of a gene pair in 10 genomes) to ensure that only genuine operons were analyzed. BLASTP searches for potential horizontal gene transfer identified 729 candidate genes (9% of all genes comprising conserved neighborhoods in 41 analyzed genomes), that is, genes whose encoded protein sequences were more similar to homologs from phylogenetically distant taxa than to those from the reference taxon (it might be worth noting that, throughout this analysis, we treated genes as atomic units and did not consider the relatively unlikely possibility of HGT for portions of genes). Phylogenetic analysis of these genes and their neighbors revealed different types of evolutionary events, some of which involve whole operons, whereas others seem to reflect operon mosaicity.</p>
            <p>Probable horizontal transfer of whole operons or large portions of operons, when phylogenetic trees for all genes in a predicted operon had the same topology (which, however, was incompatible with the species tree) was identified in 35 neighborhoods - approximately one third of all analyzed neighborhoods. These events were classified into three categories: acquisition of a new (for the given lineage) operon, paralogous operon acquisition and xenologous operon displacement <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Examples of all these classes of apparent operon transfer events are given in Table <tblr tid="T1">1</tblr>. These 35 neighborhoods generally represented functional classes of genes known to be prone to HGT: transporters, general metabolism-related genes and signal transduction systems <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B15">15</abbr><abbr bid="B17">17</abbr></abbrgrp>. This seems to be a relatively low level of horizontal transfer in view of the purported selfish behavior of operons <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. However, the strict threshold, described above, on the detection of conserved gene pairs undoubtedly led to many horizontally transferred operons being missed. Thus, the present analysis gives a conservative low bound of operon transfer.</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Examples of horizontally transferred operons</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Operon</p>
                     </c>
                     <c ca="left">
                        <p>Recipient organism and correspondent genes</p>
                     </c>
                     <c ca="left">
                        <p>Probable source</p>
                     </c>
                     <c ca="left">
                        <p>Other probable recipients</p>
                     </c>
                     <c ca="left">
                        <p>Comment</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5" ca="left">
                        <p>
                           <b>Operon acquisition</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pyruvate:ferredoxin oxidoreductase</p>
                     </c>
                     <c ca="left">
                        <p><it>Thermotoga maritima </it>TM0015-TM0018</p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="left">
                        <p>Aae, Hpy, Bha/Sau</p>
                     </c>
                     <c ca="left">
                        <p>Apparently, the related operon for 2-oxoisovalerate oxidoreductase (TM1758-TM1759) was also transferred from archaea</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sulfate/molybdate transport</p>
                     </c>
                     <c ca="left">
                        <p><it>Bacillus halodurans </it>BH3128-BH3130</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>No other such operons in <it>Bacillus</it>-<it>Clostridium </it>group members</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Putative effector of murein hydrolase</p>
                     </c>
                     <c ca="left">
                        <p><it>Pyrococcus horikoshii </it>PH1801-PH1802</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Pab, Mac</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Allophanate hydrolase subunits</p>
                     </c>
                     <c ca="left">
                        <p><it>Pyrococcus horikoshii </it>PH0987-PH0988</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Pab</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5" ca="left">
                        <p>
                           <b>Paralogous operon acquisition</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dipeptide transporter</p>
                     </c>
                     <c ca="left">
                        <p><it>Vibrio cholerae </it>VC0620-VC0616</p>
                     </c>
                     <c ca="left">
                        <p>Thermotoga/Archaea</p>
                     </c>
                     <c ca="left">
                        <p>Tma</p>
                     </c>
                     <c ca="left">
                        <p>It has several another bacterial operons including VC1091-VC1095</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ribonucleotide reductase alpha and beta subunit</p>
                     </c>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp. VNG2384G VNG2383G</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Additional to "archaeal:" Ribonucleotide reductase alpha subunit VNG1644G, beta subunit is apparently lost</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Aromatic amino-acid biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp. VNG0384G VNG0386G</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>Paralogs of this pair are VNG1646G-VNG1647G</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5" ca="left">
                        <p>
                           <b>Xenologous operon displacement</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Histidine biosynthesis suboperon</p>
                     </c>
                     <c ca="left">
                        <p><it>Pseudomonas aeruginosa </it>PA3151-PA3152</p>
                     </c>
                     <c ca="left">
                        <p>Epsilon-Proteobacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Panthothenate synthesis</p>
                     </c>
                     <c ca="left">
                        <p><it>Campylobacter jejuni </it>Cj0297c-Cj0298c</p>
                     </c>
                     <c ca="left">
                        <p>Gram-positive bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA repair SbcDC</p>
                     </c>
                     <c ca="left">
                        <p><it>Vibrio cholerae </it>VCA0520-VCA0521</p>
                     </c>
                     <c ca="left">
                        <p>Gram-positive bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DNA gyrase A and B</p>
                     </c>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp. VNG0887G-VNG0889G</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Hbs, Tac, Tvo, Afu,</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dipeptide transporter</p>
                     </c>
                     <c ca="left">
                        <p><it>Streptococcus pyogenes </it>SPy2000-SPy2004</p>
                     </c>
                     <c ca="left">
                        <p>Gamma-Proteobacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Glutamate synthase complex</p>
                     </c>
                     <c ca="left">
                        <p><it>Thermotoga maritima </it>TM0394-TM0398</p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>There is another homolog for gene TM0397 of possible archaeal origin</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NADH:ubiquinone oxidoreductase</p>
                     </c>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp. VNG0635G-VNG0637G</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Phosphate transporter</p>
                     </c>
                     <c ca="left">
                        <p><it>Methanothermobacter thermoautotrophicum </it>MTH1727-MTH1734</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>In addition, 19 predicted operons with different phylogenetic affinities of the constituent genes, that is, apparent mosaic operons, were identified (Table <tblr tid="T2">2</tblr>). Again, this is definitely a low bound - not only because of the high threshold set for the identification of conserved gene pairs, but also because this number includes only cases that were clearly resolved by phylogenetic tree analysis. In addition, we detected many uncertain cases where the different phylogenetic affinities of genes within an operon were not strongly supported (data not shown); at least some of these are probably also mosaic operons.</p>
            <tbl id="T2" hint_layout="double">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Examples of probable mosaic operons</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="left">
                        <p>Predicted operon</p>
                     </c>
                     <c ca="left">
                        <p>General operon function</p>
                     </c>
                     <c ca="left">
                        <p>Horizontally acquired genes</p>
                     </c>
                     <c ca="left">
                        <p>Probable source of horizontally acquired genes</p>
                     </c>
                     <c ca="left">
                        <p>Functions of horizontally acquired genes</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 1*</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rickettsia prowazekii Rickettsia conorii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RP633-661, RC0980-1008</p>
                     </c>
                     <c ca="left">
                        <p>Ribosomal operon</p>
                     </c>
                     <c ca="left">
                        <p>RP651 RC0998</p>
                     </c>
                     <c ca="left">
                        <p>Chlamydia</p>
                     </c>
                     <c ca="left">
                        <p>L29</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aquifex aeolicus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Aq001-021</p>
                     </c>
                     <c ca="left">
                        <p>Ribosomal operon</p>
                     </c>
                     <c ca="left">
                        <p>Aq018a</p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="left">
                        <p>L29</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 2</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rickettsia prowazekii Rickettsia conorii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RP800-804, RC1234-1238</p>
                     </c>
                     <c ca="left">
                        <p>F0F1-type ATPase</p>
                     </c>
                     <c ca="left">
                        <p>RP804 RC1238 Gram-positive bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Delta subunit</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ureaplasma urealyticum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>UU128-138</p>
                     </c>
                     <c ca="left">
                        <p>F0F1-type ATPase</p>
                     </c>
                     <c ca="left">
                        <p>UU128, UU132_1, UU133, UU134</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Epsilon subunit, alpha subunit, delta subunit, delta subunit</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium leprae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ML1139-1146</p>
                     </c>
                     <c ca="left">
                        <p>F0F1-type ATPase</p>
                     </c>
                     <c ca="left">
                        <p>ML1139</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>A chain protein</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 3</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rickettsia prowazekii Rickettsia conorii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RP134-139, RC175-180</p>
                     </c>
                     <c ca="left">
                        <p>Ribosomal proteins, transcription antiterminator, SecE</p>
                     </c>
                     <c ca="left">
                        <p>RP134 RC175</p>
                     </c>
                     <c ca="left">
                        <p>Gram-positive bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Preprotein translocase subunit SecE</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 5</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aquifex aeolicus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Aq1968_1_2 two domains</p>
                     </c>
                     <c ca="left">
                        <p>Histidine biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Phosphoribosyl-AMP cyclohydrolase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 8</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanococcus jannaschii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MJ1037-1038</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>MJ1037</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan synthase beta chain</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanobacterium thermoautotrophicum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MTH1655-1661</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>MTH1660</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan synthase alpha chain</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp.</p>
                     </c>
                     <c ca="left">
                        <p>VNG0305-0309</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>VNG0307G</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan synthase beta chain</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bacillus subtilis Bacillus halodurans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PabB-folK BH0090-0095</p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>PabB, BH0090</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Anthranilate/para-aminobenzoate synthases component I</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 9</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp.</p>
                     </c>
                     <c ca="left">
                        <p>VNG0635G-0647G</p>
                     </c>
                     <c ca="left">
                        <p>NADH:ubiquinone oxidoreductase</p>
                     </c>
                     <c ca="left">
                        <p>VNG0640G</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>NADH dehydrogenase-like protein</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 18</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rickettsia prowazekii Rickettsia conorii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RP423-425, RC0588-0590</p>
                     </c>
                     <c ca="left">
                        <p>Lipid metabolism</p>
                     </c>
                     <c ca="left">
                        <p>RP425, RC0590</p>
                     </c>
                     <c ca="left">
                        <p>Spirochetes</p>
                     </c>
                     <c ca="left">
                        <p>Undecaprenyl pyrophosphate synthase</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 27</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp.</p>
                     </c>
                     <c ca="left">
                        <p>VNG1306G-1310G</p>
                     </c>
                     <c ca="left">
                        <p>Succinate dehydrogenase/fumarate reductase</p>
                     </c>
                     <c ca="left">
                        <p>VNG1310G</p>
                     </c>
                     <c ca="left">
                        <p>Actinobacteria</p>
                     </c>
                     <c ca="left">
                        <p>Succinate dehydrogenase subunit C</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 29</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma genitalium Mycoplasma pneumoniae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MG461-466 MPN677-682</p>
                     </c>
                     <c ca="left">
                        <p>Housekeeping</p>
                     </c>
                     <c ca="left">
                        <p>MG466 MPN682</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Ribosomal protein L34</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 34</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TM0548-0556</p>
                     </c>
                     <c ca="left">
                        <p>Leucine/isoleucine biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>TM0552 TM0555 TM0554</p>
                     </c>
                     <c ca="left">
                        <p>2-Isopropylmalate synthase 3-Isopropylmalate dehydratase, small subunit 3-Isopropylmalate dehydratase, large subunit</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Pyrococcus abyssi</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PAB888-895</p>
                     </c>
                     <c ca="left">
                        <p>PAB0890 PAB0893</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>2-Isopropylmalate synthase (<it>LeuA</it>-1) 3-Isopropylmalate dehydrogenase (<it>LeuB</it>)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Clostridium acetobutylicum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CAC3169-3174</p>
                     </c>
                     <c ca="left">
                        <p>Leucine/isoleucine biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>CAC3172 CAC3173 CAC3174 Archaea</p>
                     </c>
                     <c ca="left">
                        <p>3-Isopropylmalate dehydratase, small subunit 3-Isopropylmalate dehydratase, large subunit 2-Isopropylmalate synthase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 41</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TM1243-1251</p>
                     </c>
                     <c ca="left">
                        <p>Nucleotide metabolism</p>
                     </c>
                     <c ca="left">
                        <p>TM1243</p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="left">
                        <p>Phosphoribosylaminoimidazole-succinocarboxamide synthase</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 42</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Lactococcus lactis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>L0104-0108</p>
                     </c>
                     <c ca="left">
                        <p>Arginine biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>L0107</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Acetylglutamate kinase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TM1780-1785</p>
                     </c>
                     <c ca="left">
                        <p>Arginine biosynthesis TM1784</p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="left">
                        <p>Acetylglutamate kinase</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 48</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Borrelia burgdorferi</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>BB0054-0061</p>
                     </c>
                     <c ca="left">
                        <p>Carbohydrate metabolism (glycolysis, gluconeogenesis)</p>
                     </c>
                     <c ca="left">
                        <p>BB0057</p>
                     </c>
                     <c ca="left">
                        <p>Gram-positive bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Glyceraldehyde-3-phosphate dehydrogenase</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 54</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermotoga maritima</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>TM1780-1785</p>
                     </c>
                     <c ca="left">
                        <p>Arginine biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>TM1780</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Argininosuccinate synthase</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 63</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma pneumoniae Mycoplasma genitalium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MPN573-574 MG391-392</p>
                     </c>
                     <c ca="left">
                        <p>Molecular chaperones</p>
                     </c>
                     <c ca="left">
                        <p>MPN574 MG393</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Heat shock protein (groES)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 82</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma pneumoniae, Mycoplasma genitalium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MPN535-536 MG358-359</p>
                     </c>
                     <c ca="left">
                        <p>DNA replication, recombination and repair</p>
                     </c>
                     <c ca="left">
                        <p>MPN536 MG359</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Holliday junction resolvasome helicase subunit</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ureaplasma urealyticum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>UU448-449</p>
                     </c>
                     <c ca="left">
                        <p>DNA replication, recombination and repair</p>
                     </c>
                     <c ca="left">
                        <p>UU448</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Holliday junction resolvasome helicase subunit</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 86</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp.</p>
                     </c>
                     <c ca="left">
                        <p>VNG6305CC-6306C</p>
                     </c>
                     <c ca="left">
                        <p>Tetrahydrobiopterin biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>VNG6305C</p>
                     </c>
                     <c ca="left">
                        <p>Gram-negative bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Organic radical activating enzyme</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 87</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Halobacterium </it>sp.</p>
                     </c>
                     <c ca="left">
                        <p>VNG0582C-0586C</p>
                     </c>
                     <c ca="left">
                        <p>Energy production and conversion</p>
                     </c>
                     <c ca="left">
                        <p>VNG0582, VNG0583G</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>Cytochrome b subunit of the bc complex Cytochrome b subunit of the bc complex</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cluster 103</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Archaeoglobus fulgidus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>AF0321-0325</p>
                     </c>
                     <c ca="left">
                        <p>Lipopolysaccharide biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>AF0323b</p>
                     </c>
                     <c ca="left">
                        <p>Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Deinococcus radiodurans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>DRA0037-DRA0044</p>
                     </c>
                     <c ca="left">
                        <p>Lipopolysaccharide biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>DRA0044</p>
                     </c>
                     <c ca="left">
                        <p>Archaea</p>
                     </c>
                     <c ca="left">
                        <p>dTDP-4-dehydrorhamnose epimerase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Methanothermobacter thermoautotrophicus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MTH1789-1792</p>
                     </c>
                     <c ca="left">
                        <p>Lipopolysaccharide biosynthesis</p>
                     </c>
                     <c ca="left">
                        <p>MTH1789, MTH1790, MTH1791</p>
                     </c>
                     <c ca="left">
                        <p>Gram-positive bacteria Bacteria Bacteria</p>
                     </c>
                     <c ca="left">
                        <p>dTDP-D-glucose 4,6-dehydratase dTDP-4-dehydrorhamnose 3,5-epimerase dTDP-glucose pyrophosphorylase</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>*The numbering of gene clusters is from the previously published analysis of gene neighborhoods in prokaryotic genomes <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
               </tblfn>
            </tbl>
            <p>Below we describe in greater detail several case studies of putative mosaic operons; in each of these cases, in addition to the basic set of 41 species, we included in the analysis the apparent orthologs of the respective proteins from all prokaryotic species in which they were detected, in order to control for possible effects of taxon sampling. We found that, although the details of tree topology inevitably depended on the set of species analyzed, the conclusions regarding HGT were not affected by the inclusion of additional species.</p>
         </sec>
         <sec>
            <st>
               <p>Case studies of mosaic operons</p>
            </st>
            <sec>
               <st>
                  <p>Ribosomal protein L29 gene</p>
               </st>
               <p>In the previous study that prompted this work, we analyzed the phylogeny of several ribosomal proteins and found several cases of apparent horizontal transfer resulting in mosaic operon organization <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Horizontal transfer "in the heart of the ribosome" also has been independently described by others <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Here we report another case of a ribosomal protein operon with apparent <it>in situ </it>gene displacement (that is, displacement without change of the local gene arrangement) via HGT. Figure <figr fid="F1">1a</figr> shows the highly conserved gene arrangement around the gene for the large subunit protein L29. The phylogenetic trees for the flanking <it>L16 </it>and <it>S17 </it>genes showed largely congruent topologies without any indications of HGT (Figure <figr fid="F1">1b,d</figr>). In contrast in the L29 tree, unexpected clustering is seen for <it>Aquifex aeolicus </it>and both <it>Rickettsia</it>: the <it>Aquifex </it>branch is within the archaeal cluster, whereas the <it>Rickettsia </it>group is with <it>Chlamydia</it>, rather than with the rest of alpha-proteobacteria: the taxon where <it>Rickettsia </it>belong (Figure <figr fid="F1">1c</figr>). <it>In situ </it>displacement is the most likely mechanism behind this observation given that the structure of this operon is conserved in the majority of bacteria. The nature of the selective advantages conferred by this gene substitution is unclear, but the apparent sources of the transferred genes suggest that the displacements indeed might be adaptive. <it>Aquifex </it>apparently acquired the L29 gene from archaea, which could be related to the adaptation to the hyperthermal conditions, whereas <it>Rickettsia </it>probably captured the gene from other parasitic bacteria, such as <it>Chlamydia</it>. However, these observations also allow a non-adaptationist interpretation, under which the apparent source of acquired genes simply reflects the increased likelihood of gene exchange between the respective organisms due to co-habitation, with chance fixation of some of the transferred genes.</p>
               <fig id="F1">
                  <title>
                     <p>Figure 1</p>
                  </title>
                  <caption>
                     <p>Genes with different phylogenetic affinities in a ribosomal operon from <it>Aquifex aeolicus </it>and <it>Rickettsia prowazekii</it></p>
                  </caption>
                  <text>
                     <p>Genes with different phylogenetic affinities in a ribosomal operon from <it>Aquifex aeolicus </it>and <it>Rickettsia prowazekii</it>. <b>(a) </b>A fragment of ribosomal operon in <it>Aquifex aeolicus </it>(the operon from <it>Thermotoga maritima </it>is shown for comparison), <it>Rickettsia prowazekii </it>and <it>Rickettsia conorrii </it>(operons from other alpha-proteobacteria are shown for comparison). Genes are shown not to scale; the direction of transcription is indicated by arrows and gene numbers/names are given inside each arrow. Orthologous genes are shown by the same color. White arrows show genes in each genome that are unique in this operonic context. Phylogenetic affinity of a gene is shown as a thick colored border on the respective arrow; black denotes belonging to the reference taxon, red denotes not belonging to reference taxon. COG0197 - ribosomal protein L16/L10E; COG0255 - ribosomal protein L29; COG0186 - ribosomal protein S17. For species abbreviations, see Materials and methods. <b>(b) </b>Unrooted maximum-likelihood tree for ribosomal protein L16. Branches supported by bootstrap probability >70% are marked by black circles. Names of the genes from mosaic operons and the respective branches are shown in red. Branches for which the likelihoods of alternative placements were assessed using the RELL method are indicated by circles with numbers (see Table <tblr tid="T3">3</tblr>). <b>(c) </b>Unrooted maximum-likelihood tree for ribosomal protein L29;. the designations are as in Figure <figr fid="F1">1b</figr>. <b>(d) </b>Unrooted maximum-likelihood tree for ribosomal protein S17; the designations are as in Figure <figr fid="F1">1b</figr>.</p>
                  </text>
                  <graphic file="gb-2003-4-9-r55-1"/>
               </fig>
               <tbl id="T3" hint_layout="single">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Kishino-Hasegawa test for the analyzed cases of apparent xenologous gene displacement <it>in situ</it></p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="left">
                           <p>Tree*</p>
                        </c>
                        <c ca="center">
                           <p>Diff lnL<sup>&#8224;</sup></p>
                        </c>
                        <c ca="center">
                           <p>S.E.<sup>&#8225;</sup></p>
                        </c>
                        <c ca="center">
                           <p>RELL-BP<sup>&#167;</sup></p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>L19 original</p>
                        </c>
                        <c ca="center">
                           <p>0.0</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>0.8004</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-12.6</p>
                        </c>
                        <c ca="center">
                           <p>7.7</p>
                        </c>
                        <c ca="center">
                           <p>0.0480</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>3R4</p>
                        </c>
                        <c ca="center">
                           <p>-6.6</p>
                        </c>
                        <c ca="center">
                           <p>6.6</p>
                        </c>
                        <c ca="center">
                           <p>0.1516</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>RuvB </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>0.9631</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-27.1</p>
                        </c>
                        <c ca="center">
                           <p>15.4</p>
                        </c>
                        <c ca="center">
                           <p>0.0369</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>UppS </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>0.9883</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-29.3</p>
                        </c>
                        <c ca="center">
                           <p>12.8</p>
                        </c>
                        <c ca="center">
                           <p>0.0117</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>NuoH </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>0.8336</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-7.4</p>
                        </c>
                        <c ca="center">
                           <p>7.9</p>
                        </c>
                        <c ca="center">
                           <p>0.1664</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>RfbA </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>1.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-151.1</p>
                        </c>
                        <c ca="center">
                           <p>25.0</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>RfbD </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>0.9005</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-17.0</p>
                        </c>
                        <c ca="center">
                           <p>13.3</p>
                        </c>
                        <c ca="center">
                           <p>0.0995</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>LeuA </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>1.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-150.2</p>
                        </c>
                        <c ca="center">
                           <p>25.8</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>3R4</p>
                        </c>
                        <c ca="center">
                           <p>-418.6</p>
                        </c>
                        <c ca="center">
                           <p>31.5</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>5R6</p>
                        </c>
                        <c ca="center">
                           <p>-245.0</p>
                        </c>
                        <c ca="center">
                           <p>27.8</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>LeuB </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>0.9847</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-52.9</p>
                        </c>
                        <c ca="center">
                           <p>18.1</p>
                        </c>
                        <c ca="center">
                           <p>0.0007</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>3R4</p>
                        </c>
                        <c ca="center">
                           <p>-31.7</p>
                        </c>
                        <c ca="center">
                           <p>14.9</p>
                        </c>
                        <c ca="center">
                           <p>0.0146</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>LeuC </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>1.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-302.7</p>
                        </c>
                        <c ca="center">
                           <p>31.6</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>3R4</p>
                        </c>
                        <c ca="center">
                           <p>-439.1</p>
                        </c>
                        <c ca="center">
                           <p>32.1</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p><it>LeuD </it>original</p>
                        </c>
                        <c ca="center">
                           <p>0.00</p>
                        </c>
                        <c ca="center">
                           <p>ML</p>
                        </c>
                        <c ca="center">
                           <p>1.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>1R2</p>
                        </c>
                        <c ca="center">
                           <p>-66.6</p>
                        </c>
                        <c ca="center">
                           <p>17.2</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>3R4</p>
                        </c>
                        <c ca="center">
                           <p>-76.7</p>
                        </c>
                        <c ca="center">
                           <p>16.8</p>
                        </c>
                        <c ca="center">
                           <p>0.0000</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>*The numbers refer to local rearrangements of the tree as indicated on the corresponding figures. <sup>&#8224;</sup>Difference of the Log-likelihoods relative to the best tree. <sup>&#8225;</sup>Standard error of Diff lnL. <sup>&#167;</sup>Bootstrap probability of the given tree calculated using the RELL method (Resampling of Estimated Log-likelihoods).</p>
                  </tblfn>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>The <it>ruvB </it>gene of Mycoplasma</p>
               </st>
               <p>The genes for Holliday junction resolvase subunits <it>RuvA </it>and <it>RuvB </it>form an operon that is conserved in most of the sequenced bacterial genomes (Figure <figr fid="F2">2a</figr>). In the phylogenetic trees for <it>RuvA </it>and <it>RuvB</it>, the branch that includes Ureaplasma and Mycoplasma occupies drastically different positions. In contrast to <it>RuvA</it>, which belongs to the Gram-positive clade as expected (Figure <figr fid="F2">2b</figr>), mycoplasmal <it>RuvB </it>clusters with the epsilon-proteobacteria (<it>Helicobacter </it>and <it>Campylobacter</it>) and the mycoplasma-epsilon-proteobacteria clade further joins alpha-proteobacteria (Figure <figr fid="F2">2c</figr>). This clustering is strongly supported by bootstrap analysis and was shown to be robust using statistical tests of tree topology (Table <tblr tid="T3">3</tblr>). Thus, the <it>ruvB </it>gene seems to have undergone xenologous displacement <it>in situ </it>after the divergence of the mycoplasmal branch from the rest of Gram-positive bacteria. Notably, the gene exchange seems to have occurred between phylogenetically distant parasitic bacteria.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p><it>In situ </it>displacement of the <it>ruvB </it>gene in <it>Mycoplasma</it></p>
                  </caption>
                  <text>
                     <p><it>In situ </it>displacement of the <it>ruvB </it>gene in <it>Mycoplasma</it>. <b>(a) </b>Organization of the Holliday junction resolvasome operon and surrounding genes in bacteria. COG0632 - Holliday junction resolvasome, DNA-binding subunit, COG2255 - Holliday junction resolvasome, DNA-binding subunit, COG0817 - Holliday junction resolvasome, endonuclease subunit, COG0392 - Predicted integral membrane protein, COG0282 - acetate kinase, COG0839 - NADH:ubiquinone oxidoreductase subunit 6 (chain J), COG0244 - ribosomal protein L10, COG0732 - restriction endonuclease S subunits, COG0809 - S-adenosylmethionine:tRNA-ribosyltransferase-isomerase, COG0772 - bacterial cell division membrane protein, COG0624 - acetylornithine deacetylase/succinyl-diaminopimelate desuccinylase and related deacylases, COG1487 - predicted nucleic acid-binding protein, COG1132 - ABC-type multidrug transport system, ATPase and permease components, COG0442 - prolyl-tRNA synthetase, COG0323 - DNA mismatch repair enzyme, COG1408 - predicted phosphohydrolases. The designations are as in Figure <figr fid="F1">1a</figr>. For species abbreviations, see Materials and methods. <b>(b,c) </b>Unrooted maximum-likelihood tree for <it>RuvA </it>(b) and <it>RuvB </it>(c); the designations are as in Figure <figr fid="F1">1b</figr>.</p>
                  </text>
                  <graphic file="gb-2003-4-9-r55-2"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Undecaprenyl pyrophosphate synthase gene in the lipid biosynthesis operon of <it>Rickettsia</it></p>
               </st>
               <p>In <it>Rickettsia</it>, the undecaprenyl pyrophosphate synthase gene (<it>uppS</it>), which belongs to a highly conserved doublet of lipid biosynthesis genes embedded in functionally diverse operons (Figure <figr fid="F3">3a</figr>), clusters with an unexpected assemblage of bacterial orthologs, including those from the spirochete <it>Treponema pallidum </it>and <it>Fusobacterium nucleatum</it>, but not with the 'native' taxon, alpha-proteobacteria (Figure <figr fid="F3">3b,c</figr>). Statistical testing of the tree topology showed that clustering of rickettsial <it>uppS </it>with those from other alpha-proteobacteria is highly unlikely (Table <tblr tid="T3">3</tblr>). The apparent <it>in situ </it>gene displacement of the <it>uppS </it>gene in <it>Rickettsia </it>was accompanied by a breakdown of the operon into three fragments (Figure <figr fid="F3">3a</figr>). The topology of the <it>uppS </it>tree suggests the possibility of multiple HGT events, although only the rickettsial genomes show evidence of gene displacement <it>in situ</it>. The emergence of gene displacement in bacterial parasites is noted here again.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Genes with different phylogenetic affinities in the lipid biosynthesis operon of <it>Rickettsia</it></p>
                  </caption>
                  <text>
                     <p>Genes with different phylogenetic affinities in the lipid biosynthesis operon of <it>Rickettsia</it>. <b>(a) </b>Organization of the lipid biosynthesis operon and surrounding genes in <it>Rickettsia prowazekii </it>and <it>Rickettsia conorrii </it>(operons from three other alpha-proteobacteria are shown for comparison). COG0020 - undecaprenyl pyrophosphate synthase, <it>UppS</it>; COG0575 - CDP-diglyceride synthetase; COG0750 - predicted membrane-associated Zn-dependent proteases; COG0233 - ribosome recycling factor; COG0528 - uridylate kinase; COG0745 - OmpR-like response regulator; COG0642 - signal transduction histidine kinase; COG0729 - outer membrane protein; COG2919 - septum formation initiator; COG0743 - 1-deoxy-D-xylulose 5-phosphate reductoisomerase. The designations are as in Figure <figr fid="F1">1a</figr>. For species abbreviations, see Materials and methods. <b>(b,c) </b>Unrooted maximum-likelihood tree for <it>UppS </it>(b) and <it>CdsA </it>(c); the designations are as in Figure <figr fid="F1">1b</figr>.</p>
                  </text>
                  <graphic file="gb-2003-4-9-r55-3"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>NADH:ubiquinone oxidoreductase subunits in <it>Halobacterium</it> sp</p>
               </st>
               <p>Gene organization in the NADH:ubiquinone oxidoreductase operon is highly conserved in all sequenced archaeal genomes and those of several groups of bacteria (Figure <figr fid="F4">4a</figr>). The <it>nuoI </it>gene of <it>Halobacterium </it>sp. shows an unexpected phylogenetic affinity with proteobacteria (Figure <figr fid="F4">4c</figr>), whereas the neighboring genes have the regular archaeal affinities (Figure <figr fid="F4">4b,d</figr>). The unusual phylogeny of halobacterial NuoI, which was strongly supported by statistical tests (Table <tblr tid="T3">3</tblr>), suggests <it>in situ </it>displacement by a proteobacterial gene. Notably, all three NADH:ubiquinone oxidoreductase subunits of the cyanobacteria unexpectedly grouped within the archaeal clusters of the respective trees (Figure <figr fid="F4">4b-d</figr>). These observations point to a complex history of HGT for the genes encoding all subunits of NADH:ubiquinone oxidoreductase.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p><it>In situ </it>gene displacement in the NADH-ubiquinone oxidoreductase operon in <it>Halobacterium</it></p>
                  </caption>
                  <text>
                     <p><it>In situ </it>gene displacement in the NADH-ubiquinone oxidoreductase operon in <it>Halobacterium</it>. <b>(a) </b>Organization of the NADH-ubiquinone oxidoreductase operon in selected archaeal and bacterial genomes. COG0838 - NADH:ubiquinone oxidoreductase subunit 3 (chain A), COG3077 - DNA-damage-inducible protein J, COG0852 - NADH:ubiquinone oxidoreductase 27 kD subunit, COG0649 - NADH:ubiquinone oxidoreductase 49 kD subunit 7, COG1905 - NADH:ubiquinone oxidoreductase 24 kD subunit, COG1894 - NADH:ubiquinone oxidoreductase, NADH-binding (51 kD) subunit, COG1034 - NADH dehydrogenase/NADH:ubiquinone oxidoreductase 75 kD subunit (chain G), COG1005 - NADH:ubiquinone oxidoreductase subunit 1 (chain H), COG1143 - Formate hydrogenlyase subunit 6/NADH:ubiquinone oxidoreductase 23 kD subunit (chain I), COG0839 - NADH:ubiquinone oxidoreductase subunit 6 (chain J), COG0713 - NADH:ubiquinone oxidoreductase subunit 11 or 4L (chain K), COG1009 - NADH:ubiquinone oxidoreductase subunit 5 (chain L), COG1008 - NADH:ubiquinone oxidoreductase subunit 4 (chain M), COG1007 - NADH:ubiquinone oxidoreductase subunit 2 (chain N). The designations are as in Figure <figr fid="F1">1a</figr>. For species abbreviations, see Materials and methods. <b>(b-d) </b>Unrooted maximum-likelihood tree for <it>NuoH </it>(b), <it>NuoI </it>(c) and <it>NuoJ </it>(d); the designations are as in Figure <figr fid="F1">1b</figr>.</p>
                  </text>
                  <graphic file="gb-2003-4-9-r55-4"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Lipopolysaccharide biosynthesis operon in <it>Methanothermobacter thermoautotrophicus </it>and <it>Deinococcus radiodurans</it></p>
               </st>
               <p>The genes of the lipopolysaccharide biosynthesis (<it>rfbABCD</it>) operon appear to have been extensively and independently shuffled in many prokaryotic genomes and might have undergone multiple horizontal transfers. This conclusion is supported both by examination of the operon organization (Figure <figr fid="F5">5a</figr>) and by phylogenetic tree analysis (Figure <figr fid="F5">5b-e</figr>). The trees showed a clear affinity between the <it>rfbA</it>, <it>rfbB</it>, <it>rfbC </it>genes of <it>Methanothermobacter thermoautotrophicum </it>and <it>Clostridium acetobutylicum </it>(Figure <figr fid="F5">5b-d</figr>), with <it>Fusobacterium nucleatum </it>and <it>Listeria monocytogenes </it>joining the cluster in the case of <it>rfbB </it>(Figure <figr fid="F5">5b</figr>), whereas <it>M. thermoautotrophicum </it>RfbD clustered with its archaeal orthologs as expected (Figure <figr fid="F5">5e</figr>). The genes of the <it>rfbABCD </it>operon in <it>Methanothermobacter </it>are shuffled compared to the probable ancestral order, which is found in many bacteria and <it>C. acetobutylicum </it>also shows a rearrangement (Figure <figr fid="F5">5a</figr>). One likely scenario in this case is that <it>M. thermoautotrophicum </it>acquired the <it>rfbABCD </it>operon with the typical gene order from a bacterium of the clostridial lineage, which was followed by displacement of three resident genes and loss of one of the invading genes, accompanied by operon rearrangement. An alternative scenario is that the rearrangement occurred in the source bacterium of the clostridial group and <it>Methanothermobacter </it>acquired only the <it>rfbACB </it>portion, which might have inserted head-to-tail downstream of the original operon, followed by elimination of the resident <it>rfbABC </it>(Figure <figr fid="F5">5a</figr>).</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>Genes with different phylogenetic affinities in the lipopolysaccharide biosynthesis operon of <it>Methanothermobacter thermoautotrophicus </it>and <it>Deinococcus radiodurans</it></p>
                  </caption>
                  <text>
                     <p>Genes with different phylogenetic affinities in the lipopolysaccharide biosynthesis operon of <it>Methanothermobacter thermoautotrophicus </it>and <it>Deinococcus radiodurans</it>. <b>(a) </b>Organization of the lipopolysaccharide biosynthesis operon in different prokaryotes. COG1091 - dTDP-4-dehydrorhamnose reductase; COG1209 dTDP-glucose pyrophosphorylase; COG1898 - dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes; COG1088 - dTDP-D-glucose 4,6-dehydratase. The designations are as in Figure <figr fid="F1">1a</figr>. For species abbreviations, see Materials and methods. <b>(b-e) </b>Unrooted maximum-likelihood tree for <it>RfbB </it>(b), <it>RfbC </it>(c), <it>RfbA </it>(d) and <it>RfbD </it>(e); the designations are as in Figure <figr fid="F1">1b</figr>.</p>
                  </text>
                  <graphic file="gb-2003-4-9-r55-5"/>
               </fig>
               <p>Another interesting case of mosaic structure of the same operon is seen in <it>Deinococcus radiodurans </it>(Figure <figr fid="F5">5a</figr>). <it>Deinococcus </it>RfbA shows clear affinity with proteobacteria (Figure <figr fid="F5">5d</figr>), whereas RfbD is of archaeal descent (Figure <figr fid="F5">5e</figr>), with RELL analysis revealing no competing topologies (Table <tblr tid="T3">3</tblr>). The remaining two genes of this operon in <it>Deinococcus</it>, <it>rfbB </it>(DRA0041) and <it>rfbC </it>(DRA0043), have uncertain phylogenetic affinities (Figure <figr fid="F5">5b,5c</figr>). Thus, as in the case of <it>M. thermoautotrophicus, </it>this operon in <it>Deinococcus </it>was apparently formed through at least two events of xenologous gene displacement <it>in situ </it>and gene shuffling.</p>
            </sec>
            <sec>
               <st>
                  <p>Leucine/isoleucine biosynthesis operon</p>
               </st>
               <p>Perhaps the most prominent case of mosaic operon organization is the leucine/isoleucine biosynthesis operon of several bacteria and archaea, particularly <it>Thermotoga maritima</it>. This is the only known branched chain amino acid biosynthesis operon, and it is partly conserved in a wide range of bacteria (Figure <figr fid="F6">6a</figr>). Following initial indications from the analysis of taxon-specific BLAST hits, we constructed phylogenetic trees for each of the genes of this operon. Unlike other bacteria, <it>Thermotoga </it>has two <it>leuA </it>paralogs, which are adjacent in the operon. The proteins encoded by these paralogous genes show clearly distinct phylogenetic affinities: TM0552 belongs to a distinct clade within the archaeal domain, whereas TM0553 is part of a Gram-positive bacterial cluster (Figure <figr fid="F6">6b</figr>). This phylogenetic mosaic in <it>Thermotoga </it>extends further, with LeuB (TM0556) clustering with proteobacterial orthologs (Figure <figr fid="F6">6c</figr>), and LeuC (TM0554) and LeuD (TM0555) with archaeal orthologs (Figure <figr fid="F6">6d,e</figr>). All these affinities were strongly supported by two versions of bootstrap analysis (Table <tblr tid="T3">3</tblr>). The genes encoding LeuA, LeuC, and LeuD from <it>Thermotoga</it>, <it>Clostridium</it>, <it>Aquifex </it>and both <it>Pyrococcus abyssi </it>and <it>P. furiosus </it>belong to a well-defined clade, which also includes a medley of alpha-proteobacteria and cyanobacteria, within the archaeal domain in the respective trees (Figure <figr fid="F6">6b-e</figr>). Thus, this sub-operon apparently has been relatively recently horizontally spread among these organisms. <it>Pyrococcus abyssi </it>and <it>P. furiosus </it>probably acquired these genes after the divergence from the common ancestor with <it>P. horikoshii </it>because the latter has only the typical archaeal operon (Figure <figr fid="F6">6a</figr>).</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Genes with different phylogenetic affinities in the leucine/isoleucine biosynthesis operon</p>
                  </caption>
                  <text>
                     <p>Genes with different phylogenetic affinities in the leucine/isoleucine biosynthesis operon. <b>(a) </b>Operon organization in different prokaryotic species. COG0028 - acetolactate synthase, large subunit; COG0440 - acetolactate synthase, small subunit; COG0059 - ketol-acid reductoisomerase; COG0129 - dihydroxyacid dehydratase; COG0119 - isopropylmalate synthases; COG0473 - isocitrate/isopropylmalate dehydrogenase; COG0066 - 3-isopropylmalate dehydratase, small subunit; COG0065 - 3-isopropylmalate dehydratase, large subunit. The designations are as in Figure <figr fid="F1">1a</figr>. For species abbreviations, see Materials and methods. <b>(b-e) </b>Unrooted maximum-likelihood tree for <it>LeuA </it>(b), <it>LeuB </it>(c), <it>LeuC </it>(d) and <it>LeuD </it>(e); the designations are as in Figure <figr fid="F1">1b</figr>.</p>
                  </text>
                  <graphic file="gb-2003-4-9-r55-6"/>
               </fig>
               <p>Given the apparent propensity of <it>Thermotoga </it>(and other hyperthermophilic bacteria) for acquisition of archaeal genes via HGT, it seems most likely that the archaeal version of the <it>leuACD </it>suboperon originally entered the bacterial domain via <it>Thermotoga </it>or a related thermophilic bacterium. Formally, in <it>Thermotoga </it>these events could be classified as a combination of paralogous (sub)operon acquisition (TM0554-TM0555 in addition to another paralogous archaeal gene pair TM0291-TM0292) and xenologous gene displacements (genes TM0553, TM0556). In <it>Clostridium</it>, xenologous operon displacement seems to have occurred because the ancestral operon of the Gram-positive type apparently had been lost. The subsequent evolution of this operon in the four organisms proceeded along different paths. <it>Aquifex </it>has lost the operon structure even for the two subunits of 3-isopropylmalate dehydratase (<it>LeuB</it>, <it>LeuD</it>). Different genes in the operons of <it>P. abyssi </it>and <it>C. acetobutylicum </it>have been translocated and several genes probably have been independently accrued (Figure <figr fid="F6">6a</figr>). In both <it>P. abyssi </it>and <it>Thermotoga</it>, the original <it>leuA </it>and <it>leuB </it>genes within the <it>leuABDC </it>core seem to have been independently displaced by bacterial orthologs without a clear affinity with any specific bacterial lineage (Figure <figr fid="F6">6a</figr>). The most likely scenario for evolution of this operon in <it>Thermotoga </it>is that it originated as a Gram-positive type operon and subsequently many genes (or sub-operons) have been displaced <it>in situ </it>through multiple horizontal transfers and a few additional genes have been inserted into the preexisting structure. The alternative but less likely hypothesis involves independent, <it>de novo </it>operon assembly from genes of different phylogenetic affinities. Several other apparent HGT events were detected during the analysis of the phylogenetic trees for leucine biosynthesis genes (DR1614 in <it>LeuD </it>tree, DR1610 in <it>LeuC </it>tree (Figure <figr fid="F6">6d,e</figr>)) but, in these cases, the acquired genes do not belong to conserved operons.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Intragenomic plasticity and inter-species horizontal mobility of operons are thought to be important facets of prokaryotic genome evolution. Indeed, the results presented here indicate that horizontal transfer of entire operons is the most likely explanation for most of the findings of co-localized 'alien' genes in a genome, which is generally consistent with SOM. However, a substantial fraction - approximately 35% - of operons with indications of horizontal transfer events appear to consist of genes with different phylogenetic affinities. Barring artifacts of phylogenetic analysis, which can never be ruled out completely, but appear unlikely given the strong statistical support for the anomalous placement of the genes in question in phylogenetic trees, two evolutionary scenarios for the origin of such mosaic operons are conceivable. The first involves <it>de novo </it>assembly of operons, in part from genes acquired via HGT, whereas the second one postulates <it>in situ </it>xenologous displacement of genes within a resident operon. Analysis of mosaic operons suggested that both scenarios might apply, but <it>in situ </it>displacement is likely to be more frequent. In several cases, <it>in situ </it>displacement seems to have occurred between genomes of distantly related parasitic bacteria that might have shared a host. A sequence of events that is often considered as an alternative to HGT is an ancient duplication with subsequent differential loss of paralogs. However, in the cases analyzed here, this seems to be a particularly remote possibility because a tandem duplication followed by lengthy evolution of both paralogs within the operon would be required to mimic <it>in situ </it>displacement. Tandem pairs of paralogs are uncommon in operons and such a 'smoking gun' was not observed in any of the suspected cases of <it>in situ </it>displacement.</p>
         <p>At first glance, <it>in situ </it>gene displacement seems highly unlikely: given the vast evolutionary distance separating the donor and recipient genomes, homologous recombination is out of the question. In cases when the displacing gene(s) is located on the periphery of an operon (for example, Figure <figr fid="F5">5a</figr>), a plausible mechanism could involve initial insertion of the invading gene in the vicinity of the resident operon, followed by deletion of intervening genes (provided these are non-essential). However, when the displacing gene is tucked between resident ones (for example, Figures <figr fid="F4">4a</figr>, <figr fid="F6">6a</figr>), displacement must have occurred with surgical precision. The only conceivable explanation seems to be that HGT is extremely common in the evolution of prokaryotes and so is intragenomic recombination, which provides for rare chance occurrences of <it>in situ </it>displacement. Conceivably, a horizontally acquired gene that displaces the resident ortholog without disruption of operon organization would have its chances of evolutionary fixation greatly increased, hence the apparent disproportional survival of the displacing genes. This explanation does not refute SOM as the conceptual framework explaining the origin of operons but emphasizes the 'altruistic' aspect of the evolution of operons whereby the operon integrity is maintained by strong purifying selection at the organism level.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Sequence data</p>
            </st>
            <p>Amino acid sequences from 41 completely sequenced prokaryotic genomes were extracted from the Genome division of the Entrez retrieval system <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> and used as the master species set for this analysis. Bacterial species abbreviations: <it>Aquifex aeolicus </it>(Aae), <it>Bacillus halodurans </it>(Bha), <it>Bacillus subtilis </it>(Bsu), <it>Streptococcus pyogenes </it>(Spy), <it>Staphylococcus aureus </it>(Sau), <it>Clostridium acetobutylicum </it>(Cac), <it>Borrelia burgdorferi </it>(Bbu), <it>Campylobacter jejunii </it>(Cje), <it>Chlamydia trachomatis </it>(Ctr), <it>Chlamydophila pneumoniae </it>(Cpn), <it>Deinococcus radiodurans </it>(Dra), <it>Escherichia coli </it>(Eco), <it>Haemophilus influenzae </it>(Hin), <it>Helicobacter pylori </it>(Hpy), <it>Lactococcus lactis </it>(Lla), <it>Mesorhizobium loti </it>(Mlo), <it>Mycoplasma genitalium </it>(Mge), <it>Mycoplasma pneumoniae </it>(Mpn), <it>Mycobacterium tuberculosis </it>(Mtu), <it>Mycobacterium leprae </it>(Mle), <it>Pasteurella multocida </it>(Pmu), <it>Neisseria meningitidis </it>(Nme), <it>Pseudomonas aeruginosa </it>(Pae), <it>Rickettsia prowazekii </it>(Rpr), <it>Rickettsia conorii </it>(Rco), <it>Synechocystis </it>PCC6803 (Ssp), <it>Thermotoga maritima </it>(Tma), <it>Treponema pallidum </it>(Tpa), <it>Vibrio cholerae </it>(Vch), <it>Xylella fastidiosa </it>(Xfa), <it>Buchnera </it>sp. (Bsp), <it>Caulobacter crescentus </it>(Ccr), and <it>Ureaplasma urealyticum </it>(Uur). Archaeal species abbreviations: <it>Aeropyrum pernix </it>(Ape), <it>Archaeoglobus fulgidus </it>(Afu), <it>Halobacterium </it>sp. (Hsp), <it>Methanothermobacter thermoautotrophicum </it>(Mth), <it>Methanococcus jannaschii </it>(Mja), <it>Pyrococcus horikoshii </it>(Pho), <it>Pyrococcus abyssi </it>(Pab), <it>Thermoplasma volcanium </it>(Tvo), <it>Thermoplasma acidophilum </it>(Tac), <it>Sulfolobus solfataricus </it>(Sso). In addition, the following species were included in the case studies described in the text; bacteria: <it>Agrobacterium tumefaciens </it>(Atu), <it>Bifidobacterium longum </it>(Blo), <it>Brucella melitensis </it>(Rso), <it>Chlorobium tepidum </it>(Cte), <it>Enterococcus faecalis </it>(Efa), <it>Fusobacterium nucleatum </it>(Fnu), <it>Lactobacillus plantarum </it>(Lpl), <it>Leptospira interrogans serovar </it>(Lint), <it>Listeria innocua </it>(Lin), <it>Listeria monocytogenes </it>(Lmo), <it>Nitrosomonas europaea </it>(Neu), <it>Nostoc </it>sp. (Nsp), <it>Oceanobacillus iheyensis </it>(Oih), <it>Ralstonia solanacearum </it>(Rso), <it>Sinorhizobium meliloti </it>(Sme), <it>Streptomyces coelicolor </it>(Sco), <it>Thermoanaerobacter tengcongensis </it>(Tte), <it>Thermosynechococcus elongatus </it>(Tel), <it>Xanthomonas campestris </it>(Xca), <it>Shewanella oneidensis </it>(Son); archaea: <it>Methanopyrus kandleri </it>(Mka), <it>Methanosarcina acetivorans </it>(Mac), <it>Pyrobaculum aerophilum </it>(Pae), <it>Pyrococcus furiosus </it>(Pfu).</p>
         </sec>
         <sec>
            <st>
               <p>Reconstruction of gene neighborhoods</p>
            </st>
            <p>Gene neighborhoods for the 41 compared genomes were reconstructed as previously described <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Briefly, the collection of clusters of orthologous groups of proteins from complete genomes (COGs) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> was used as the source of information on orthologous relationships for detecting conserved gene pairs. For the purpose of this analysis only 'highly conserved' gene pairs were considered, that is, those formed by genes from two COGs that were present in the same orientation and separated by less than three genes in at least 10 of the compared genomes. This conservative approach was adopted in order to ensure that all analyzed gene pairs belong to the same operon. At the next step, overlapping gene pairs were joined in triplets; each triplet was required to exist in at least one genome. Overlapping triplets were used to construct gene arrays by run search in an oriented graph; a gene array may or may not be found in its entirety in any available genome. Finally, gene arrays that shared at least three COGs were clustered into neighborhoods by using a single-linkage clustering algorithm <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Conserved gene pairs that did not belong to the reconstructed gene arrays were also analyzed.</p>
         </sec>
         <sec>
            <st>
               <p>Searching for candidate horizontally transferred genes</p>
            </st>
            <p>The protein sequences encoded by the genes of each neighborhood were searched against the non-redundant protein sequence database (NCBI, NIH, Bethesda) using the BLASTP program. The BLAST hits were analyzed to identify their potential phylogenetic affinity. For each protein, the best hits were identified to the taxon to which the given species belongs (hereinafter, reference taxon) and to other major taxa; hits to closely related species were disregarded (see Table 1S in the additional data file). Proteins that had more significant (lower E-value) hits to a non-reference taxon than to the reference taxon were considered candidates for horizontal transfer and the respective orthologous protein clusters were subject to further phylogenetic analysis as described in the next section. If phylogenetic analysis indicated that a particular gene was likely to be horizontally transferred, phylogenetic trees were built also for the genes predicted to belong to the same operon. When different phylogenetic affinities were found for genes of the same predicted operon, this operon was considered to be 'mosaic'.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>Multiple protein sequence alignments were constructed using the T-Coffee program <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and positions containing >70% gaps were excluded. Distance trees were constructed by using the least-square method as implemented in the FITCH program of the PHYLIP package <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. The least-square trees were subjected to maximum-likelihood local rearrangement using the ProtML program of the MOLPHY package, with the JTT-F model of amino acid substitutions <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. The resulting trees are a surrogate for maximum-likelihood phylogenies; exhaustive maximum-likelihood tree construction is impractical for the number of species analyzed here. Bootstrap analysis was performed for each maximum-likelihood tree using the Resampling of Estimated Log-Likelihoods (RELL) method as implemented in MOLPHY <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. Alternative placements of selected clades in maximum-likelihood trees were compared by using the rearrangement optimization (Kishino-Hasegawa) method as implemented in the ProtML program <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Additional data file</p>
         </st>
         <p>Additional data, including schematics of operon organization and phylogenetic trees for all gene clusters listed in Table <tblr tid="T2">2</tblr>, are available in an additional data file (Additional data file <supplr sid="s1">1</supplr>).</p>
         <suppl id="s1">
            <title>
               <p>Additional data file 1</p>
            </title>
            <caption>
               <p>Additional data, including schematics of operon organization and phylogenetic trees for all gene clusters listed in Table 2</p>
            </caption>
            <text>
               <p>Additional data, including schematics of operon organization and phylogenetic trees for all gene clusters listed in Table 2</p>
            </text>
            <file name="gb-2003-4-9-r55-s1.doc">
               <p>Click here for additional data file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Jeffrey Lawrence for critical reading of the manuscript. Marina V. Omelchenko is supported by a grant from the US Department of Energy (Office of Biological and Environmental Research, Office of Science) grants DE-FG02 01ER63220 from the Genomes to Life Program.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Genetic regulatory mechanisms in the synthesis of proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Jacob</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Monod</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1961</pubdate>
            <volume>3</volume>
            <fpage>318</fpage>
            <lpage>356</lpage>
            <xrefbib>
               <pubid idtype="pmpid">13718526</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WSE</fnm>
               </au>
            </aug>
            <source>The Operon.</source>
            <publisher>Cold Spring Harbor, NY: Cold Spring Harbor Laboratory</publisher>
            <pubdate>1978</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Gene order is not conserved in bacterial evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <fpage>289</fpage>
            <lpage>290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-9525(96)20006-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">8783936</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Conservation of gene order: a fingerprint of proteins that physically interact.</p>
            </title>
            <aug>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1998</pubdate>
            <volume>23</volume>
            <fpage>324</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0968-0004(98)01274-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9787636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Genome plasticity as a paradigm of eubacteria evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <fpage>S57</fpage>
            <lpage>S64</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9395406</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Evolutionary instability of operon structures disclosed by sequence comparisons of complete microbial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Itoh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Takemoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gojobori</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>332</fpage>
            <lpage>346</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10331260</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context.</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Kondrashov</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>356</fpage>
            <lpage>372</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.GR-1619R</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Shared strategies in gene organization among prokaryotes and eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>110</volume>
            <fpage>407</fpage>
            <lpage>413</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12202031</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Selfish operons: horizontal transfer may drive the evolution of gene clusters.</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1996</pubdate>
            <volume>143</volume>
            <fpage>1843</fpage>
            <lpage>1860</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8844169</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Selfish operons: the evolutionary impact of gene clustering in prokaryotes and eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>642</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(99)00025-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607610</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Selfish operons and speciation by gene transfer.</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>5</volume>
            <fpage>355</fpage>
            <lpage>359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0966-842X(97)01110-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9294891</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Evolution of antibiotic resistance genes: the DNA sequence of a kanamycin resistance gene from <it>Staphylococcus aureus</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Gray</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1983</pubdate>
            <volume>1</volume>
            <fpage>57</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">6100986</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Horizontal gene transfer in prokaryotes - quantification and classification.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>55</volume>
            <fpage>709</fpage>
            <lpage>742</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.55.1.709</pubid>
                  <pubid idtype="pmpid" link="fulltext">11544372</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Evidence for massive gene exchange between archaeal and bacterial hyperthermophiles</p>
            </title>
            <aug>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Walker</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>442</fpage>
            <lpage>444</lpage>
            <note>A published erratum appears in <it>Trends Genet </it>1998, <b>15:</b>41.</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(98)01553-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9825671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Horizontal gene transfer among genomes: the complexity hypothesis.</p>
            </title>
            <aug>
               <au>
                  <snm>Jain</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rivera</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Lake</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>3801</fpage>
            <lpage>3806</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.96.7.3801</pubid>
                  <pubid idtype="pmpid" link="fulltext">10097118</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Evidence for lateral gene transfer between archaea and bacteria from genome sequence of <it>Thermotoga maritima</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>399</volume>
            <fpage>323</fpage>
            <lpage>329</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/20601</pubid>
                  <pubid idtype="pmpid" link="fulltext">10360571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Lateral genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Trends Cell Biol</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>M5</fpage>
            <lpage>M8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0962-8924(99)01664-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">10611671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Horizontal gene transfer in bacterial and archaeal complete genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia-Vallve</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Romeu</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Palau</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>1719</fpage>
            <lpage>1725</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.130000</pubid>
                  <pubid idtype="pmpid" link="fulltext">11076857</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Prokaryotic evolution in light of gene transfer.</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>2226</fpage>
            <lpage>2238</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12446813</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Two C or not two C: recurrent disruption of Zn-ribbons, gene duplication, lineage-specific gene loss, and horizontal gene transfer in evolution of bacterial ribosomal proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Ponomarev</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>research0033.1</fpage>
            <lpage>0033.14</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2001-2-9-research0033</pubid>
                  <pubid idtype="pmpid" link="fulltext">11574053</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The evolutionary history of ribosomal protein RpS14: horizontal gene transfer at the heart of the ribosome.</p>
            </title>
            <aug>
               <au>
                  <snm>Brochier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>529</fpage>
            <lpage>533</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02142-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">11102698</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Eubacterial phylogeny based on translational apparatus proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Brochier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bapteste</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>1</fpage>
            <lpage>5</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02522-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11750686</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Mosaic structure and molecular evolution of the leukotoxin operon (lktCABD) in <it>Mannheimia </it>(<it>Pasteurella</it>) <it>haemolytica</it>, <it>Mannheimia glucosida</it>, and <it>Pasteurella trehalosi</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Davies</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Whittam</snm>
                  <fnm>TS</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>266</fpage>
            <lpage>277</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/JB.184.1.266-277.2002</pubid>
                  <pubid idtype="pmpid" link="fulltext">11741868</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Genetics of lipopolysaccharide biosynthesis in enteric bacteria.</p>
            </title>
            <aug>
               <au>
                  <snm>Schnaitman</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Klena</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Microbiol Rev</source>
            <pubdate>1993</pubdate>
            <volume>57</volume>
            <fpage>655</fpage>
            <lpage>682</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7504166</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Connected gene neighborhoods in prokaryotic genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Makarova</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Murvai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Czabarka</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Szekely</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>2212</fpage>
            <lpage>2223</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/30.10.2212</pubid>
                  <pubid idtype="pmpid" link="fulltext">12000841</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene.</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>3442</fpage>
            <lpage>3444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/28.18.3442</pubid>
                  <pubid idtype="pmpid" link="fulltext">10982861</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Entrez Genome</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov:80/PMGifs/Genomes/org.html</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The COG database: new developments in phylogenetic classification of proteins from complete genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Garkavtsev</snm>
                  <fnm>IV</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Shankavaram</snm>
                  <fnm>UT</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>22</fpage>
            <lpage>28</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/29.1.22</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>T-Coffeee: a novel method for fast and accurate multiple sequence alignment.</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Construction of phylogenetic trees.</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Margoliash</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1967</pubdate>
            <volume>155</volume>
            <fpage>279</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubid idtype="pmpid">5334057</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods.</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>1996</pubdate>
            <volume>266</volume>
            <fpage>418</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8743697</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>On the maximum likelihood method in molecular phylogenetics.</p>
            </title>
            <aug>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kishino</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Saitou</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1991</pubdate>
            <volume>32</volume>
            <fpage>443</fpage>
            <lpage>445</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1904100</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>MOLPHY: programs for molecular phylogenetics.</p>
            </title>
            <aug>
               <au>
                  <snm>Adachi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>In Computer Science Monographs 27</source>
            <publisher>Tokyo: Institute of Statistical Mathematics</publisher>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Maximum likelihood inference of protein phylogeny and the origin of chloroplasts.</p>
            </title>
            <aug>
               <au>
                  <snm>Kishino</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Miyata</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1990</pubdate>
            <volume>31</volume>
            <fpage>151</fpage>
            <lpage>160</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
