<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1741-7007-2-15</ui>
   <ji>1741-7007</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Inter-genomic displacement via lateral gene transfer of bacterial <it>trp </it>operons in an overall context of vertical genealogy</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Xie</snm>
               <fnm>Gary</fnm>
               <insr iid="I1"/>
               <email>xie@lanl.gov</email>
            </au>
            <au id="A2">
               <snm>Bonner</snm>
               <mi>A</mi>
               <fnm>Carol</fnm>
               <insr iid="I2"/>
               <email>cbonner@ufl.edu</email>
            </au>
            <au id="A3">
               <snm>Song</snm>
               <fnm>Jian</fnm>
               <insr iid="I1"/>
               <email>jian@lanl.gov</email>
            </au>
            <au id="A4">
               <snm>Keyhani</snm>
               <mi>O</mi>
               <fnm>Nemat</fnm>
               <insr iid="I2"/>
               <email>keyhani@ufl.edu</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Jensen</snm>
               <mi>A</mi>
               <fnm>Roy</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>rjensen@ufl.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Los Alamos National Laboratory, Los Alamos, New Mexico, 87544, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Microbiology &amp; Cell Science, University of Florida, PO Box 110700, Gainesville, Florida, 32611, USA</p>
            </ins>
         </insg>
         <source>BMC Biology</source>
         <issn>1741-7007</issn>
         <pubdate>2004</pubdate>
         <volume>2</volume>
         <issue>1</issue>
         <fpage>15</fpage>
         <url>http://www.biomedcentral.com/1741-7007/2/15</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15214963</pubid>
               <pubid idtype="doi">10.1186/1741-7007-2-15</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>27</day>
               <month>1</month>
               <year>2004</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>6</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>6</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Xie et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The growing conviction that lateral gene transfer plays a significant role in prokaryote genealogy opens up a need for comprehensive evaluations of gene-enzyme systems on a case-by-case basis. Genes of tryptophan biosynthesis are frequently organized as whole-pathway operons, an attribute that is expected to facilitate multi-gene transfer in a single step. We have asked whether events of lateral gene transfer are sufficient to have obscured our ability to track the vertical genealogy that underpins tryptophan biosynthesis.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In 47 complete-genome <it>Bacteria</it>, the genes encoding the seven catalytic domains that participate in primary tryptophan biosynthesis were distinguished from any paralogs or xenologs engaged in other specialized functions. A reliable list of orthologs with carefully ascertained functional roles has thus been assembled and should be valuable as an annotation resource. The protein domains associated with primary tryptophan biosynthesis were then concatenated, yielding single amino-acid sequence strings that represent the entire tryptophan pathway. Lateral gene transfer of several whole-pathway <it>trp </it>operons was demonstrated by use of phylogenetic analysis. Lateral gene transfer of partial-pathway <it>trp </it>operons was also shown, with newly recruited genes functioning either in primary biosynthesis (rarely) or specialized metabolism (more frequently).</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p><b>(i) </b>Concatenated tryptophan protein trees are congruent with 16S rRNA subtrees provided that the genomes represented are of sufficiently close phylogenetic spacing. There are currently seven tryptophan congruency groups in the <it>Bacteria</it>. Recognition of a succession of others can be expected in the near future, but ultimately these should coalesce to a single grouping that parallels the 16S rRNA tree (except for cases of lateral gene transfer). <b>(ii) </b>The vertical trace of evolution for tryptophan biosynthesis can be deduced. The daunting complexities engendered by paralogy, xenology, and idiosyncrasies of nomenclature at this point in time have necessitated an expert-assisted manual effort to achieve a correct analysis. Once recognized and sorted out, paralogy and xenology can be viewed as features that enrich evolutionary histories.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <sec>
            <st>
               <p>The process of LGT</p>
            </st>
            <p>Lateral gene transfer (LGT) undoubtedly occurs with high frequency <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. But what is required for any given LGT event to escape transience and actually survive as a non-vertical contributor to the evolutionary history of a species? (<b>i</b>) Initially the incoming gene(s) must be replicated in the original recipient cell and its immediate progeny. (<b>ii</b>) Selective advantages of the LGT-gene(s) must be sufficient to foster eventual domination of the species population. (<b>iii</b>) During this time a crucial factor influencing survival will be whether the considerable demands imposed by the recipient genome for amelioration of alien genes can be met <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Obviously, there will be less amelioration pressure if the donor and recipient genomes are phylogenetically close and have similar guanine/cytosine base ratios, dinucleotide frequencies, promotor motifs, and so on. Even here a LGT insertion into a genome might be contra-selected for other reasons, for example, if the location of the insertion disrupts the symmetry and physical balance between the origin and terminus points of replication <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. There is a balance in that one can expect alien sources of greatest novelty to be phylogenetically distant, yet genes from such remote sources will usually confront the greatest amelioration pressures. The most obvious candidates as successfully imported alien genes will encode novel functions that confer clear selective value, such as resistance to threatening environmental agents (e.g. antibiotics) or ability to utilize a new source of carbon and energy.</p>
            <p>Enhanced probability of LGT success can be expected if only a single gene is needed for the novel function. This does not necessarily rule out the acquisition of a new function that is complex and multi-genic. Sometimes where existing functions are complex and multi-step, a single incoming gene can create a new metabolic linkage. For example, an organism having a multi-step pathway for tyrosine catabolism could acquire a previously unavailable ability to also catabolize phenylalanine through import of a gene encoding phenylalanine hydroxylase (which converts phenylalanine to tyrosine). In cases where the total repertoire of multiple steps that define a novel function are all absent in a given organism, acquisition of that function by LGT would be highly improbable were it not for the existence of operon modules in prokaryotes. Lawrence has proposed, in fact, that gene organization as operons largely exists because of "selfish" properties that operate at the hierarchical level of genes <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Although genes of <it>L</it>-tryptophan (Trp) biosynthesis, a classic operon system, are completely dispersed in some prokaryote genomes (e.g. <it>Aquifex</it>, <it>Chlorobium</it>, <it>Wolinella </it>and unicellular cyanobacteria such as <it>Synechococcus</it>, <it>Synechocystis</it>, and <it>Prochlorococcus</it>), they usually exist either as whole-pathway operons or as a combination of several partial-pathway operons <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
            <p>Approaches for the detection of LGT events are either phylogenetic or parametric. Parametric approaches include the detection of nucleotide composition, dinucleotide frequencies and codon-usage biases in gene segments that are atypical of the recipient genome. Such parametric analysis is well illustrated by a study of the <it>Escherichia coli </it>genome <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. However, such atypical parametric properties will drift with time toward those of the recipient genome (amelioration). Therefore, parametric analysis is limited to detection of genes that were acquired recently. We have found only a single example where <it>trp </it>genes exhibited parametric features suggesting LGT. In this case a low-GC gene block in <it>Xylella fastidiosa </it>contains seven genes, of which two encode the subunits of anthranilate synthase and one encodes a repressor, TrpR <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. On the other hand, phylogenetic analysis can detect ancient events of LGT, and this approach has allowed the recognition of a number of whole-pathway and partial-pathway <it>trp </it>operons that were acquired by LGT. Presumably, the initial aberrant parametric properties associated with LGT have been ameliorated. Phylogenetic analysis also is a much more powerful indicator of the likely donor lineage following gene acquisition by LGT.</p>
         </sec>
         <sec>
            <st>
               <p><it>En bloc </it>importation of primary pathways via LGT</p>
            </st>
            <p>Ubiquitous pathways of primary biosynthesis, having a long history of genomic optimization and metabolic integration, are generally improbable candidates for replacement by LGT <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Among the challenges confronting successful LGT are that key regulatory genes may be spatially separated from the structural genes, and promoters and transcription signals vary between bacterial species, as also is the case for elements required for translation.</p>
            <p>Exceptions can be envisioned, among them: (<b>i</b>) A gene of primary biosynthesis could, in fact, be synonymous with a gene of antibiotic resistance. Thus, if one or more primary-pathway enzymes is the target of an antimicrobial agent, then genes in nature that encode resistant versions of that enzyme(s) could confer strong selective pressure for replacement. The apparent displacement of an essential enzyme of isoprenoid biosynthesis (hydroxymethylglutaryl coenzyme A reductase) in <it>Archaea </it>by a statin-resistant enzyme of bacterial origin exemplifies this <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. (<b>ii</b>) If primary pathway genes are lost by deletion, as often occurs with pathogens or symbionts, re-acquisition of the pathway might later become advantageous. In this case, no native pathway having a sophisticated history of amelioration is present to out-compete a pathway of alien origin. In a sense, the recipient has reverted to a pristine evolutionary state that is probably subject to instability and rapid change. (<b>iii</b>) An alien suite of enzymes might provide extraordinary properties of catalysis and/or regulation that are of sufficient selective value to offset a lack of ameliorative history.</p>
         </sec>
         <sec>
            <st>
               <p>The pathway of tryptophan biosynthesis</p>
            </st>
            <p>Amino acid biosynthetic pathways, such as the Trp pathway (see Table <tblr tid="T1">1</tblr> for nomenclature used), are not novel in the sense that they are ancient and widely distributed. When the Trp pathway is absent in prokaryotes, it is a consequence of gene loss (reductive evolution), usually, if not always, in pathogens or symbionts whose hosts or symbiont partners supply the Trp required. Table <tblr tid="T2">2</tblr> provides a current listing of microbial genomes that have lost some or all of the <it>trp </it>genes. Such reductive changes can be quite recent, as illustrated by existence of some <it>trp</it>-gene degradation in <it>Yersinia pestis </it>KIM, but not in the other two completely sequenced <it>Y. pestis </it>genomes. Genomes having incomplete <it>trp </it>pathways are presumably in an intermediate state of genome reduction. However, see Xie <it>et al. </it><abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> for novel functional innovations of "incomplete" Trp pathways in chlamydiae. For example, the addition of two genes to the partial-pathway operon of <it>Chlamydophila psittaci </it>has been asserted to have created a novel operon responsible for a kynurenine-to-Trp pathway that is important in overcoming a mechanism of host defense during pathogenesis <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. This hypothesis has recently been confirmed experimentally <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Also, see Barona-G&#242;mez and Hodgson <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> for their resolution of the mystery of why <it>trpC </it>is absent in the clade of actinomycete organisms, that is, they identified <it>priA </it>as a replacement for the otherwise universal <it>trpC</it>. A thorough overview of the phenomenon of "missing genes in metabolic pathways" and how this can be approached via comparative genomics has been provided by Osterman and Overbeek <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Key to nomenclature<sup>a </sup>used</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="center">
                        <p>New gene name</p>
                     </c>
                     <c ca="center">
                        <p>Prior gene name</p>
                     </c>
                     <c ca="center">
                        <p>Protein domain encoded</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpAa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpE</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Anthranilate synthase: aminase subunit (&#945;)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpAb</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpG</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Anthranilate synthase: amidotransferase subunit (&#946;)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpB</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpD</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Anthranilate phosphoribosyl transferase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpC</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpF</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Phosphoribosyl-anthranilate isomerase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpD</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpC</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Indoleglycerol phosphate synthase</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpEa</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpA</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan synthase, &#945; subunit</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>
                           <it>trpEb</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>trpB</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Tryptophan synthase, &#946; subunit</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>The nomenclature is at the level of catalytic domain in the order of reaction steps in the pathway. See [6] for a detailed rationale supporting the new nomenclature. Overall reactions of tight complexes are assigned one capital letter, with &#945; and &#946; subunits assigned the corresponding lowercase 'a' and 'b' respectively. The convention of a bullet denotes a fusion, for example, <it>trpD</it>&#8226;<it>trpC</it>.</p>
               </tblfn>
            </tbl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Complete genomes where Trp-pathway genes were lost via reductive evolution</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="center">
                        <p>Bacterial organisms</p>
                     </c>
                     <c ca="center">
                        <p>No pathway</p>
                     </c>
                     <c ca="center">
                        <p>Incomplete pathway</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Bdellovibrio bacteriovorus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Borrelia burgdorferi</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chlamydia muridarum</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chlamydia trachomatis</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chlamydophila pneumoniae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chlamydophila psittaci</it>
                           <sup>a</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Clostridium difficile</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Clostridium perfringens</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Clostridium tetani</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Coxiella burnetti</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Enterococcus faecalis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Fusobacterium nucleatum</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Haemophilus ducreyi</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Lactobacillus johnsonii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma genitalium</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma mycoides</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Mycoplasma pneumoniae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Phytoplasma asteris</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Porphyromonas gingivalis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rickettsia prowazekii</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Streptococcus agalactiae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Streptococcus equi</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Streptococcus pyogenes</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Treponema denticola</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Treponema pallidum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Tropheryma whipplei</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ureaplasma urealyticum</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Wigglesworthia glossinidia</it>
                           <sup>b</sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia </it>sp.<sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Yesinia pestis KIM</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>&#8730;</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>But see [10] for a description of how an incomplete Trp pathway has been joined with other genes to yield a mosaic operon with a novel function. <sup>b</sup>Insect endosymbiont.</p>
               </tblfn>
            </tbl>
            <p>Trp is biochemically the most expensive of the amino acids synthesized by prokaryotes. Accordingly, it is not surprising that Trp biosynthesis is usually regulated with fine-tuned precision. Different organisms deploy completely different modes of regulation, for example, compare those of <it>E. coli </it>and <it>Bacillus subtilis </it><abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. <it>Pseudomonas aeruginosa </it>exhibits yet a third distinctive system of regulation, part of which involves an activator gene (<it>trpI</it>) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. The sophisticated, complex, and highly distinctive regulation in each of the latter three organisms appears to be of recent origin based upon the relatively narrow clades possessing these particular regulatory systems. <b>This is a very important point in support of the thesis </b><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>&#160;<b>that modern, sophisticated genomes may be much less prone to displacement by LGT of <it>trp </it>genes than were ancient genomes. </b>In this context, it will be quite important to determine whether organisms that lack known modes of regulation based upon current model organisms really have relatively unsophisticated control mechanisms, or whether unknown regulatory mechanisms are in place. For example, it has been initially surprising that <it>trp </it>genes of <it>Streptomyces coelicolor </it>are not regulated by feedback repression. However, an excellent and detailed study <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> has demonstrated the existence of regulation that is both growth phase-dependent and growth rate-dependent. This has been attributed to the oligotrophic lifestyle of <it>Streptomyces</it>. It would be interesting to know whether the clade defined by the <it>Streptomyces </it>mode of regulation is also narrow (and therefore of recent origin).</p>
         </sec>
         <sec>
            <st>
               <p>Identical enzyme steps operating in different pathways</p>
            </st>
            <p>We refer to <it>trp </it>genes that produce Trp in general support of protein synthesis as genes of <b>primary </b>metabolism. Those <it>trp </it>genes responsible for the production of intermediates (or Trp) for any other purpose (e.g. as precursors of antibiotics or pigments) are referred to as genes of <b>specialized </b>(or <b>secondary</b>) metabolism. There is ample precedent for maintenance in a given genome of co-existing structural genes whose gene products catalyze the same reaction, but which function in differing temporal or spatial modes in different pathways. Such genes may be differentially regulated by distinctive control mechanisms or mechanisms that accomplish spatial separation. For example, <it>S. coelicolor </it>possesses <it>trp </it>paralog genes of identical catalytic function that exist in separate operons dedicated to primary biosynthesis, on the one hand, or to calcium-dependent antibiotic (CDA) production, on the other hand <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B19">19</abbr></abbrgrp>. It is interesting that, in such cases, one or more of the pathway genes sometimes exist as a single copy and, therefore, must be shared between both functions.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic trees for proteins require a continuum of close relatives</p>
            </st>
            <p>This study is primarily a phylogenetic analysis and depends upon protein trees. However, sequences of proteins such as the Trp enzymes are not nearly as conserved as 16S rRNA, and it is well known that they are of limited value for making phylogenetic inferences over wide phylogenetic distances <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. On the other hand, for a group of very closely related organisms, the 16S rRNA sequences can be so similar that there is a limited basis for discriminating order of branching. Here some protein trees may yield more refined branching relationships over short phylogenetic distances. Until recently, there have been relatively few sequenced genomes available that would provide a critical mass of closely related organisms as a source of Trp-protein sequences. The sequencing of new genomes is now beginning to provide phylogenetic groups of ever more dense genome representation. Health-related organisms heavily influence priorities, and wide gaps in the currently available tree exist. Phylogenetic regions where genome representation is sparse will undoubtedly persist for many years, and it would be helpful if new genomes for sequencing were selected specifically to fill phylogenetic gaps. One can anticipate that a given protein tree will progressively increase its useful phylogenetic span of discrimination as the gaps between regions of sufficient genome representation are filled.</p>
            <p>Additional daunting problems (which are minimal for 16S rRNA trees) assail the phylogenetic validity of protein trees. These are: (<b>i</b>) unrecognized paralogy and (<b>ii</b>) xenology. Ancient paralogs that arose in a common cell will usually have diverged greatly in the contemporary organisms that house them. If one or the other paralog has been lost in various lineages in an erratic fashion and the genealogical history of these losses is not recognized, this situation has been termed "unrecognized paralogy". Differential loss of ancient paralogs in different lineages has the potential to place surviving homolog proteins of distantly related organisms closer to one another than to the surviving proteins of closely related organisms. In this case, xenology can be erroneously inferred. Although LGT has been increasingly recognized for genes in general, genes encoding 16S rRNA are thought to be recalcitrant to LGT (but see Gogarten <it>et al. </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and Doolittle <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Congruency and implied cases of LGT</p>
            </st>
            <p>Since particular lineages have optimized Trp biosynthesis to overall demands of primary and secondary metabolism that are both qualitatively and quantitatively individualistic, it seems that abandonment of native <it>trp </it>genes in favor of imported alien genes would not occur very often. If correct, then one would expect that the vertical trace of evolutionary descent for <it>trp </it>genes engaged in primary Trp biosynthesis would be demonstrable, as has indeed been shown following a very comprehensive analysis <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. In the latter study, individual Trp-protein trees were found to be generally congruent with 16S rRNA trees in subtree regions that were supported by sufficiently dense phylogenetic representation.</p>
            <p>Against this background of congruency (and, indeed, enabled <b>because </b>of the overall congruency), several clear examples were found of LGT of whole-pathway <it>trp </it>operons in which all of the native <it>trp </it>genes were displaced, occasionally leaving a surviving remnant behind. In addition, examples were found of LGT of partial-pathway <it>trp </it>operons that either displaced native genes of primary biosynthesis (one case) or became associated with a secondary function. In the literature relevant to LGT there is frequently little evidence about likely donor genomes, and even less indications of direction of transfer <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. This paper provides detailed information and analysis not given in <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, and it provides an especially thorough documentation in that the occasional LGT transposition of the entire suite of seven genes responsible for Trp biosynthesis are evaluated. Importantly, we were able to identify donor lineages, to identify a gene remnant of the pre-existing Trp system in three divergent descendants of one of the recipient genomes, and to identify in one genome several evolutionary steps of specific operon perturbation that resumed in the vertical genealogy following LGT.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>Figure <figr fid="F1">1</figr> presents a 16S rRNA tree of 47 finished-genome organisms from the domain <it>Bacteria</it>. In preliminary work with the individual protein trees corresponding to the seven catalytic domains of Trp biosynthesis, we noticed that at least seven subtree blocks on the Trp-protein trees tended to be congruent with corresponding subtree blocks (differentially highlighted in Figure <figr fid="F1">1</figr>) of the 16S rRNA tree. The Gram-negative proteobacteria command special attention in this paper simply for the fortuitous reason that the greatest density of sequenced genomes is to be found in proteobacteria. The divisions of the proteobacteria are labelled in the lower-right portion of Figure <figr fid="F1">1</figr>. <it>Geobacter </it>and <it>Desulfovibrio </it>(delta-proteobacteria) are very divergent lineages and do not contribute to a common tryptophan congruency group. The grey area (epsilon proteobacteria) represents a subtree region that will probably become a tryptophan congruency group when more dense genome representation becomes available in this region. Another subtree region (not shown on Figure <figr fid="F1">1</figr>) in this category is that represented by the chlamydiae. This latter subtree region is temporarily limited for Trp analysis because it is currently represented by genomes having Trp pathways that are incomplete or absent. Beta-proteobacteria fall into a common tryptophan congruency group with some gamma-proteobacteria ("upper" gamma-proteobacteria), whereas the remaining "lower" gamma-proteobacteria (enteric lineage) form a separate and distinct tryptophan congruency group.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Positioning of 47 complete-genome organisms on a 16S rRNA tree (phylogram view)</p>
            </caption>
            <text>
               <p>Positioning of 47 complete-genome organisms on a 16S rRNA tree (phylogram view). Among these, seven 16S rRNA subtree regions are color-coded to facilitate comparison with the TCG regions on the Trp-protein concatenate tree of Fig. 2. Dashed lines in orange indicate organisms representing lineages where genomes of close relatives have not yet been sequenced. A grey box indicates a region of minimal current genome representation that is expected to become a region of subtree congruency. The subdivisions of the Proteobacteria are labelled at the lower right. Note that it is an idiosyncracy of tree presentation that Tfe and Xfa appear to group more closely with beta-proteobacteria than with other gamma-proteobacteria. However, it is a distance tree and close inspection of the distances reveal the identity of the gamma-proteobacteria as a single, cohesive group. The horizontal bar corresponds to 0.1 substitutions per site on the distance tree.</p>
            </text>
            <graphic file="1741-7007-2-15-1"/>
         </fig>
         <sec>
            <st>
               <p>Seven tryptophan congruency groups (TCGs)</p>
            </st>
            <p>Individual Trp-protein domains vary in size and degree of conservation and thus confer varied amounts of phylogenetic information. Figure <figr fid="F2">2</figr> shows a protein tree in which the seven protein domains responsible for the Trp pathway of biosynthesis have been concatenated in the order of the pathway steps (TrpAa/TrpAb/TrpB/TrpC/TrpD/TrpEa/TrpEb) prior to use of the tree-building program. The seven-domain Trp tree was constructed by using 47 organisms having a complete Trp pathway <b>and by application of an analysis aimed at exclusion of divergent paralogs and xenologs that do not function in primary Trp biosynthesis </b>(excluded paralogs and xenologs can be viewed in the succeeding figures). Each of the seven protein sections of a given concatenate has an impact on the overall tree that is roughly proportional to the number of amino acids encoded. Each of the seven nodes defining TCGs is supported by a bootstrap value of 100% (in contrast to the weakness of individual Trp-protein trees). Although the <b>overall </b>concatenated tree does not parallel the <b>overall </b>16S rRNA tree, seven subtree regions of the concatenated tree do parallel 16S rRNA subtree regions. These seven groupings are not necessarily equivalent in phylogenetic span. TCG-6, for example, is a small, closely related grouping compared with the much looser TCG-7 assemblage. The emerging, fuller membership of these seven TCGs is hinted at in Table <tblr tid="T3">3</tblr>, where additional provisional TCG members that were not included in Figure <figr fid="F2">2</figr> are listed in regular non-bold type. The latter were assigned (mostly with newly available genomes after completion of our primary analysis) by assessment of the best BLAST (Basic Local Alignment Search Tool) hits obtained after entering each Trp domain of the organisms listed as query sequences. Best-match methods are useful for rapid screening, but are subject to limitations <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Tentative TCG assignments were also assisted by other features. For example, <it>Brucella melitensis</it>, <it>Agrobacterium tumefaciens</it>, <it>Bradyrhizobium japonicum</it>, <it>Sinorhizobium meliloti </it>and <it>Rhodopseudomonas palustris </it>all possess in common the following split-pathway gene organization: <it>trpAatrpAb </it>(fused genes), a <it>trpB/trpD </it>operon, and a <it>trpC/trpEb/trpEa </it>operon <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Phylogenetic tree (radial view) constructed using the seven-domain (TrpAa/Ab/B/C/D/Ea/Eb) concatenated sequences of Trp proteins that specifically participate in primary biosynthesis</p>
               </caption>
               <text>
                  <p>Phylogenetic tree (radial view) constructed using the seven-domain (TrpAa/Ab/B/C/D/Ea/Eb) concatenated sequences of Trp proteins that specifically participate in primary biosynthesis. The Trp-pathway concatenates are from the same 47 organisms shown in Fig. 1. Dashed, orange lines indicate the lineage positions of concatenated sequences from organisms that presently lack any close relatives whose genomes have been sequenced. TCG clusters are color-coded as in Fig. 1, where full bacterial names matching the corresponding abbreviations can be found. The Cje lineage, marked in aqua, represents a probable TCG grouping. Nodes marked with solid black circles are supported by bootstrap values of 100%. Concatenates of LGT origin within TCG-1 are outlined with a grey pattern.</p>
               </text>
               <graphic file="1741-7007-2-15-2"/>
            </fig>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Membership composition of Tryptophan Congruency Groups<sup>a</sup></p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-1</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>[Corynebacterium diptheriae</it>]<sup>b</sup></b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>[Corynebacterium glutamicum</it>]</b>
                           <sup>b</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Escherichia coli</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Haemophilus influenzae</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>[Helicobacter pylori</it>]</b>
                           <sup>b</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Shewanella putrefaciens</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Vibrio cholerae</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Actinobacillus actinomycetemcomitans</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Buchnera aphidicola</it>
                           <sup>c</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Blochmannia floridanus</it>
                           <sup>c</sup>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>[<it>Corynebacterium efficiens</it>]<sup>b</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Erwinia carotovora</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Klebsiella pneumoniae</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pasteurella multocida</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Photorhabdus luminescens</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Salmonella enterica</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Shigella flexneri</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Vibrio parahaemolyticus</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Vibrio vulnificus</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Yersinia pestis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-2<sup>d</sup></b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Burkholderia fungorum</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Methylococcus capsulatus</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Neisseria meningitidis</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Nitrosomonas europaea</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Pseudomonas aeruginosa*</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Pseudomonas syringae*</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Ralstonia metallidurans</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Thiobacillus ferrooxidans</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Xanthomonas campestris</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Xylella fastidiosa</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Acinetobacter sp.</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Azotobacter vinelandii*</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bordetella bronchisepticum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bordetella parapertussis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bordetella pertussis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Burkholderia cepacia</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Burkholderia multivorans</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Chromobacterium violaceum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Microbulbifer degradans</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Neisseria gonorrhoeae</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pseudomonas fluorescens*</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Pseudomonas putida*</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Psychrobacter sp.</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Ralstonia solanacearum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Xanthomonas axonopodis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-3</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Caulobacter crescentus</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Rhodobacter sphaeroides</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Sphingomonas aromaticivorans</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Agrobacterium tumefaciens</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bradyrhizobium japonicum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Brucella melitensis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Brucella suis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rhizobium loti</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Rhodopseudomonas palustris</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Sinorhizobium meliloti</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-4</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>Anabaena </it>(Nostoc) sp. PCC 7120</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Nostoc punctiforme</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>Prochlorococcus marinus </it>CCMP 1986 (MED4)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>Synechococcus </it>sp. WH8102</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b><it>Synechocystis </it>sp. PCC 6803</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>Anabaena variabilis </it>ATCC 29413</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>Crocosphaera watsonii </it>WH 8501</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>Gloeobacter violaceus </it>PCC 2471</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>Prochlorococcus marinus </it>CCMP1375 (SS120)</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Prochlorococcus marinus MIT9313</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>Synechococcus elongatus </it>PCC 7942</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p><it>Thermosynechococcus elongatus </it>BP-1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Tricodesmium erythraeum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-5<sup>e</sup></b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Streptomyces coelicolor</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Mycobacterium tuberculosis</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Thermomonospora fusca</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bifidobacterium longum</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium avium</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium bovis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium leprae</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Mycobacterium smegmatis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Streptomyces avermitilis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-6</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Bacillus halodurans</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Bacillus stearothermophilus</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Bacillus subtilis</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Oceanobacillus iheyensis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>TCG-7</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Bacillus anthracis</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Lactococcus lactis</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Listeria monocytogenes</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Staphylococcus aureus</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>
                              <it>Streptococcus pneumoniae</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Bacillus cereus</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Listeria innocua</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Staphylococcus epidermidis</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Streptococcus gordonii</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Streptococcus mutans</it>
                        </p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Each tryptophan congruency group (TCG) defined by the concatenated tree for Trp proteins (Fig. 2) is congruent with the color-coded subtree section within the 16S rRNA tree (Fig. 1). Organisms that are included in the concatenated tree of Fig. 2 are indicated in boldface type, whereas additional organisms not included in the concatenated tree but that were qualitatively determined to belong to a given TCG are indicated in regular type. <sup>b</sup>TCG members originating by LGT are indicated within brackets and indented. <sup>c</sup>Insect symbionts. <sup>d</sup>The five organisms marked with asterisks form a distinctive subclade that is, in fact, not a "pure" component of TCG-2 because of the LGT origins of <it>trpEa </it>and <it>trpEb </it>from TCG-3 (see text). <sup>e</sup>All members of TCG-5 lack <it>trpC </it>and presumably utilize a dual-pathway <it>hisA </it>(<it>priA</it>) for this function.</p>
               </tblfn>
            </tbl>
            <p>The few exceptions that violate the otherwise good congruency between TCGs and 16S rRNA subtree regions point strongly to instances of whole-pathway <it>trp</it>-operon displacement. These include the incongruent presence of Trp-protein concatenates from both <it>Helicobacter pylori </it>and the coryneform bacteria in TCG-1. This indicates that inter-genomic transfer of whole-pathway <it>trp </it>operons occurred, with the donor identifiable as a member of TCG-1.</p>
            <p>Orange-highlighted dashed lines in Figures <figr fid="F1">1</figr> and <figr fid="F2">2</figr> mark lineages where closely related genomes have not yet been sequenced, that is, where there is sparse genome representation. There is no congruency of the branching positions of these ten orphan organisms when the 16S rRNA tree (Figure <figr fid="F1">1</figr>) is compared with the Trp-protein concatenate tree (Figure <figr fid="F2">2</figr>). It is generally believed that the position of these lineages on the 16S rRNA tree is reliable because 16S rRNA comparisons can discriminate very well over wide phylogenetic distances. In contrast, support for the position of branching obtained for these lineages with respect to the concatenated Trp-protein tree is not significant, that is, low bootstrap values. One can reasonably expect that, as more genomes are sequenced, new TCGs will emerge that include these current orphan organisms. For example, our preliminary data indicate that the Trp proteins of <it>Bacteroides thetaiotaomicron </it>will reside within a new TCG group with <it>Cytophaga hutchinsonii</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Individual Trp-protein trees</p>
            </st>
            <p>In the following section, individual protein trees are shown for comparison with the much superior tree of Trp-protein concatenates shown in Figure <figr fid="F2">2</figr>. Each individual tree consists of 47 sequences that comprise segments of each concatenate string used for the tree shown in Figure <figr fid="F2">2</figr>. Included among the concatenates are proteins of LGT origin (from <it>Helicobacter pylori</it>, <it>Corynebacterium glutamicum</it>, and <it>Corynebacterium diptheriae</it>) that function for primary Trp biosynthesis. In addition to the foregoing sequences, Figure <figr fid="F3">3</figr>, Figures <figr fid="F5">5</figr>,<figr fid="F6">6</figr>,<figr fid="F7">7</figr>, and Figures <figr fid="F9">9</figr>,<figr fid="F10">10</figr>,<figr fid="F11">11</figr> also display paralogs and xenologs that were excluded from the concatenate strings because they were not deemed to function for primary biosynthesis. It can be seen that TrpAb (Figure <figr fid="F5">5</figr>) and TrpC (Figure <figr fid="F7">7</figr>), being relatively short and not highly conserved sequences, are the least informative. TCG-2 and TCG-7 are frequently not visualized as entirely cohesive groupings when the individual trees are inspected. In each individual Trp-protein tree, primary-pathway Trp domains are designated by the organism acronym (e.g. Det), whereas unknown xenologs or paralogs are designated with a following number (e.g. Det_2). If functions are known or if names exist in the literature, for example, Sco_CDA or Pae_PhnA, respectively, these designations are used. Remnant proteins are denoted with an 'r', for example, Cgl_r (Figure <figr fid="F9">9</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Phylogenetic tree (phylogram view) of TrpAa sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree (phylogram view) of TrpAa sequences. Nodes occupied by a solid, black circle are supported by bootstrap values of 100%. Other bootstrap values are given within unfilled circles. Xenologs are shown with grey candy-striped bars, and paralogs are shown with lavender/white patterning. A specialized-pathway paralog from <it>Streptomyces coelicolor </it>(Sco_CDA) is shown, as well as a probable specialized-pathway xenolog (Xfa_2) from <it>Xylella </it>[7]. Genes in <it>C. diptheriae</it>, <it>C. glutamicum</it>, and <it>H. pylori </it>that belong to whole-pathway operons originating via LGT are shown within TCG-1. TCG-1 also includes PhnA, encoded by a gene of a partial-pathway operon from <it>P. aeruginosa</it>. TCG-7 is fragmented, the Ban and Sau TrpAa sequences being too divergent to fall within TCG-7.</p>
               </text>
               <graphic file="1741-7007-2-15-3"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of TrpAb sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of TrpAb sequences. Xfa_2 is a probable specialized-pathway xenolog from <it>Xylella</it>. Sco_CDA is a specialized-pathway paralog within TCG-5. TCG-1 contains a xenolog (PhnB) from <it>Pseudomonas aeruginosa </it>that (together with PhnA) is encoded by genes of a partial-pathway operon. Trp Ab genes from <it>Helicobacter pylori</it>, <it>Corynebacterium diptheriae</it>, and <it>C. glutamicum </it>are xenolog members of whole-pathway operons. TCG-2 and TCG-7 are fragmented. (Consult Fig. 2 for intact TCGs.) Asterisks (Tma, Cje, and Eco) indicate domains that exist as part of TrpAb&#8226;TrpB fusions.</p>
               </text>
               <graphic file="1741-7007-2-15-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of TrpB sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of TrpB sequences. Det_2 is diagrammed with both patterns, indicating it could be either an ancient paralog or a xenolog. Sco_CDA is a specialized-pathway paralog (antibiotic). Asp, Asp_2, Npu, and Npu_2 are paralogs from a gene duplication that preceded speciation of Npu and Asp. The Asp and Npu sequences were arbitrarily used for input into the concatenate tree of Fig. 2. TCG-7 is fragmented. TCG-1 contains xenolog members of whole-pathway operons from <it>H. pylori</it>, <it>C. diptheriae</it>, and <it>C. glutamicum</it>. Asterisks (Tma, Cje, and Eco) indicate domains existing as part of TrpAb&#8226;TrpB fusions.</p>
               </text>
               <graphic file="1741-7007-2-15-6"/>
            </fig>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of TrpC sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of TrpC sequences. TCG-1 contains xenolog members of whole-pathway operons from <it>H. pylori</it>, <it>C. glutamicum </it>and <it>C. diptheriae</it>. TCG-7 is fragmented. All members of TCG-1 shown (except Cje, whose position is probably coincidental) possess &#8226;TrpC as a fusion domain with TrpD&#8226;.</p>
               </text>
               <graphic file="1741-7007-2-15-7"/>
            </fig>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of TrpD sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of TrpD sequences. <it>N. punctiforme </it>and <it>Anabaena </it>sp. each possess a set of three paralogs. The Asp_2 and Npu_2 paralog sequences were used for input into the concatenates of Fig. 2. In addition to the xenolog TrpD&#8226; domain of Cgl and Cdi that is present in TCG-1, Cgl and Cdi also possess 'remnant' TrpD proteins denoted Cgl_r and Cdi_r that cluster in TCG-5.</p>
               </text>
               <graphic file="1741-7007-2-15-9"/>
            </fig>
            <fig id="F10">
               <title>
                  <p>Figure 10</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of TrpEa sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of TrpEa sequences. TrpEa proteins from Pae and Psy fall into TCG-3 instead of into TCG-2. Paralogs Asp_1, Npu_1, Asp_2 and Npu_2 were generated by a gene duplication that preceded speciation of Asp and Npu. Asp_2 and Npu_2 were arbitrarily used for concatenate input (Fig. 2). TCG-7 is not very cohesive.</p>
               </text>
               <graphic file="1741-7007-2-15-10"/>
            </fig>
            <fig id="F11">
               <title>
                  <p>Figure 11</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of TrpEb sequences</p>
               </caption>
               <text>
                  <p>Phylogenetic tree of TrpEb sequences. Cdi_2 is encoded by a paralog copy of <it>trpEb_1 </it>that has been inserted ahead of the Cdi <it>trp </it>operon. TrpEb_2 is encoded by a highly divergent paralog subclass of <it>trpEb </it>that probably has a specialized function [28]. No TrpEb_2 sequences were used to construct concatenates (Fig. 2). Rsp_2 is shown with two patterns, indicating uncertainty about whether it is an ancient paralog or a xenolog.</p>
               </text>
               <graphic file="1741-7007-2-15-11"/>
            </fig>
            <p>Xie <it>et al. </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp> can be consulted for a detailed overview of the Trp pathway reactions (summarized here in Table <tblr tid="T1">1</tblr>) and for a perspective on the nomenclature issues. Table 4 is a comprehensive listing [see <supplr sid="S1">Additional File 1</supplr>] that contains gi (gene identification) numbers (in bold) that correspond to each of the seven domains asserted to function in primary Trp biosynthesis. Paralog and xenolog gi numbers are presented in regular type. Each gi number is hyperlinked to the corresponding record at NCBI (National Center for Biotechnology Information).</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Table 4, entitled "Keys to sequence identifiers", is provided as supplementary material in an html document. This table contains a full collection of sequence data and annotations contained in this paper, and gi (gene identification) numbers are included and hyperlinked to facilitate access to the corresponding GenBank records. For future reference to a progressively updated table, refer to <url>http://biosphere.lanl.gov/aropath/table4.html</url>.</p>
               </text>
               <file name="1741-7007-2-15-S1.html">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <sec>
               <st>
                  <p>TrpAa</p>
               </st>
               <p>TrpAa is the aminase subunit of anthranilate synthase. Figure <figr fid="F3">3</figr> shows an individual protein tree for the TrpAa catalytic domain. Forty-seven of these are domain segments of the 47 concatenates making up the tree of Figure <figr fid="F2">2</figr>. An additional three sequences are putative specialized paralogs or xenologs that have been excluded from Figure <figr fid="F2">2</figr>. These include an apparent TrpAa xenolog from <it>X. fastidiosa </it>(Xfa_2) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The origin of Xfa_2 TrpAa is unknown. It is part of an apparent six-gene operon that is divergently oriented next to <it>trpR </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. It is perhaps suggestive of epsilon proteobacteria origin that Xfa_2 TrpAa exhibits a 100% bootstrap score with the TrpAa of <it>Campylobacter</it>, although the branch distances are great. Except for Xfa TrpAb (which also exhibits a suggestively close relationship with <it>Campylobacter </it>TrpAb; see Figure <figr fid="F5">5</figr>), the <it>Campylobacter </it>genome possesses neither the other four genes of the <it>Xylella </it>operon, nor <it>trpR</it>.</p>
               <p>PhnA from <it>P. aeruginosa </it><abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, together with PhnB (see Figure <figr fid="F5">5</figr>), encode a partial-pathway operon that originated from the enteric lineage. A TrpAa paralog (Sco_CDA) in <it>S. coelicolor </it>functions in the calcium-dependent antibiotic (CDA) pathway <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
               <p>Figure <figr fid="F3">3</figr> does not show the position of the TrpAa domain from <it>Thermomonospora fusca</it>, which possesses a <it>trpAa</it>&#8226;<it>trpAb </it>gene fusion. Other actinomycetes lack this fusion. The fusion is present in some widely separated organisms (Figure <figr fid="F4">4</figr>) that also use it for primary Trp biosynthesis <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. These include <it>Legionella pneumophila </it>(Lpn), <it>Azospirillum brasilense</it>, and the closely related group of organisms: <it>Rhodopseudomonas palustris</it>, <it>Mesorhizobium loti </it>(Mlo), <it>Sinorhizobium meliloti </it>(Sme), and <it>Brucella melitensis </it>(Bme). Although <it>Nostoc punctiforme </it>(Npu) and <it>Anabaena </it>sp. (Asp and Asp_2) also have <it>trpAa</it>&#8226;<it>trpAb </it>fusions, these are deemed not to be engaged in primary Trp biosynthesis <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. These TrpAa&#8226; domains all cluster closely together to the exclusion of other TrpAa sequences. They do not fall into any of the TCGs that would be expected according to the phylogenetic position of the organism having the fusion. Figure <figr fid="F4">4</figr> shows a broader phylogenetic sampling of TrpAa sequences than shown in Figure <figr fid="F3">3</figr>. Figure <figr fid="F4">4</figr> illustrates the distinct cohesion of TrpAa&#8226; domains of TrpAa&#8226;TrpAb fusions, as well as tight clustering of another variant of TrpAa&#8226;TrpAb fusion denoted TrpAa&#8226;TrpAb_phz. The latter group has a deleted region, which probably contributes to the dead-end production of 2-amino-2-deoxy-isochorismate (normally an enzyme-bound intermediate in the anthranilate synthase reaction). This compound is a precursor of phenazine pigments <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, hence our acronym TrpAa&#8226;TrpAb_phz. Figure <figr fid="F4">4</figr> shows in the unlabelled (i.e. no organism acronyms) upper group a variety of TrpAa proteins from <it>Bacteria</it>, <it>Archaea</it>, and one lower eukaryote. It is qualitatively apparent that TrpAa&#8226; in a given fusion protein is distinctly separated from TrpAa proteins of relatively close relatives lacking the fusion (as indicated by the color coding in Figure <figr fid="F4">4</figr>). For example, unfused TrpAa proteins from <it>S. coelicolor </it>or <it>Mycobacterium tuberculosis</it>, very similar to one another as cohesive members of TCG-5, are not even as close to the TrpAa&#8226; domain from the fellow actinomycete, <it>Thermomonospora fusca</it>, as they are to various archaeal TrpAa proteins or to that from <it>Saccharomyces cerevisiae</it>.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>Phylogenetic tree (radial view) showing that free-standing TrpAa domains, TrpAa components of TrpAa&#8226;TrpAb fusions, and TrpAa components of TrpAa&#8226;TrpAb_phz fusions are all distinct from one another</p>
                  </caption>
                  <text>
                     <p>Phylogenetic tree (radial view) showing that free-standing TrpAa domains, TrpAa components of TrpAa&#8226;TrpAb fusions, and TrpAa components of TrpAa&#8226;TrpAb_phz fusions are all distinct from one another. The position of some TCG groups are marked at the top. TrpA&#8226; fusion domains are color-coded to indicate their expected TCG placements if convergent evolution were not a factor. The TCG-1 grouping is distinctly divergent from all of the other free-standing TrpAa sequences.</p>
                  </text>
                  <graphic file="1741-7007-2-15-4"/>
               </fig>
               <p>The &#8226;TrpAb domain of TrpAa&#8226;TrpAb or TrpAa&#8226;TrpAb_phzE fusion proteins showed similar tight clustering when comprehensive TrpAb trees were constructed (data not shown). We have suggested that a number of TrpAa&#8226;TrpAb fusions have occurred as independent fusion events, but that the sequences have then converged due to rigid constraints imposed for proper protein-protein interactions of these subunits <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>TrpAb</p>
               </st>
               <p>TrpAb is the glutamine-binding subunit of anthranilate synthase responsible for function of the TrpAa/TrpAb complex as an amidotransferase. It is the smallest and least conserved of the seven Trp domains. Sequence features of TrpAb do not distinguish it from from PabAb, an amidotransferase subunit of <it>p</it>-aminobenzoate (PABA) synthase. Usually, however, the functional role of the two members of this homolog group can be deduced by the presence of at least one homolog in a <it>trp </it>operon or a <it>pab </it>operon. In some cases, as with <it>B. subtilis </it>and other members of TCG-6, a single subunit participates in both anthranilate and PABA synthesis. In this case it has dual-pathway function and has the ability to complex with TrpAa, on the one hand, or with PabAa, on the other hand. Most of the TCGs seen in Figure <figr fid="F2">2</figr> are recognizable on the TrpAb tree in faint outline (supported with only marginal bootstrap values), with TCG-2 and TCG-7 being especially fragmented (Figure <figr fid="F5">5</figr>). The position of the &#8226;TrpAb domain from <it>T. fusca </it>is not shown. (The discussion given immediately above for TrpAa&#8226; from <it>T. fusca </it>applies here.) Note that TrpAb from <it>Thermotoga</it>, <it>Campylobacter</it>, and a small subclade of TCG-1 represented in Figure <figr fid="F5">5</figr> by <it>E. coli </it>are fused with TrpB (TrpAb&#8226;TrpB). These seem to have arisen as independent fusions, and there is no indication here of convergent evolution (in contrast to the TrpAa&#8226;TrpAb fusions discussed above).</p>
               <p>PhnB from <it>P. aeruginosa </it>is encoded by a gene within an operon that also contains <it>phnA </it>(Figure <figr fid="F3">3</figr>). The <it>phnA </it>and <it>phnB </it>xenologs in <it>P. aeruginosa </it>are both divergent from the <it>P. aeruginosa trpAa </it>and <it>trpAb </it>homologs that are engaged in primary biosynthesis. The <it>phnA/phnB </it>operon originated by LGT from within TCG-1.</p>
               <p>The gene encoding the TrpAb paralog (Sco_CDA) is a member of a large operon (along with TrpAa, TrpB, and TrpD paralogs) that is engaged in "calcium-dependent antibiotic" biosynthesis <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> and presumably arose intragenomically by gene duplication.</p>
            </sec>
            <sec>
               <st>
                  <p>TrpB</p>
               </st>
               <p>TrpB catalyzes the anthranilate phosphoribosyl transferase step in which anthranilate and PRPP combine to yield phosphoribosyl-anthranilate. In this paper the usage of "TrpB" refers to TrpB_1, the major and ubiquitous subtype species. The distinctly divergent TrpB_2 subtype is narrowly distributed (being cyanobacteria-specific) and usually co-exists with TrpB_1 in cyanobacteria (Table 4 [see <supplr sid="S1">Additional File 1</supplr>]). TrpB_2 might prove to be an ancient paralog in cyanobacteria that has diverged to perform some other phosphoribosyl-transfer function. However, at present the closest homolog of TrpB_2 proteins are the TrpB_1 enzymes that catalyse the second step of Trp biosynthesis.</p>
               <p>Figure <figr fid="F6">6</figr> includes another <it>S. coelicolor </it>paralog dedicated to antibiotic synthesis (Sco_CDA) that can be distinguished from the primary-biosynthesis paralog. <it>Dehalococcoides ethanogenes </it>has two highly divergent paralogs, but Det_2 can be reasonably excluded from the primary-biosynthesis pathway (and, therefore, from the Det concatenated sequence of Figure <figr fid="F2">2</figr>) because it alone is not located within the whole-pathway operon <it>trpAaAbBDC</it>(<it>aroA</it><sub>I&#946;</sub>)<it>EbEa </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Det_2 could be a very ancient paralog, or it may have come from a LGT donor that happens to be relatively distant from any of the homologs included in Figure <figr fid="F6">6</figr>.</p>
               <p>In the case of the unicellular cyanobacteria, a single set of completely dispersed <it>trp </it>genes exists in every genome. The <it>Nostoc/Anabaena </it>subclade is more complex, having additional operon-organized <it>trp </it>genes. Since these species also possess the set of dispersed <it>trp </it>genes, it has been concluded that the dispersed <it>trp </it>genes constitute a basic set of conserved genes whose functional role is primary Trp biosynthesis in all cyanobacteria <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Accordingly, the <it>trp </it>genes that are present in operon organization (including <it>trpB</it>) have been excluded from the Trp concatenates constructed for <it>Anabaena </it>sp. and <it>Nostoc punctiforme </it>(Figure <figr fid="F2">2</figr>).</p>
               <p>As observed above (Figure <figr fid="F5">5</figr>) with the TrpAb&#8226; domain of three phylogentically separated TrpAb&#8226;TrpB fusions, the corresponding &#8226;TrpB domains (marked with asterisks in Figure <figr fid="F6">6</figr>) also show no indication of evolutionary convergence (suggesting that for this particular domain combination, fusion does not impose a set of rigid constraints that must be met in each independent fusion).</p>
            </sec>
            <sec>
               <st>
                  <p>TrpC</p>
               </st>
               <p>TrpC catalyzes the phosphoribosyl-anthranilate isomerase step, which yields 1-(<it>o</it>-carboxyphenylamino)-1-deoxyribulose 5-phosphate (CDRP). Figure <figr fid="F7">7</figr> shows the TrpC-protein tree on which the seven TCG groups are marked, as well as TrpC from the ten lineages (orange highlight) whose branching positions are not supported by significant bootstrap values. The &#8226;TrpC domain of <it>H. pylori </it>is the most divergent sequence within TCG-1. It clusters with TrpC from <it>Campylobacter jejuni </it>with a bootstrap value of 81%. All members of TCG-1 in the TrpC tree are &#8226;TrpC domains of TrpD&#8226;TrpC fusions, except for <it>C. jejuni </it>TrpC (whose location in TCG-1 is probably artifactual).</p>
               <p>TCG-5 organisms (actinomycete bacteria) lack a <it>trpC </it>gene (except, of course, for the horizontally transferred genes of coryneform bacteria). <it>hisA </it>(<it>priA</it>) from these organisms presumably encodes a dual-pathway enzyme that performs the isomerase function in both the histidine and Trp biosynthetic pathways <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. These dual-function <it>priA </it>genes are listed in the <it>hisA </it>column of Table 4 [see <supplr sid="S1">Additional File 1</supplr>]. Since &#8226;<it>trpC </it>in the coryneform bacteria is coordinated with the other <it>trp-</it>operon genes that originated by LGT, their <it>hisA </it>(<it>priA</it>) gene can be considered a remnant in the context of Trp biosynthesis. However, an essential role in histidine biosynthesis has undoubtedly selected for maintenance of <it>hisA</it>. It would be interesting to know whether the <it>trpC </it>function of <it>hisA </it>in the coryneform species of bacteria has deteriorated or not.</p>
            </sec>
            <sec>
               <st>
                  <p>HisA</p>
               </st>
               <p>HisA catalyses an Amadori rearrangement that is analogous to the TrpC rearrangement. This HisA isomerase catalyses the rearrangement of N'&#8211; [(5'-phospho<b>ribosyl</b>) forimino]-5-aminoimidazole-4-carboxamide ribonucleotide to N'&#8211; [(5'-phospho<b>ribulosyl</b>) forimino]-5-aminoimidazole-4-carboxamide ribonucleotide.</p>
               <p>As noted above, the actinomycete group of HisA proteins is dually competent for both the TrpC and HisA isomerase functions, and they have been named PriA to distinguish them from the pathway-specific TrpC and HisA proteins <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. However, the PriA proteins are clear homologs of HisA proteins, whereas homology with TrpC is uncertain. It is noteworthy that the HisA-protein tree shown in Figure <figr fid="F8">8</figr> resembles the individual Trp-protein trees in terms of the congruency groupings that can be recognized, and these have been given parallel HCG (Histidine Congruency Group) denotations. HisA clusters are recognizable that parallel TCG-1, TCG-2, TCG-3, TCG-4, TCG-5, and TCG-6. HCG-2 exhibits some fragmentation in that it does not include the sequences from Xca and Xfa. Note that all of the actinomycete HisA (PriA) proteins, including those from <it>C. diptheriae</it>, <it>C. glutamicum </it>(and <it>Corynebacterium efficiens</it>, not shown), form a cohesive cluster. <it>H. pylori </it>has no histidine pathway. <it>C. jejuni </it>HisA falls within HCG-1. Indeed, <it>C. jejuni </it>appears to have acquired the entire histidine pathway from the enteric lineage (tentative observations based upon BLAST queries). Organisms belonging to TCG-7 do not exhibit cohesive clustering of HisA proteins on the HisA tree, but comparable fragmentation of TCG-7 is not unusual for many of the individual Trp-protein trees (i.e. especially with the least conserved proteins).</p>
               <fig id="F8">
                  <title>
                     <p>Figure 8</p>
                  </title>
                  <caption>
                     <p>Phylogenetic tree of HisA sequences</p>
                  </caption>
                  <text>
                     <p>Phylogenetic tree of HisA sequences. Congruency groupings that match TCG clusters are labelled 'HCG' for histidine congruency group. <it>Helicobacter pylori </it>and <it>Streptococcus pneumoniae </it>are not represented on the tree because they have lost the histidine pathway. <it>C. jejuni </it>HisA appears to be a xenologous member of TCG-1. Similar to the relatively loose TCG-2 and TCG-7 groupings, HCG-2 and HCG-8 are fragmented. Actinomycete bacteria possess a tightly clustered section of dual-pathway HisA (PriA) sequences within HCG-5.</p>
                  </text>
                  <graphic file="1741-7007-2-15-8"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>TrpD</p>
               </st>
               <p>TrpD catalyzes the reaction of indoleglycerol phosphate synthase, whereby CDRP is converted to indole 3-glycerol phosphate with the release of carbon dioxide (CO<sub>2</sub>) and water (H<sub>2</sub>O). The TCG groupings seen in Figure <figr fid="F2">2</figr> are well articulated in the TrpD-protein tree of Figure <figr fid="F9">9</figr>. Within the cyanobacterial TCG-4, both <it>Anabaena </it>sp. and <it>N. punctiforme </it>possess multiple paralogs that are speculated <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> to have arisen by two gene duplications that occurred in a common ancester of <it>Anabaena </it>and <it>Nostoc</it>. The TrpD&#8226; domains of <it>H. pylori </it>and the coryneform bacteria cluster within TCG-1. <it>S. coelicolor </it>possesses a divergent paralog, Sco_CDA, that is dedicated to antibiotic biosynthesis.</p>
               <p>All of the actinomycete bacteria (bottom of Figure <figr fid="F9">9</figr>) possess monofunctional TrpD proteins that cluster within TCG-5. The <it>trpD</it> genes from coryneform bacteria in TCG-5 are remnants that have been functionally displaced by the <it>trpD&#8226;</it> domain (located in TCG-1). The latter exists as part of the <it>trpD</it>&#8226;<it>trpC </it>fusion within the <it>trp </it>operon imported via LGT. The long branches of the TrpD sequences (Cgl_r and Cdi_r) from the coryneform bacteria (especially from <it>C. glutamicum</it>) suggest strongly that these remnants may have lost catalytic function.</p>
            </sec>
            <sec>
               <st>
                  <p>TrpEa</p>
               </st>
               <p>TrpEa is the alpha subunit of tryptophan synthase. It converts indole 3-glycerol phosphate to indole with the release of glyceraldehyde 3-P. It forms a tight complex with TrpEb <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The TCG clusters are well defined, as shown in Figure <figr fid="F10">10</figr>. Within cyanobacterial TCG-4, <it>trpEa </it>appears to have been duplicated in a common ancestor of <it>Anabaena </it>and <it>Nostoc</it>, as discussed previously <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The close relatives, <it>P. aeruginosa </it>and <it>Pseudomonas syringae</it>, possess a single <it>trpEa </it>gene, which clearly falls into TCG-3, rather than TCG-2. Thus, the native <it>trpEa </it>in these <it>Pseudomonas </it>species must have been displaced by a gene originating within the lineage of TCG-3 organisms.</p>
            </sec>
            <sec>
               <st>
                  <p>TrpEb (TrpEb_1)</p>
               </st>
               <p>TrpEb is the beta subunit of tryptophan synthase and condenses indole with <it>L</it>-serine to produce <it>L</it>-tryptophan. Indole is not a free intermediate and passes through a tunnel that is created within the TrpEa/TrpEb complex <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. A small number of <it>Archaea </it>and <it>Bacteria </it>possess a homolog of TrpEb that falls into a distinct subcluster termed TrpEb_2 <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. (In this manuscript the major enzyme species, TrpEb_1, may simply be referred to as TrpEb in any context where there would be no confusion.) It has been suggested that the TrpEb_2 species does not form a complex with TrpEa and might have some other stand-alone function, such as that of serine deaminase <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Figure <figr fid="F11">11</figr> shows four bacterial genomes that possess <it>trpEb</it>_2 in addition to <it>trpEb</it>_1, namely <it>Thermotoga</it>, <it>Geobacter</it>, <it>Aquifex</it>, and <it>Chlorobium</it>. These organisms are widely spaced on the 16S rRNA tree (Figure <figr fid="F1">1</figr>). <it>Geobacter </it>has the partial-pathway operon <it>trpAa/trpAb/trpB/trpEb</it>_2/<it>trpC. trpEa </it>and <it>trpEb</it>_<it>1 </it>are located outside the operon and are unlinked to one another. Even though TrpEb_2 is located in the operon, it was excluded from the Gsu concatenate string (Figure <figr fid="F2">2</figr>) because <it>trpEb</it>_1 encodes all of the critical contact residues known to be important for physical association with TrpEa <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
               <p>The TrpEb_1 sequences in Figure <figr fid="F11">11</figr> form TCG clusters that are relatively well defined, judging from the excellent conformation with the tree of concatenated Trp sequences (Figure <figr fid="F2">2</figr>). The paralogs present in <it>Anabaena </it>and <it>Nostoc </it>within TCG-4 presumably originated following gene duplication in a common ancestor of these genera <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The sequences of TrpEb from <it>Thermomonospora</it>, <it>Streptomyces</it>, and <it>Mycobacteria </it>cluster together as expected for actinomycetes (TCG-5). Exceptions are the TrpEb proteins of the coryneform bacteria, which originated by LGT from a source within TCG-1. Interestingly, after LGT, gene duplication appears to have resulted in a second TrpEb species (Cdi_2) in <it>C. diptheriae</it>. The branching position shown in Figure <figr fid="F11">11</figr> suggests that the duplication preceded speciation. If so, <it>C. glutamicum </it>has since lost the paralog. Alternatively, rapid divergence associated with probable status of Cdi_2 as a pseudogene might account for the position on the distance tree. It is interesting that it has recently been asserted that duplication occurs more often among laterally transferred genes than for native genes <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Aromatic Congruency Groups implicate two deteriorated TCGs</p>
            </st>
            <sec>
               <st>
                  <p>Chlamydophila/Chlamydia</p>
               </st>
               <p>The chlamydiae have an intact common pathway of aromatic biosynthesis, but none have a complete <it>trp </it>pathway. In one case, <it>Chlamydophila pneumoniae</it>, no <it>trp </it>genes are present. At the other extreme, <it>C. psittaci </it>possesses all <it>trp </it>genes except for <it>trpAa </it>and <it>trpAb</it>. Other species possess an intermediate number of <it>trp </it>genes. Various rationales have been advanced <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> in support of novel functions for these incomplete pathways that have some interesting implications for pathogenesis. Those few <it>trp </it>proteins remaining in the chlamydiae are always very tightly clustered with one another on individual protein trees. We expect that when sufficient related genomes are sequenced, Trp congruency will be established for the chlamydiae. We mainly base this on the fact that trees corresponding to the seven enzymes of the common aromatic pathway yield a very cohesive Aromatic Congruency Group (Xie and Jensen, in progress).</p>
            </sec>
            <sec>
               <st>
                  <p>Epsilon proteobacteria</p>
               </st>
               <p>The epsilon proteobacteria, <it>Helicobacter pylori </it>and <it>C. jejuni, </it>were among the early complete and published genomes. Recently, the genomes of <it>H. hepaticus </it>and <it>Wolinella succinogenes </it>have also become available. These four organisms all possess common-aromatic pathway enzymes whose sequences cluster tightly together (Xie and Jensen, in progress). Since the <it>H. pylori trp </it>genes had been replaced via LGT after divergence from their closest neighbor (<it>H. hepaticus</it>), we expected that the Trp proteins of the three remaining epsilon proteobacteria would define a TCG grouping. However, Trp proteins from <it>H. hepaticus</it>, <it>W. succinogenes </it>and <it>C. jejuni </it>are all divergent from one another. They are also divergent from sequences of other organisms. There is, so far, no indication of an LGT history. <it>C. jejuni </it>has a <it>trpAa/trpAb</it>&#8226;<it>trpB/trpC/trpEb/trpEa </it>operon (<it>trpD </it>is isolated in an unlinked position). <it>H. hepaticus </it>has spaced the <it>trp </it>genes in three locations, where <it>trpAa/trpAb</it>&#8226;<it>trpB</it>, <it>trpD</it>&#8226;<it>trpC </it>and <it>trpEb/trpEa </it>reside. It is anomalous and quite possibly symptomatic of ongoing reductive evolution that <it>trpEa </it>and <it>trpEb </it>are divergently oriented. <it>H. hepaticus </it>not only has the <it>trpAb</it>&#8226;<it>trpB </it>fusion, but also has a <it>trpD</it>&#8226;<it>trpC </it>fusion. The <it>trpD</it>&#8226;<it>trpC </it>fusion of <it>H. hepaticus </it>appears to be of independent origin with respect to all of the others in TCG-1, based upon phylogenetic analysis and inter-domain linker analysis. In <it>W. succinogenes </it>the <it>trp </it>genes are all dispersed and none of them are fused. In the future, a core of epsilon proteobacteria may become available as a source of sequences to build a Tryptophan Congruency Group. The organisms currently available have experienced dynamic (and perhaps disruptive) evolutionary events that currently prevent definitive conclusions about the common ancestor.</p>
               <p>For comparison, we took a preliminary look at the histidine pathway in these four epsilon proteobacteria. <it>H. pylori </it>has completely lost the histidine pathway, and <it>C. jejuni </it>appears to have displaced its pathway with the histidine operon from the enteric lineage. Genes of histidine biosynthesis are completely dispersed in <it>H. hepaticus </it>and <it>W. succinogenes</it>, and all yield mutually best BLAST hits. Thus, <it>H. hepaticus </it>and <it>W. succinogenes </it>may be core genomes that represent a congruency group with respect to histidine biosynthesis.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Some underlying complexities of Trp-protein concatenates</p>
            </st>
            <sec>
               <st>
                  <p>Convergence of TrpAa&#8226;TrpAb fusions</p>
               </st>
               <p>Fusions of general aromatic biosynthetic pathway genes, including those of the Trp pathway, have occurred frequently. It is not uncommon for the same genes in different organisms to have undergone fusion independently. In cases where the catalytic domains function separately, there may be great latitude for successful fusion orientations, for example, <it>aroQ</it>&#8226;<it>pheA </it>fusions <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. In contrast, TrpAb delivers ammonia to the active site of TrpAa in a way that may impose particular demands upon the spatial orientation of the protein-protein interaction operating for the anthranilate synthase complex. Although we suggest that TrpAa&#8226;TrpAb fusions have occurred independently at least five times in widely spaced lineages, they all group tightly together in both TrpAa trees (Figure <figr fid="F4">4</figr>) and TrpAb trees (not shown). We believe that this is due to strong selective pressure for convergence. Thus, the individual TrpAa and TrpAb trees place the <it>T. fusca </it>TrpAa and TrpAb proteins in an anomalous group that includes proteins from <it>Legionella</it>, <it>Azospirillum</it>, <it>Brucella </it>(and close relatives) and the <it>Nostoc/Anabaena </it>group of cyanobacteria. This phenomenon of convergence disrupts the congruence of the individual TrpAa tree when these organisms are included, but this is not enough to undermine the proper placement of the <it>T. fusca </it>Trp-protein concatenate within TCG-5.</p>
            </sec>
            <sec>
               <st>
                  <p>The bifunctional <it>priA </it>gene of TCG-5 actinomycetes</p>
               </st>
               <p>The enzyme encoded by <it>priA </it>is competent for the isomerase step in both the histidine and Trp pathways of actinomycete bacteria <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Thus, PriA sequences would be concatenate components defining not only TCG-5, but also a Histidine Congruency Group (HCG-5) (Figure <figr fid="F8">8</figr>). Presumably the acquisition of the <it>trp </it>operon in coryneform species leaves <it>priA </it>unnecessary for a role in Trp biosynthesis. The dual substrate capabilities of PriA might reflect an ancient state of broad-specificity competence, with <it>hisA </it>and <it>trpC </it>being derived from specialized offspring of gene duplication <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. On the other hand, because PriA is clearly a homolog of HisA, but an uncertain homolog of TrpC, it is possible that an ancestral <it>trpC </it>was lost and subsequently replaced by a divergent duplicate of <it>hisA</it>. The overall effect of PriA being uniquely present in Trp-protein concatenates of actinomycete bacteria (i.e. being forced to align with TrpC in all other concatenates) is to compress slightly the actinomycete TCG grouping.</p>
            </sec>
            <sec>
               <st>
                  <p>The dual-pathway <it>pabAb </it>of TCG-6 bacilli</p>
               </st>
               <p>Very much like the above situation, TCG-6 <it>Bacillus </it>species have lost <it>trpAb </it>from an otherwise complete <it>trp </it>operon. <it>pabAb</it>, located in the <it>pab </it>operon, is competent to function for both Trp and folate biosynthesis (see Xie <it>et al. </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>). Thus, within the narrow <it>Bacillus </it>clade that includes <it>B. subtilis</it>, <it>B. halodurans </it>and <it>B. stearothermophilus</it>, PabAb sequences help to define not only TCG-6, but also a pending Folate Congruency Group.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Evolutionary scenario for LGT of whole-pathway Trp operons</p>
            </st>
            <p>Figure <figr fid="F12">12</figr> depicts the evolutionary events proposed to account for the location in both <it>H. pylori </it>and coryneform bacteria of all seven domains that are associated with primary Trp biosynthesis within TCG-1, which otherwise is populated by members of the enteric lineage. Members of TCG-1 all possess whole-pathway operons with a <it>trpD</it>&#8226;<it>trpC </it>fusion. As an isolated observation, we cannot absolutely rule out independent fusions of <it>trpD </it>with <it>trpC </it>in <it>Helicobacter </it>and coryneform bacteria, followed by convergent evolution with respect to the remaining <it>trpD</it>&#8226;<it>trpC </it>fusions, to explain their joint locations in TCG-1 for TrpD and TrpC trees. However, the identical overall gene order of the 7-gene operon, the phylogenetic positioning of the remaining five protein domains that are not fused, and the existence of congruent <it>trpD </it>remnants in coryneform bacteria are observations that support the conclusion of LGT with conviction. Although the phylogenetic analysis pinpoints the LGT donor of the whole-pathway <it>trp </it>operon to a member within the enteric lineage, the subclade that includes <it>E. coli</it>, <it>Salmonella typhimurium, Klebsiella pneumoniae</it>, and <it>Shigella flexneri </it>can be excluded. This is because the latter four organisms all possess a recent gene fusion (<it>trpAb</it>&#8226;<it>trpB</it>) that is absent elsewhere in the enteric lineage (absent also, of course, in <it>H. pylori </it>and coryneform bacteria).</p>
            <sec>
               <st>
                  <p>Helicobacter</p>
               </st>
               <p>The ancestral <it>trp </it>genes of <it>H. pylori </it>were displaced by a complete <it>trp </it>operon, but apparently without a leader region. <it>H. hepaticus </it>is the closest published complete genome to <it>H. pylori</it>. The LGT event can be pinpointed after divergence of <it>H. pylori </it>and <it>H. hepaticus </it>since the <it>trp </it>genes of <it>H. hepaticus </it>do not cluster within TCG-1. <it>H. hepaticus </it>possesses a partial-pathway <it>trpAa/trpAb</it>&#8226;<it>trpB </it>operon and orphan <it>trpC </it>and <it>trpD </it>genes. There are indications that the pathway is in a state of rapid deterioration, as has been described for some pathogenic enteric bacteria, for example, in <it>Actinobacillus actinomycetemcomitans </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Most notably, TrpEb and TrpEa, although still linked, have been scrambled to a divergent orientation (thus, not having the typical operon organization). In addition, <it>H. hepaticus </it>TrpEa and TrpEb possess alterations in two and four, respectively, of the residues diagrammed as invariant residues by Xie <it>et al</it>. <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Thus, one or both may be inactive pseudogenes. In view of this rapid deterioration, the contemporary genome of <it>C. jejuni </it>might be more similar to the displaced <it>H. pylori </it>Trp genes. The <it>C. jejuni </it>operon gene order is <it>trpAaAb</it>&#8226;<it>BCEbEa</it>. In <it>C. jejuni trpD </it>is not fused to <it>trpC </it>and, in fact, <it>trpD </it>has escaped the otherwise intact <it>trp </it>operon. <it>C. jejuni </it>does not have a leader peptide encoded by a gene upstream of its <it>trp </it>operon.</p>
               <fig id="F12">
                  <title>
                     <p>Figure 12</p>
                  </title>
                  <caption>
                     <p>Schematic portrayal of two whole-pathway <it>trp </it>operon transfers and two partial-pathway <it>trp </it>operon transfers</p>
                  </caption>
                  <text>
                     <p>Schematic portrayal of two whole-pathway <it>trp </it>operon transfers and two partial-pathway <it>trp </it>operon transfers. The partial tree shown is taken from Fig. 1, which identifies the organisms. In <it>C. diptheriae</it>, the vertical evolutionary events of gene duplication and insertion that occurred following LGT can be visualized by comparing the gene organization shown to the right (including intergenic spacing) with the gene organization originally received from the enteric donor (bottom right). Gene insertions that followed LGT are shown in white.</p>
                  </text>
                  <graphic file="1741-7007-2-15-12"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>Corynebacterium</p>
               </st>
               <p>The ancestor of coryneform bacteria also imported a whole-pathway <it>trp </it>operon, but in this case including a coding region for a leader peptide. It is significant that three species of coryneform bacteria possess an unlinked second copy of <it>trpD </it>(denoted TrpD_r), which is a remnant of the native displaced operon that still persists in other actinomycete <it>Bacteria</it>. This conclusion is supported (Figure <figr fid="F9">9</figr>) by the clustering of the stand-alone TrpD_r from both <it>C. glutamicum </it>and <it>C. diptheriae </it>with TrpD proteins from other actinomycete bacteria (species of <it>Bifidobacteria</it>, <it>Mycobacteria</it>, <it>Thermomonospora </it>and <it>Streptomyces</it>) in TCG-5. The latter organisms possess <it>trpD </it>within a partial-pathway <it>trp </it>operon having the order <it>trpAaDEbEa </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The elongated branches seen for TrpD_r in Cgl and Cdi in Figure <figr fid="F9">9</figr> are consistent with drift of the remnant genes from their previous functional role, due to the lack of selective pressure that followed functional replacement by TrpD&#8226; of the LGT operon.</p>
               <p>It is interesting that <it>C. diptheriae </it>exhibits evidence of changes to its <it>trp </it>operon that are both relatively recent and disruptive (Figure <figr fid="F12">12</figr>). A duplicate of <it>trpEb </it>has been inserted at the beginning of the operon between <it>trpL </it>(encoding the leader peptide) and <it>trpAa</it>. This TrpEb protein may not be functional, considering that invariant residue 162G (<it>S. typhimurium </it>numbering) has been changed to 162E and invariant residue 167K has been changed to 167S, as pointed out earlier <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Residue 167K is crucial to the formation of an intersubunit salt bridge with 56D of TrpEa <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
               <p>Another post-LGT event in <it>C. diptheriae </it>has been the insertion of two genes of <it>D</it>-pantothenate biosynthesis between <it>trpD</it>&#8226;<it>C </it>and <it>trpEb</it>. In <it>C. glutamicum </it>the <it>panB/panC </it>operon is remote from the <it>trp </it>operon and has been well characterized <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. In <it>C. diptheriae </it>the insertion between <it>trpD</it>&#8226;<it>C </it>and <it>trpEb </it>has been in a scrambled orientation such that <it>panC </it>and <it>panB </it>are transcribed convergently, hence no longer being an operon. The transcription of <it>panB </it>in the opposite direction as the flanking genes of <it>trp </it>biosynthesis should prevent the formation of a single transcript from the <it>trp </it>genes. It would appear that the original xenologous <it>trp </it>operon of <it>C. diptheriae </it>has been split such that <it>trpAaAbBD</it>&#8226;<it>C </it>and <it>trpEbEa </it>would be separately transcribed. This may illustrate the phenomenon of reductive evolution for a pathogen in the process of discarding un-needed genes.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>All elements of operon regulation were not transferred</p>
            </st>
            <p>The <it>trp </it>operon of <it>E. coli </it>(and presumably other members of the enteric clade) is subject to control at several levels <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B27">27</abbr><abbr bid="B33">33</abbr></abbrgrp>. These include (<b>i</b>) allosteric control of anthranilate synthase, whereby Trp acts as a potent feedback inhibitor; and (<b>ii</b>) an attenuator mechanism that is mediated by a Trp-rich leader peptide (encoded by <it>trpL</it>), and repression control mediated by <it>trpR</it>, which binds Trp as a corepressor moiety. Because sensitivity to feedback inhibition is built into the allosteric domain of TrpAa, and because <it>trpL </it>is located immediately upstream of the <it>E. coli trp </it>operon, it is not surprising that recipient organisms such as the coryneform bacteria possess both of the latter regulatory features. However, <it>trpR </it>is distant from the operon and was not co-transferred with the operon.</p>
            <p>Perhaps the contemporary <it>trp </it>operon acquired by LGT offered selective advantages to the ancestor of coryneform bacteria, but Trp regulation in coryneform bacteria would appear to be relatively unsophisticated without <it>trpR</it>. In <it>E. coli </it>the impact of attenuation is relatively weak compared with the impact of repression <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B27">27</abbr></abbrgrp>. Repression detects free Trp whereas attenuation detects uncharged tRNA<sup>Trp</sup>. Since free Trp concentrations can be fairly low and still maintain highly charged tRNA<sup>Trp</sup>, <it>trpR</it>-mediated repression acts over a large range of expression. Relief from attenuation ensues after maximal derepression has occurred. In <it>E. coli</it>, <it>trpR </it>also functions beyond the Trp pathway in that it binds to an operator for an initial gene of aromatic biosynthesis.</p>
         </sec>
         <sec>
            <st>
               <p>LGT of <it>trpL </it>in coryneform bacteria</p>
            </st>
            <p>The existence of a leader peptide associated with regulation of the <it>trp </it>operon (and, indeed, the unexpectedly close relationship with the <it>E. coli trp </it>operon) was reported some time ago in <it>C. glutamicum </it><abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. The leader-peptide region (encoded by <it>trpL</it>) upstream of the <it>trp </it>operons of <it>C. glutamicum</it>, <it>C. efficiens</it>, and <it>C. diptheriae </it>are shown (Figure <figr fid="F13">13</figr>) in comparison with the corresponding region of <it>V. cholerae</it>, a member of the enteric lineage that represents the LGT donor. In <it>V. cholerae</it>, <it>C. glutamicum</it>, and <it>C. efficiens</it>, <it>trpL </it>is immediately upstream of <it>trpAa</it>, the initial structural genes of the <it>trp </it>operon in these organisms. In <it>C. diptheriae </it>a copy of <it>trpEb </it>has recently been inserted between <it>trpL </it>and <it>trpAa </it>(Figure <figr fid="F12">12</figr>). The putative start codons of <it>trpL </it>in <it>Vibrio cholerae</it>, <it>C. glutamicum</it>, and <it>C. diptheriae </it>are ATG, TTG, ATG, and GTG, respectively. These yield leader peptides of 30 amino acids in length. The start codon is uncertain, and Heery and Dunican <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> have suggested a start codon (vertical black arrow) for <it>C. glutamicum </it>that would yield a 17-amino acid leader peptide.</p>
            <fig id="F13">
               <title>
                  <p>Figure 13</p>
               </title>
               <caption>
                  <p>Comparison of <it>trpL </it>leader regions in species of coryneform bacteria and in a representative enteric bacterium</p>
               </caption>
               <text>
                  <p>Comparison of <it>trpL </it>leader regions in species of coryneform bacteria and in a representative enteric bacterium. In the coryneform bacteria, the start codon for <it>trpL </it>is uncertain, and second start sites at an internal position for <it>C. glutamicum </it>and <it>C. efficiens </it>are indicated by black, vertical arrows.</p>
               </text>
               <graphic file="1741-7007-2-15-13"/>
            </fig>
            <p>In the case of <it>H. pylori</it>, the <it>trp </it>operon was either acquired without <it>trpL</it>, or <it>trpL </it>was soon lost. Thus, the <it>trp </it>pathway of <it>H. pylori </it>appears to lack regulation by both attenuation and repression. Assuming that the displaced pathway of <it>H. pylori </it>was similar to that of the contemporary <it>C. jejuni </it>or other epsilon-proteobacteria, it would be interesting to compare the relative efficiencies of the modern Trp pathways in these fairly close relatives.</p>
         </sec>
         <sec>
            <st>
               <p>LGT of partial-pathway Trp operons</p>
            </st>
            <p><it>P. aeruginosa </it>possesses a partial-pathway operon consisting of genes encoding the two subunits of tryptophan synthase that are of xenologous origin. These <it>trpEb/trpEa </it>genes are engaged in primary biosynthesis. The donor is clearly a member of the TCG-3 clade. Thus, in the scenario considered <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> for gene separation of the ancestral <it>trpC/trpEb/trpEa </it>operon to yield a stand-alone <it>trpC </it>gene and a <it>trpEb/trpEa </it>operon, the native <it>trpEb/trpEa </it>must have been displaced by LGT, after the intragenomic translocation. Because the regulatory gene, <it>trpI</it>, is adjacent in divergent orientation to <it>trpEb </it>in the <it>P. aeruginosa</it>/<it>P. syringae </it>clade (and three other close relatives; see asterisks in Table <tblr tid="T3">3</tblr>), it will be interesting if any genomic members of TCG-3 to be sequenced in the future prove to have <it>trpI </it>adjacent to a <it>trpEb/trpEa </it>operon. (However, TrpI is a member of the ubiquitous and large family of LysR transcriptional activators, and could easily have emerged recently in the <it>P. aeruginosa </it>clade following gene duplication.)</p>
            <p><it>P. aeruginosa </it>also possesses a partial-pathway <it>trp </it>operon encoding the two subunits of anthranilate synthase. Phylogenetic analysis indicates that these are <it>trpAa trpAb </it>xenologs that originated from within the enteric lineage. As with the whole-pathway operons just discussed, the <it>E. coli</it>/<it>S. typhimurium</it>/<it>K. pneumoniae</it>/<it>S. flexneri </it>clade can be excluded as the specific donor because this clade possesses a <it>trpAb</it>&#8226;<it>trpB </it>fusion. This xenologous two-gene operon of <it>P. aeruginosa </it>(originally denoted <it>phnA </it>and <it>phnB </it>by Crawford <it>et al. </it><abbrgrp><abbr bid="B25">25</abbr><abbr bid="B37">37</abbr></abbrgrp>) has not displaced the corresponding <it>trp </it>genes that are engaged in primary biosynthesis. Although <it>phnA </it>and <it>phnB </it>were originally thought to function in phenazine production, as the naming implies, phenazine pigments are not derived from anthranilate <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Instead, the xenolog pair appear to have an unknown specialized function that is geared to stationary-phase physiology <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The LGT acquisition of the <it>phnA/phnB </it>xenolog operon by <it>P. aeruginosa </it>occurred after its divergence from close relatives such as <it>P. putida</it>, <it>P. fluorescens</it>, <it>P. syringae </it>and <it>Azotobacter vinelandii </it>because none of these relatives have the xenologous <it>phnA</it>/<it>phnB </it>operon. Hence, this acquisition appears to be quite recent.</p>
            <p>Species of <it>Xylella </it>also possess a xenologous two-gene <it>trpAa</it>/<it>trpAb </it>partial-pathway <it>trp </it>operon that appears to have a specialized function that is distinct from homologs engaged in primary Trp biosynthesis. Xie <it>et al. </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp> have suggested that the gene denoted <it>acl </it>probably encodes an aryl-CoA ligase. Since <it>acl </it>appears to exist within the <it>Xylella trpAa/trpAb </it>operon, its gene product may have the specificity of anthranilate-CoA ligase. Activated anthranilate may then function as a key precursor for production of antibiotic, siderophore, and so on. Note that, if this is correct, the primary <it>trp</it>-pathway genes <it>trpB/trpC/trpD/trpEa/trpEb </it>are irrelevant to the specialized pathway. Thus, reference to the xenologous <it>trpAa/trpAb </it>pair as a "partial-pathway operon" could be somewhat a misnomer, and perhaps "hybrid operon" might prove to be more apt. The origin of the <it>Xylella trpAa/trpAb </it>genes might have been from a close relative of <it>C. jejuni </it>because the best matches of both genes are with <it>C. jejuni</it>. It is also suggestive that these genes have a low-GC ratio (about 38%), compared with a genomic GC ratio of 30.2% for <it>C. jejuni</it>. <it>C. jejuni </it>cannot have been the direct donor, however, because it possesses a <it>trpAb</it>&#8226;<it>trpB </it>fusion.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>The overall impact of LGT in <it>Bacteria </it>and <it>Archaea </it>is currently a highly contentious issue. Critical reviews written from completely different viewpoints are recommended for a sense of the status of the ongoing debate, as well as for a substantial listing of key references <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. Gogarten <it>et al. </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp> have summarized a contemporary rationale to describe the evolutionary process as a phylogenetic "synthesis" that could integrate a traditional tree-like behavior (vertical descent of genes) and web-like, reticulate behavior (LGT). The latter paper follows up on a balanced and highly insightful review produced by Doolittle in 1999 <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In an essay by Martin <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> about the extent to which bacterial chromosomes might be mosaic, it was emphasized that "careful gene-by-gene phylogenetic comparisons in addition to genome-by-genome comparisons.....are needed." We consider the Trp system to be a particularly apt choice for gaining detailed insight into the relative contributions of lateral and vertical events in the evolutionary history of a major segment of primary metabolism. On the one hand, genes of Trp biosynthesis are expected to represent a level and type of fundamental metabolism that is least prone to LGT if it is correct that a "core" of genes exists that is relatively recalcitrant to LGT <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B42">42</abbr></abbrgrp>. On the other hand, the wide distribution of intact whole-pathway <it>trp </it>operons should facilitate the probability of LGT (at least the initial acquisition event). For the Trp pathway, we conclude that events of LGT and paralogy do not obliterate the vertical trace of evolutionary history. This view is reinforced by the contention that whole-genome trees (mean pairwise similarities between shared genomic proteins) have largely converged on the rRNA-sequence tree <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. The latter work was preceded by the gene-content "trees" introduced by Snel <it>et al</it>. <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. Also, see Eisen <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
         <p>Gogarten <it>et al</it>. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> have discussed the radical possibility that rRNA genes are highly mosaic and are so useful for prokaryotic taxonomy "precisely because they are mosaics and reflect the mosaic character of the genome as a whole". According to this view, a vertical trace of genealogical history will not be found, because it only exists in short jumps. (If so, the two whole-pathway LGT events described here reflect LGT events that stand out against the overall mosaic trends.)</p>
         <p>The above caveat aside, 16S rRNA trees do appear to provide a reasonable guide to the vertical trace of evolutionary descent <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. If so, the mapping of the Trp-pathway system upon the 16S rRNA tree exemplifies a case where the evolutionary history can indeed be tracked as a vertical genealogy that features some intriguing reticulate relationships. Proteins of the Trp pathway generally exhibit a genealogy that is parallel with the 16S rRNA tree. In the case of <it>Helicobacter </it>and <it>Corynebacterium </it>species, the genomes are a mosaic with respect to Trp genes. This means that the history of the <it>trp </it>genes in the latter organisms, instead of being that shared with their closest relatives, is the same as that of enteric bacteria (the donor lineage) up to the time of LGT. Following this time a new vertical genealogical progression has begun. Each organism (lineage) can be envisioned to possess an individualistic repertoire of chimeric features that have been assimilated into a recognizable skeleton of vertical events. It would seem that the evolutionary history of the primary pathway of Trp biosynthesis parallels the organismal phylogeny in most organisms. This presently includes all of the organisms in Figure <figr fid="F2">2</figr> that belong to TCG groupings, except for <it>H. pylori </it>and the coryneform bacteria. Mosaicism can occasionally exist at a level of individual <it>trp </it>genes, as illustrated by <it>P. aeruginosa </it>and its closest relatives. Here the history of <it>trpAa/trpAb/trpB/trpC/trpD </it>(primary biosynthesis) parallels the organismal phylogeny, but <it>trpEb/trpEa </it>exhibit a reticulate genealogy.</p>
         <p>It will be important to obtain other analyses of similar detail for other pathways to assess to what extent the evolutionary process that underpins Trp biosynthesis reflects the general process in other pathways. We have already seen in preliminary work that construction of similar congruency groups pertinent to common aromatic-pathway biosynthesis, folate biosynthesis, and histidine biosynthesis has good potential to expand the analysis. It seems probable that there are no prokaryote species whose evolutionary history exclusively involved a vertical line of descent for all of its genes. Lawrence <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> has written a cogent essay in support of the view that the Linnean paradigm of hierarchical descent fails to describe the evolution of prokaryotes because LGT occurs "at all levels of taxonomic inclusiveness". However, Woese <it>et al. </it><abbrgrp><abbr bid="B48">48</abbr></abbrgrp> argue that the dynamic of LGT has become progressively diminished through time as simple, primitive early cells became complex and refined. This is in accord with our thesis <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> that early and simple <it>trp</it>-gene assemblages may have been unstable until their progression to complexities of regulation and to establishment of individualistic metabolic ties that conferred increasing operon stability.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Genomes</p>
            </st>
            <p>Refer to the Gold Genomes Online Database <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> for a list of published complete genomes and links to the corresponding references. Synonymous names include <it>Rhizobium meliloti = Sinorhizobium meliloti</it>; <it>Corynebacterium glutamicum = Brevibacterium lactofermentum = Brevibacterium flavum = Brevibacterium divaricatum; Rhizobium meliloti </it>= <it>Sinorhizobium meliloti</it>; <it>Rhizobium loti = Mesorhizobium loti</it>; <it>Thermomonospora fusca </it>= <it>Thermobifida fusca</it>; <it>Chlamydophila psittaci </it>= <it>Chlamydophila caviae</it>; <it>Thiobacillus ferrooxidans </it>= <it>Acidithiobacillus ferrooxidans</it>; <it>Shewanella putrefaciens </it>= <it>Shewanella oneidensis</it>; <it>Sphingomonas aromaticivorans = Novosphingobium aromativorans</it>.</p>
         </sec>
         <sec>
            <st>
               <p>16S rRNA phylogenetic trees</p>
            </st>
            <p>16S rRNA subtrees were derived from the Ribosomal Database site <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Protein phylogenetic trees</p>
            </st>
            <p>Unrooted phylogenetic trees were derived from ClustalW alignment <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. The neighbor-joining and Fitch <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> programs were employed to obtain distance-based trees. The distance matrix was obtained using Protdist with a Dayhoff Pam matrix. The Seqboot and Consense programs were then used to assess the statistical strength of the tree using bootstrap resampling. Neighbor-joining and Fitch trees yielded similar clusters and arrangement of taxa within them. Bootstrap values indicate the number of times a node was supported in 100 resampling replications.</p>
         </sec>
         <sec>
            <st>
               <p>Concatenated sequences of Trp-pathway proteins</p>
            </st>
            <p>Multiple sequence alignments were derived by input of the indicated homolog amino acid sequences into the ClustalW program (Version 1.4) <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. Manual alignment adjustments were made as needed with the assistance of the BioEdit multiple alignment tool of Hall <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>.</p>
            <p>After each of the individual Trp-protein domain alignments were generated, both N-terminal and C-terminal unaligned regions were trimmed manually through visual inspection. Then the seven protein domains responsible for the Trp pathway of biosynthesis were concatenated in the order: TrpAa/TrpAb/TrpB/TrpC/TrpD/TrpEa/TrpEb. The resulting concatenated multiple alignment was used as input for generation of a phylogenetic tree using the program package PHYLIP <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of raw DNA sequence data</p>
            </st>
            <p>Raw DNA contig sequences available from NCBI <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> and the TIGR Unfinished Microbial Genomes database <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> were screened using the built-in BLAST service. The protein sequences from GenBank were used as query entries. The BLAST 2.0 and the ORF Finder (Open Reading Frame Finder) offered by NCBI <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> were used to locate open reading frames and to confirm the similarity search result of the raw sequence.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of fusion proteins</p>
            </st>
            <p>Fusion protein sequences from GenBank and NCBI Microbial Genomes Blast Databases <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> were screened using the BLAST <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> program. Multiple alignments were obtained by input of single-domain and fusion-protein sequences into the ClustalW <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp> program (version 1.4).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>The major roles of the authors were as follows. GX obtained the sequences, did the alignments, and obtained the phylogenetic trees. CB drew the figures. JS undertook the systematic management and organization of sequence data with the objective of achieving a dynamic and progressively updateable website. NK managed and coordinated the general laboratory operations. The initial guiding concepts and the initial draft came from RJ. All of the authors participated in the data analyses and in the formulation of conclusions. All of the authors read and approved the final version of the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This is Florida Agricultural Experiment Station Journal Series No. R-09802. R Jensen thanks the National Library of Medicine (Grant G13 LM008297) for partial support. Some of the preliminary sequence data were obtained from the Institute for Genomic Research <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, National Center for Biotechnology Information <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, and Integrated Genomics, Inc. <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. We thank Charles Yanofsky and Ross Overbeek for their critiques of an early version of this manuscript.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular archaeology of the <it>Escherichia coli </it>genome</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>9413</fpage>
            <lpage>9417</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.95.16.9413</pubid>
                  <pubid idtype="pmpid" link="fulltext">9689094</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Amelioration of bacterial genomes: rate of change and exchange</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <fpage>383</fpage>
            <lpage>397</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9089078</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Wavelet to predict bacterial <it>ori </it>and <it>ter</it>: a tendency towards a physical balance</p>
            </title>
            <aug>
               <au>
                  <snm>Song</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ware</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>S-L</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>17</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2164-4-17</pubid>
                  <pubid idtype="pmpid" link="fulltext">12732098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Selfish operons: the evolutionary impact of gene clustering in prokaryotes and eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>642</fpage>
            <lpage>648</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(99)00025-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607610</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Selfish operons and speciation</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>5</volume>
            <fpage>355</fpage>
            <lpage>359</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0966-842X(97)01110-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9294891</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The ancient origin of the tryptophan operon and tracking the subsequent dynamics of evolutionary change</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Keyhani</snm>
                  <fnm>NO</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2003</pubdate>
            <volume>67</volume>
            <fpage>303</fpage>
            <lpage>342</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/MMBR.67.3.303-342.2003</pubid>
                  <pubid idtype="pmpid" link="fulltext">12966138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Lateral gene transfer and ancient paralogy of operons containing redundant copies of tryptophan-pathway genes in <it>Xylella </it>species and in heterocystous cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Brettin</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gottardo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Keyhani</snm>
                  <fnm>NO</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R</fpage>
            <lpage>14</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1186/gb-2003-4-2-r14</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Gene transfer, speciation, and the evolution of bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>2</volume>
            <fpage>519</fpage>
            <lpage>523</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(99)00010-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">10508729</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Bacterial origin for the isoprenoid biosynthesis enzyme HMG-CoA reductase of the archaeal orders <it>Thermoplasmatales and Archaeoglobales</it></p>
            </title>
            <aug>
               <au>
                  <snm>Boucher</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>L'Haridon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stetter</snm>
                  <fnm>KO</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>1378</fpage>
            <lpage>1388</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11420376</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Dynamic diversity of the tryptophan pathway in chlamydiae: reductive evolution and a novel operon for tryptophan recapture</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>research0051.1</fpage>
            <lpage>0051.17</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1186/gb-2002-3-9-research0051</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Molecular basis defining human <it>Chlamydia trachomatis </it>tissue tropism: a possible role for tryptophan synthase</p>
            </title>
            <aug>
               <au>
                  <snm>Fehlner-Gardniner</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Roshick</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Belland</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Caldwell</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>McClarty</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2002</pubdate>
            <volume>277</volume>
            <fpage>26893</fpage>
            <lpage>26903</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M203937200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12011099</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Tryptophan recycling is responsible for the interferon-gamma resistance of <it>Chlamydia psittaci </it>GPIC in indoleamine dioxygenase-expressing host cells</p>
            </title>
            <aug>
               <au>
                  <snm>Wood</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Roshick</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>McClarty</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>52</volume>
            <fpage>903</fpage>
            <lpage>916</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2004.04029.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15101993</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Occurrence of a putative ancient-like isomerase involved in histidine and tryptophan biosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Barona-G&#243;mez</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>EMBO Reports</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>296</fpage>
            <lpage>300</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.embor.embor771</pubid>
                  <pubid idtype="pmpid" link="fulltext">12634849</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Missing genes in metabolic pathways: a comparative genomics approach</p>
            </title>
            <aug>
               <au>
                  <snm>Osterman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Curr Opin Chem Biol</source>
            <pubdate>2003</pubdate>
            <volume>7</volume>
            <fpage>238</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1367-5931(03)00027-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">12714058</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Advancing our knowledge in biochemistry, genetics, and microbiology through studies on tryptophan metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Ann Rev Biochem</source>
            <pubdate>2001</pubdate>
            <volume>70</volume>
            <fpage>1</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.70.1.1</pubid>
                  <pubid idtype="pmpid" link="fulltext">11395401</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Reflections: using studies on tryptophan metabolism to answer basic biological questions</p>
            </title>
            <aug>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <fpage>10859</fpage>
            <lpage>10878</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.X200012200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12556463</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Nucleotide sequences of the <it>trpI</it>, <it>trpB</it>, and <it>trpA </it>genes of <it>Pseudomonas syringae</it>: positive control unique to fluorescent pseudomonads</p>
            </title>
            <aug>
               <au>
                  <snm>Auerbach</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gussin</snm>
                  <fnm>GN</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1993</pubdate>
            <volume>123</volume>
            <fpage>25</fpage>
            <lpage>32</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0378-1119(93)90534-A</pubid>
                  <pubid idtype="pmpid">8423001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The expression of the <it>trpD</it>, <it>trpC </it>and <it>trpBA </it>genes of <it>Streptomyces coelicolor </it>A3(2) is regulated by growth rate and growth phase but not by feedback repression</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>DS-J</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Heidstra</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>32</volume>
            <fpage>869</fpage>
            <lpage>880</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1999.01407.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10361288</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Regulation of the <it>Streptomyces coelicolor </it>calcium-dependent antibiotic by <it>absA</it>, encoding a cluster-linked two-component system</p>
            </title>
            <aug>
               <au>
                  <snm>Ryding</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Champness</snm>
                  <fnm>WC</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>794</fpage>
            <lpage>805</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11790750</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p><it>Archaea </it>and the prokaryote-to-eukaryote transition</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1997</pubdate>
            <volume>61</volume>
            <fpage>456</fpage>
            <lpage>502</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9409149</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Prokaryotic evolution in light of gene transfer</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>2226</fpage>
            <lpage>2238</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12446813</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Phylogenetic classification and the universal tree</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>284</volume>
            <fpage>2124</fpage>
            <lpage>2128</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.284.5423.2124</pubid>
                  <pubid idtype="pmpid" link="fulltext">10381871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Horizontal gene transfer among microbial genomes: new insights from complete genome analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>606</fpage>
            <lpage>611</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(00)00143-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">11088009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>163</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9521918</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Orthologous and paralogous divergence, reticulate evolution, and lateral gene transfer in bacterial <it>trp </it>genes</p>
            </title>
            <aug>
               <au>
                  <snm>Crawford</snm>
                  <fnm>IP</fnm>
               </au>
               <au>
                  <snm>Milkman</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>In: Evolution at the Molecular Level</source>
            <publisher>Sunderland, Mass.: Sinauer &amp; Assoc</publisher>
            <editor>Selander RK, Clark AG, Whittam TS</editor>
            <pubdate>1990</pubdate>
            <fpage>77</fpage>
            <lpage>95</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Phenazine biosynthesis in <it>Pseudomonas fluorescens </it>: branchpoint from the primary shikimate biosynthetic pathway and role of phenazine-1,6-dicarboxylic acid</p>
            </title>
            <aug>
               <au>
                  <snm>McDonald</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Mavrodi</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Floss</snm>
                  <fnm>HG</fnm>
               </au>
            </aug>
            <source>J Am Chem Soc</source>
            <pubdate>2001</pubdate>
            <volume>123</volume>
            <fpage>9459</fpage>
            <lpage>9460</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1021/ja011243+</pubid>
                  <pubid idtype="pmpid">11562236</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Trp Operon</p>
            </title>
            <aug>
               <au>
                  <snm>Yanofsky</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Miles</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bauerle</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kirschner</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <publisher>New York: John Wiley &amp; Sons, Inc</publisher>
            <pubdate>1999</pubdate>
            <volume>4</volume>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Significance of two distinct types of tryptophan synthase beta chain in <it>Bacteria</it>, <it>Archaea </it>and higher plants</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Forst</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2001</pubdate>
            <volume>3</volume>
            <fpage>research005.1</fpage>
            <lpage>005.13</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1186/gb-2001-3-1-research0004</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Duplication is more common among laterally transferred genes than among indigenous genes</p>
            </title>
            <aug>
               <au>
                  <snm>Hooper</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Berg</snm>
                  <fnm>OG</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>R48</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-8-r48</pubid>
                  <pubid idtype="pmpid" link="fulltext">12914657</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The emerging periplasm-localized subclass of AroQ chorismate mutases, exemplified by those from <it>Salmonella typhimurium </it>and <it>Pseudomonas aeruginosa</it></p>
            </title>
            <aug>
               <au>
                  <snm>Calhoun</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Bonner</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>research0030.1</fpage>
            <lpage>0030.16</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1186/gb-2001-2-8-research0030</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Tryptophan synthase from <it>Escherichia coli </it>and <it>Salmonella typhimurium</it></p>
            </title>
            <aug>
               <au>
                  <snm>Miles</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Bauerle</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ahmed</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Methods Enzymol</source>
            <pubdate>1987</pubdate>
            <volume>142</volume>
            <fpage>398</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0076-6879(87)42051-X</pubid>
                  <pubid idtype="pmpid">3298982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p><it>D</it>-Pantothenate synthesis in <it>Corynebacterium glutamicum </it>and use of <it>panBC </it>and genes encoding <it>L</it>-valine synthesis for <it>D</it>-pantothenate overproduction</p>
            </title>
            <aug>
               <au>
                  <snm>Sahm</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Eggeling</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>65</volume>
            <fpage>1973</fpage>
            <lpage>1979</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10223988</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Nonlinear dynamics of regulation of bacterial <it>trp </it>operon: model analysis of integrated effects of repression, feedback inhibition, and attenuation</p>
            </title>
            <aug>
               <au>
                  <snm>Xiu</snm>
                  <fnm>ZI</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>ZY</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>Biotech</source>
            <pubdate>2002</pubdate>
            <volume>18</volume>
            <fpage>686</fpage>
            <lpage>693</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1021/bp020052n</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Sequence analysis of the <it>Brevibacterium lactofermentum trp </it>operon</p>
            </title>
            <aug>
               <au>
                  <snm>Matsui</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sano</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ohtsubo</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1987</pubdate>
            <volume>209</volume>
            <fpage>299</fpage>
            <lpage>305</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2823076</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Cloning of the <it>trp </it>gene cluster from a tryptophan-hyperproducing strain of <it>Corynebacterium glutamicum</it>: identification of a mutation in the <it>trp </it>leader sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Heery</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Dunican</snm>
                  <fnm>LK</fnm>
               </au>
            </aug>
            <source>Appl Envir Microbiol</source>
            <pubdate>1993</pubdate>
            <volume>59</volume>
            <fpage>791</fpage>
            <lpage>799</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Cloning of the tryptophan operon of <it>Brevibacterium divaricatum </it>and its expression in <it>E. coli</it></p>
            </title>
            <aug>
               <au>
                  <snm>Su</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>20</volume>
            <fpage>87</fpage>
            <lpage>91</lpage>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Identification and characterization of genes for a second anthranilate synthase in <it>Pseudomonas aeruginosa </it>: interchangeability of the two anthranilate synthases and evolutionary implications</p>
            </title>
            <aug>
               <au>
                  <snm>Essar</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Eberly</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hadero</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Crawford</snm>
                  <fnm>IP</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1990</pubdate>
            <volume>172</volume>
            <fpage>884</fpage>
            <lpage>900</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2153661</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Functional analysis of genes for biosynthesis of pyocyamin and phenazine-1-carboxamide from Pseudomonas aeruginosa PAO1</p>
            </title>
            <aug>
               <au>
                  <snm>Mavrodi</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Bonsall</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Delaney</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Soule</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thomashow</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <fpage>6454</fpage>
            <lpage>6465</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/JB.183.21.6454-6465.2001</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591691</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Horizontal gene transfer: A critical view</p>
            </title>
            <aug>
               <au>
                  <snm>Kurland</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Canback</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Berg</snm>
                  <fnm>OG</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>9658</fpage>
            <lpage>9662</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.1632870100</pubid>
                  <pubid idtype="pmpid" link="fulltext">12902542</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Lateral gene transfer: when will adolescence end?</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Henderickson</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>50</volume>
            <fpage>739</fpage>
            <lpage>749</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2003.03778.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14617137</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Mosaic bacterial chromosomes: a challenge en route to a tree of genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>1999</pubdate>
            <volume>21</volume>
            <fpage>99</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1521-1878(199902)21:2&lt;99::AID-BIES3>3.3.CO;2-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">10193183</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>From gene trees to organismal phylogeny in prokaryotes: the case of the alpha-proteobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Daubin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <fpage>E1</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0000019</pubid>
                  <pubid idtype="pmpid" link="fulltext">14551903</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Inferring genome trees by using a filter to eliminate phylogenetically discordant sequences and a distance matrix based on mean normalized BLAST scores</p>
            </title>
            <aug>
               <au>
                  <snm>Clarke</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Beiko</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>2072</fpage>
            <lpage>2080</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/JB.184.8.2072-2080.2002</pubid>
                  <pubid idtype="pmpid" link="fulltext">11914337</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Genome phylogeny based on gene content</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>21</volume>
            <fpage>108</fpage>
            <lpage>110</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/5052</pubid>
                  <pubid idtype="pmpid" link="fulltext">9916801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Assessing evolutionary relationships among microbes from whole-genome analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Curr Opin Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>3</volume>
            <fpage>475</fpage>
            <lpage>480</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1369-5274(00)00125-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">11050445</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Interpreting the universal phylogenetic tree</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>8392</fpage>
            <lpage>8396</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.97.15.8392</pubid>
                  <pubid idtype="pmpid" link="fulltext">10900003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Gene transfer in <it>Bacteria </it>: speciation without species?</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Theoret Populat Biol</source>
            <pubdate>2002</pubdate>
            <volume>61</volume>
            <fpage>449</fpage>
            <lpage>460</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1006/tpbi.2002.1587</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Aminoacyl-tRNA synthetases, the genetic code, and the evolutionary process</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Ibba</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Soll</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2000</pubdate>
            <volume>64</volume>
            <fpage>202</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/MMBR.64.1.202-236.2000</pubid>
                  <pubid idtype="pmpid" link="fulltext">10704480</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Gold Genomes OnLine Database</p>
            </title>
            <url>http://www.genomesonline.org/</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>The RDP-II (Ribosomal Database Project)</p>
            </title>
            <aug>
               <au>
                  <snm>Maidak</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Cole</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Lilburn</snm>
                  <fnm>TG</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>CT</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Saxman</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Farris</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Garrity</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Tiedje</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>173</fpage>
            <lpage>17</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/29.1.173</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Ribosomal Database Project II</p>
            </title>
            <url>http://rdp.cme.msu.edu/html</url>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Clustal W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7984417</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>ClustalW, version 1.4</p>
            </title>
            <url>http://www.ebi.ac.uk/clustalw</url>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Toward defining the course of evolution: minimum change for a specific tree topology</p>
            </title>
            <aug>
               <au>
                  <snm>Fitch</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Syst Zool</source>
            <pubdate>1971</pubdate>
            <volume>20</volume>
            <fpage>406</fpage>
            <lpage>416</lpage>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Biological sequence alignment editor for Windows 95/98/NT, 4.8.6</p>
            </title>
            <aug>
               <au>
                  <snm>Hall</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <pubdate>2000</pubdate>
            <url>http://http//:www.mbio.ncsu.edu/BioEdit/bioedit.html</url>
         </bibl>
         <bibl id="B56">
            <title>
               <p>PHYLIP-Phylogeny Inference Package (version 3.2)</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Cladistics</source>
            <pubdate>1989</pubdate>
            <volume>5</volume>
            <fpage>164</fpage>
            <lpage>166</lpage>
         </bibl>
         <bibl id="B57">
            <title>
               <p>National Center for Biotechnology Information</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/Microb_blast/unfinishedgemome.html.old/</url>
         </bibl>
         <bibl id="B58">
            <title>
               <p>The Institute for Genomic Research</p>
            </title>
            <url>http://www.tigr.org/tdb/ufmg/</url>
         </bibl>
         <bibl id="B59">
            <title>
               <p>National Center for Biotechnology Information</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/</url>
         </bibl>
         <bibl id="B60">
            <title>
               <p>BLAST</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/blast/</url>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Integrated Genomics, Inc</p>
            </title>
            <url>http://wit.integratedgenomics.com/ERGO</url>
         </bibl>
      </refgrp>
   </bm>
</art>
