<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2001-2-12-research0054</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>The process of genome shrinkage in the obligate symbiont <it>Buchnera aphidicola</it></p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Moran</snm>
               <mi>A</mi>
               <fnm>Nancy</fnm>
               <insr iid="I1"/>
               <email>nmoran@email.arizona.edu</email>
            </au>
            <au id="A2">
               <snm>Mira</snm>
               <fnm>Alex</fnm>
               <insr iid="I1"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2001</pubdate>
         <volume>2</volume>
         <issue>12</issue>
         <fpage>research0054.1</fpage>
         <lpage>research0054.12</lpage>
         <url>http://genomebiology.com/2001/2/12/research/0054</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/gb-2001-2-12-research0054</pubid>
               <pubid idtype="pmpid">11790257</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>7</day>
               <month>9</month>
               <year>2001</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>12</day>
               <month>10</month>
               <year>2001</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>15</day>
               <month>10</month>
               <year>2001</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>14</day>
               <month>11</month>
               <year>2001</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2001</year>
         <collab>Moran and Mira, licensee BioMed Central Ltd</collab>
      </cpyrt>
      <shortabs>
         <p>To examine the process of genome reduction in symbionts, the tiny genome of the endosymbiont <it>Buchnera aphidicola</it> was compared to a reconstructed larger ancestral genome. On the basis of this comparison, 503 genes were eliminated from the <it>Buchnera</it> genome within syntenic fragments, and 1,403 genes were lost from the gaps between syntenic fragments, probably in connection with genome rearrangements.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Very small genomes have evolved repeatedly in eubacterial lineages that have adopted obligate associations with eukaryotic hosts. Complete genome sequences have revealed that small genomes retain very different gene sets, raising the question of how final genome content is determined. To examine the process of genome reduction, the tiny genome of the endosymbiont <it>Buchnera aphidicola</it> was compared to the larger ancestral genome, reconstructed on the basis of the phylogenetic distribution of gene orthologs among fully sequenced relatives of <it>Escherichia coli</it> and <it>Buchnera</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The reconstructed ancestral genome contained 2,425 open reading frames (ORFs). The <it>Buchnera</it> genome, containing 564 ORFs, consists of 153 fragments of 1-34 genes that are syntenic with reconstructed ancestral regions. On the basis of this reconstruction, 503 genes were eliminated within syntenic fragments, and 1,403 genes were lost from the gaps between syntenic fragments, probably in connection with genome rearrangements. Lost regions are sometimes large, and often span functionally unrelated genes. In addition, individual genes and regulatory regions have been lost or eroded. For the categories of DNA repair genes and rRNA genes, most lost loci fall in regions between syntenic fragments. This history of gene loss is reflected in the sequences of intergenic spacers at positions where genes were once present.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The most plausible interpretation of this reconstruction is that <it>Buchnera</it> lost many genes through the fixation of large deletions soon after the acquisition of an obligate endosymbiotic lifestyle. An implication is that final genome composition may be partly the chance outcome of initial deletions and that neighboring genes influence the likelihood of loss of particular genes and pathways.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Genome sizes in the eubacteria and archaebacteria range from 0.58 megabases (Mb) to around 10 Mb [<abbr bid="B1">1</abbr>]. The smallest of these genomes do not represent ancestral states, as was once believed, but are derived from larger genomes through massive loss of genes. The conclusion that small genome size is evolutionarily derived is based on a combination of molecular phylogenetic results, genome size determinations and full genome sequences [<abbr bid="B2">2</abbr>,<abbr bid="B3">3</abbr>,<abbr bid="B4">4</abbr>,<abbr bid="B5">5</abbr>,<abbr bid="B6">6</abbr>,<abbr bid="B7">7</abbr>,<abbr bid="B8">8</abbr>,<abbr bid="B9">9</abbr>]. Extreme genome reduction, producing bacterial genome sizes in the range of 1 Mb or less, is closely correlated with symbiotic or pathogenic lifestyles involving obligate associations with eukaryotic hosts. Thus, when lineages make the transition from potentially free-living lifestyles to obligately host-associated ones, genome reduction ensues.</p>
         <p>An immediate question is whether this reduction occurs in large steps, involving relatively few large losses, or whether it is entirely gradual, consisting of loss of individual genes one-by-one. Reconstruction of steps leading to genome reduction requires comparative analysis of related small and large genomes. Public databases now contain complete sequences for many bacterial genomes of varying sizes, but most of the fully sequenced genomes under 1.5 Mb are very distantly related to genomes of larger size. For example, the smallest genome, from <it>Mycoplasma genitalium</it> (0.58 Mb) belongs to the Mollicutes, a large and ancient clade that consists entirely of bacteria with reduced genomes [<abbr bid="B2">2</abbr>,<abbr bid="B10">10</abbr>,<abbr bid="B11">11</abbr>]. Likewise, <it>Rickettsia prowazekii</it> (1.1 Mb) is embedded in a large clade within the alpha-Proteobacteria that contains only small-genome, intracellular inhabitants such as <it>Ehrlichia, Wolbachia pipientis</it> and mitochondria [<abbr bid="B12">12</abbr>]. Similarly, the Chlamydiae, including several fully sequenced organisms, are an ancient clade, all characterized by small genomes (1.0-1.2 Mb) [<abbr bid="B13">13</abbr>]. This phylogenetic distribution hinders reconstruction of the large-genome ancestors of pathogenic lineages and of the events leading from a large genome to a very small one.</p>
         <p>Among fully sequenced published genomes, the only instance of a highly reduced genome that shows a close relationship to large genomes is <it>Buchnera aphidicola,</it> the obligate endosymbiont of aphids (Insecta). On the basis of gene content and similarity of orthologous genes, <it>Buchnera</it> is closely related to enteric bacteria, including <it>Escherichia coli</it> (gamma-3 Proteobacteria). The <it>Buchnera</it> endosymbiont of the aphid <it>Acyrthosiphon pisum</it> has a genome of 643 kb [<abbr bid="B8">8</abbr>], only one-seventh the size of the genome of <it>E. coli</it> MG1655 (4.6 Mb) [<abbr bid="B14">14</abbr>]. The <it>E. coli</it> size is similar to that of other enterics such as <it>Salmonella, Klebsiella</it> and <it>Yersinia</it> [<abbr bid="B1">1</abbr>]. The <it>Buchnera</it> gene inventory, consisting of only 564 ORFs, 32 tRNAs and a single copy of each rRNA gene, is essentially a subset of that of <it>E. coli</it> [<abbr bid="B8">8</abbr>]. (The four annotated genes lacking obvious homology with <it>E. coli</it> genes seem to be either recently lost in <it>E. coli</it> or fast-evolving genes or pseudogenes for which orthology would be difficult to detect (I. Tamas <it>et al</it>.; unpublished results)). Orthologous pairs show an average amino-acid identity of 62% and 16S rDNA identity of 89% between <it>E. coli</it> and <it>Buchnera.</it></p>
         <p><it>Buchnera</it> is a mutualistic endosymbiont of its host, but the pattern of reductions of numbers of genes among functional categories is similar between <it>Buchnera</it> and other fully sequenced small-genome bacteria, all obligate pathogens [<abbr bid="B8">8</abbr>,<abbr bid="B12">12</abbr>,<abbr bid="B15">15</abbr>,<abbr bid="B16">16</abbr>]. The major exception is that the <it>Buchnera</it> genome contains 55 loci (10% of the genome) that specify the biosynthesis of amino acids needed by its host [<abbr bid="B8">8</abbr>]. In contrast, pathogenic species with small genomes have lost these loci and acquire amino acids from host cells. Both <it>Buchnera</it> and small-genome pathogens show reduction in numbers of genes in many functional categories, including biosynthesis of metabolic intermediates, basic cellular processes (transcription, translation, replication, cell division), biosynthesis of phospholipids, and repair and recombination. Small-genome bacteria show convergent features, including accelerated sequence evolution and genome-wide base compositional bias favoring A and T [<abbr bid="B5">5</abbr>,<abbr bid="B17">17</abbr>].</p>
         <p>In attempting to characterize the evolutionary forces underlying genome reduction, a critical question concerns the size and content of the deletions. If genes or operons have been eliminated individually in separate events, their loss may be governed by the functional roles and independent fitness effects of individual loci. But, if a substantial portion of the reduction has occurred as large deletion events spanning many kilobases and functionally unrelated genes, the set of genes lost is more reasonably interpreted as the result of selection acting on the composite fitness effects of the set of loci deleted. The content of early deletions is expected to affect the strength of selection for retention of other genes. Thus, the final gene inventory might depend in part on chance combinations of gene order and deletions occurring early in the process of genome reduction. If large deletions have a role in the early stages of genome reduction, then large contiguous regions of the ancestral genome will be found to be missing from the reduced genome.</p>
         <p>Here we examine the process of genome reduction in the lineage leading to <it>Buchnera</it> through comparison of its genome with that of a hypothetical ancestor reconstructed on the basis of the genomes of <it>E. coli</it> MG1655 and several other sequenced genomes of enteric bacteria.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Gene inventory of the reconstructed ancestor relative to that of <it>Buchnera</it></p>
            </st>
            <p>A total of 2,425 protein-coding genes were retained in the ancestor, out of a total of 4,351 genes in <it>E. coli.</it> Under the procedures for reconstructing the ancestral genome (A in Figure <figr fid="F1">1</figr>), a requirement for inclusion in the ancestor was presence in <it>E. coli</it> MG1655. As a result, the reconstructed genome is expected to be smaller than that of the real ancestor, because it does not include genes that were present in the ancestor but lost independently from both <it>E. coli</it> and <it>Buchnera</it> lineages. Despite the removal of 44% of the <it>E. coli</it> genes, over 99% of genes present in <it>Buchnera</it> were also retained in the reconstructed ancestor. This outcome strongly supports this parsimony approach to reconstructing the gene inventory of the shared ancestor of <it>E. coli</it> and <it>Buchnera,</it> and also of the hypothesis that <it>Buchnera</it> evolved from such an ancestor through gene loss. Only two of the 560 protein-coding genes that show clear orthology between <it>E. coli</it> and <it>Buchnera</it> would have been removed (due to absence from both <it>Vibrio cholerae</it> and <it>Yersinia pestis</it>). The two genes (<it>dnaC</it> and <it>dnaT</it>) are linked in both <it>Buchnera</it> and <it>E. coli</it> and were retained in the ancestor.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Phylogenetic relationships of <it>Buchnera</it>, <it>E. coli</it> and related taxa used in reconstructing the genome of the free-living ancestor (A) of <it>Buchnera</it></p>
               </caption>
               <text>
                  <p>Phylogenetic relationships of <it>Buchnera</it>, <it>E. coli</it> and related taxa used in reconstructing the genome of the free-living ancestor (A) of <it>Buchnera</it>.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Rearrangements in the <it>Buchnera</it> lineage</p>
            </st>
            <p>Fragments for which the order and orientation of genes are the same in <it>Buchnera</it> and <it>E. coli</it> were assumed to be present in the common ancestor. A syntenic fragment was recognized if <it>E. coli</it> and <it>Buchnera</it> showed the same order and orientation apart from missing genes in <it>Buchnera,</it> and if these missing genes could not be located elsewhere in the <it>Buchnera</it> genome (example in Figure <figr fid="F2">2</figr>). Syntenic regions always terminated with loci that were present in both species.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Part of a syntenic fragment from <it>Buchnera</it> and the ancestor (same as <it>E. coli</it> for this region)</p>
               </caption>
               <text>
                  <p>Part of a syntenic fragment from <it>Buchnera</it> and the ancestor (same as <it>E. coli</it> for this region). Deleted loci are white in the ancestor; orthologous genes are color-coded. Genes shifted up in the figure are oriented forward in the genome; genes shifted down are oriented backwards.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-2"/>
            </fig>
            <p>This criterion resulted in 91 fragments in the ancestor that contained at least two genes and that corresponded to regions of synteny between <it>Buchnera</it> and <it>E. coli</it> (Figures <figr fid="F3">3</figr>,<figr fid="F4">4</figr>). In addition, there were 62 single genes. The longest syntenic fragment, containing the megaoperon of ribosomal proteins that is widely conserved among bacteria, consisted of 89 kb spanning 77 genes in the ancestor and 45 genes in <it>Buchnera.</it> In addition, there were 143 ancestral regions between these retained fragments, containing genes inferred to be lost in <it>Buchnera.</it> (This number is slightly less than the number of retained fragments (153) because, whereas most of the retained fragments were flanked by lost regions, a few directly bordered other retained fragments.) The ancestral genome was therefore divided into a total of 306 fragments, just over half of which were retained in the <it>Buchnera</it> genome, and there are 306 junctions between these fragments on the circular chromosome (Figure <figr fid="F3">3</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Graphic depiction of syntenic fragments and lost regions in the genome of the reconstructed ancestor and in <it>Buchnera</it></p>
               </caption>
               <text>
                  <p>Graphic depiction of syntenic fragments and lost regions in the genome of the reconstructed ancestor and in <it>Buchnera</it>. Syntenic fragments are color-coded based on position in the ancestor. Lost regions occurring between syntenic fragments are gray.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Sizes of <it>Buchnera</it> regions that are syntenic with regions in the reconstructed ancestor</p>
               </caption>
               <text>
                  <p>Sizes of <it>Buchnera</it> regions that are syntenic with regions in the reconstructed ancestor. Solid bars, number of fragments; open bars, number of genes.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-4"/>
            </fig>
            <p>The evolution of <it>Buchnera</it> was clearly accompanied by many chromosome rearrangements and massive changes in genome size (Figures <figr fid="F3">3</figr>,<figr fid="F5">5</figr>). Remarkably, a comparison of ortholog positions between <it>Buchnera</it> and <it>E. coli</it> indicates that a pattern of approximate symmetry of gene distance from the replication origin and terminus was maintained (Figure <figr fid="F6">6</figr>). This is indicated by an X-pattern when the positions in the two genomes are plotted against one another with the origins of replication as endpoints (the replication origin in <it>Buchnera</it> was designated by Shigenobu <it>et al.</it> [<abbr bid="B8">8</abbr>] on the basis of the position of the only DnaA box present on the chromosome and on a shift in the GC-skew value at third codon positions around that region). This X-pattern has recently been found to be typical for comparisons of orthologous regions between pairs of closely related bacteria [<abbr bid="B18">18</abbr>,<abbr bid="B19">19</abbr>,<abbr bid="B20">20</abbr>]. It is most readily explained as the result of successive inversions around the replication terminus or origin.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Region rearranged in the <it>Buchnera</it> lineage, based on the order in <it>Vibrio cholerae</it> and <it>E. coli</it></p>
               </caption>
               <text>
                  <p>Region rearranged in the <it>Buchnera</it> lineage, based on the order in <it>Vibrio cholerae</it> and <it>E. coli</it>. Orthologous genes are in matching colors. In <it>Buchnera</it>, <it>yfgK</it> and <it>yfgM</it> have been translocated and inverted. Numbers under the genes denote genomic position and size. Genes marked with 'D' were eliminated in the evolution of <it>Buchnera</it>.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Relative positions of orthologs in the <it>Buchnera</it> and the <it>E. coli</it> genomes, with the origin of replication for each genome at the origin</p>
               </caption>
               <text>
                  <p>Relative positions of orthologs in the <it>Buchnera</it> and the <it>E. coli</it> genomes, with the origin of replication for each genome at the origin. Each gene is positioned by its starting base within each genome. Despite the difference in genome size, an X-pattern is evident, indicating preservation of the relative absolute distance from the origin of replication.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-6"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Genes lost as deletions within and between regions of synteny</p>
            </st>
            <p>On the basis of the reconstructed ancestor, a total of 503 genes in 156 locations were lost in deletions within regions of synteny (Figure <figr fid="F7">7</figr>, top). These deleted regions are sometimes large, with 11 regions spanning 10 or more genes (Figure <figr fid="F7">7</figr>, top). A much larger proportion of the reduction can be attributed to regions lost between syntenic fragments; 1,449 genes were lost in 143 such gaps (Figure <figr fid="F7">7</figr>, bottom).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Numbers of genes in deleted regions</p>
               </caption>
               <text>
                  <p>Numbers of genes in deleted regions. <b>(a)</b> Numbers of genes in deleted regions occurring within fragments syntenic between <it>Buchnera</it> and its ancestor. <b>(b)</b> Numbers of genes in deleted regions occurring between fragments syntenic between <it>Buchnera</it> and its ancestor.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-7"/>
            </fig>
            <p>For losses occurring both within and between syntenic fragments, a single lost region frequently contains multiple genes, often involved in seemingly unrelated functions (see, for example, Figures <figr fid="F2">2</figr> and <figr fid="F5">5</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Gene erosion and spacer lengths</p>
            </st>
            <p>Some genes deleted within regions of synteny persist as partially degraded sequences that range from pseudogenes having clear homology with the <it>E. coli</it> ortholog [<abbr bid="B8">8</abbr>], to much shortened sequences with no recognizable sequence homology. These sequences indicate that function has sometimes been eliminated, initially through a small deletion or substitution, with the remaining DNA sequence subsequently eroded by multiple mutations that are biased in favor of deletion over insertion, as documented for <it>Rickettsia</it> [<abbr bid="B7">7</abbr>,<abbr bid="B9">9</abbr>] and other bacteria including <it>Buchnera</it> [<abbr bid="B21">21</abbr>]. An analysis of <it>Buchnera</it> pseudogenes showed that deletions outnumber insertions (31 versus 2) and that the number of nucleotides deleted is over 100-fold greater than those inserted [<abbr bid="B21">21</abbr>].</p>
            <p>An expected result of this gradual gene degradation is that intergenic spacers that contain gene remnants will be longer than spacers where no gene loss has occurred. <it>Buchnera</it> spacers can be categorized into three groups: those flanked by the same genes in <it>Buchnera</it> and the ancestor, those that occur within regions of synteny at positions where gene(s) are missing in <it>Buchnera,</it> and those that occur between syntenic fragments. Spacers in the first category, which can be considered to be descended from ancestral spacers, have an average length of 55 bp (<it>n</it> = 272). These ancient spacers are much shorter than spacers occurring where ancestral genes have been deleted within syntenic fragments (188 bp, <it>n</it> = 165) or than spacers occurring between syntenic fragments (also 188 bp, <it>n</it> = 162). Considering only spacers within syntenic fragments, spacer length does not increase further when the number of genes lost is greater than one (Figure <figr fid="F8">8</figr>).</p>
            <fig id="F8">
               <title>
                  <p>Figure 8</p>
               </title>
               <caption>
                  <p>Lengths of intergenic spacers in <it>Buchnera</it> relative to number of genes deleted from the corresponding position within syntenic fragments</p>
               </caption>
               <text>
                  <p>Lengths of intergenic spacers in <it>Buchnera</it> relative to number of genes deleted from the corresponding position within syntenic fragments. Five spacers that included remnants of recognizable pseudogenes (&#968;) are indicated. The dotted line indicates the average length of spacers at sites where no genes have been eliminated (mean = 55 nucleotides, <it>N</it> = 272).</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-8"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Promoter loss and fusion of transcription units</p>
            </st>
            <p>Small bacterial genomes typically contain a smaller proportion of regulatory elements than do larger genomes [<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>]. Consistent with this trend, the genome of <it>Buchnera</it> has been noted to lack virtually all regulatory proteins [<abbr bid="B8">8</abbr>]. We examined promoters within intergenic spacers flanked by the same genes in both <it>Buchnera</it> and <it>E. coli</it> under the assumption that these evolved from a common ancestral sequence and thus represent orthologous spacers. In 44 of the orthologous spacers, a &#963;-70 promoter is annotated in <it>E. coli</it> on the basis of experimental evidence and/or presence of a promoter consensus sequence [<abbr bid="B14">14</abbr>]. The consensus sequence for this promoter is TTGACA-(l7 nucleotides)-TATAAT, from which the -35 and -10 regions are each defined primarily by three base pairs:TTG--- for the -35 region and TA---T for the -10 region [<abbr bid="B24">24</abbr>,<abbr bid="B25">25</abbr>]. We applied a conservative criterion for presence of a promoter in <it>Buchnera,</it> by requiring retention of as few as four of these six nucleotides with a separation of 16-18 nucleotides. In 16 of the 33 spacers in which flanking genes are encoded on the same strand and in which <it>E. coli</it> has an annotated promoter, <it>Buchnera</it> lacks any recognizable promoter. All such apparent losses occurred when the flanking genes were encoded on the same strand, suggesting that promoter loss is frequent when a group of newly contiguous genes can be translated as a single transcriptional unit. In some cases, fusion of genes into the same polycistron has occurred following the loss of intervening genes oriented in the opposite direction (Figure <figr fid="F9">9</figr>). In contrast, in none of the 11 cases in which orthologous spacers were located between genes oriented in opposite directions did <it>Buchnera</it> lose promoters.</p>
            <fig id="F9">
               <title>
                  <p>Figure 9</p>
               </title>
               <caption>
                  <p>Example of a syntenic fragment in which <it>Buchnera</it> has lost genes of opposite orientation to flanking genes and has fused the new neighbors into a polycistron, through loss of intervening promoters</p>
               </caption>
               <text>
                  <p>Example of a syntenic fragment in which <it>Buchnera</it> has lost genes of opposite orientation to flanking genes and has fused the new neighbors into a polycistron, through loss of intervening promoters. R = repetitive sequence; P = promoter sequence; T= termination sequence.</p>
               </text>
               <graphic file="gb-2001-2-12-research0054-9"/>
            </fig>
            <p>On the basis of sequence criteria, <it>Buchnera</it> also shows degeneration of the Shine-Dalgarno (SD) sequences that promote initiation of translation by binding with the anti-SD complement in rRNA (which is widely conserved and identical for <it>Buchnera</it> and <it>E. coli</it>). Changes in the SD sequence can impede initiation of translation [<abbr bid="B26">26</abbr>,<abbr bid="B27">27</abbr>]. In comparisons of spacers orthologous between <it>E. coli</it> and <it>Buchnera, Buchnera</it> showed a lower number of matches to the core eight nucleotides of the SD sequence in 37 of 51 cases (73%). Also, a total of 20 protein-binding sites were annotated within <it>E. coli</it> spacers orthologous to <it>Buchnera</it> spacers (that is, with the same flanking genes in the same orientation). In <it>Buchnera,</it> only one was preserved, four had half of the sequence maintained, and the remaining 15 were not detectable.</p>
            <p>These hypothetical losses of regulatory regions from the reduced genome of <it>Buchnera</it> are based on levels of sequence similarity with the known consensus sequence in modern <it>E. coli.</it> There is no experimental evidence to eliminate the possibility that the degraded promoters in <it>Buchnera</it> are still functional, or that the symbiont uses different promoters from those identified in <it>E. coli.</it> In addition, inaccuracies in the annotated positions of <it>Buchnera</it> genes, in which positions of start codons are not verified experimentally, could bias analyses of SD sequences.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Genetic drift as a basis for genome reduction</p>
            </st>
            <p>Endosymbiosis or chronic pathogenesis involves a metabolite-rich environment within host tissues, low growth rates and isolation of strains within host individuals. This combination of factors results in relaxed selection at many loci and higher levels of genetic drift affecting the entire genome. The reduced effectiveness of selection is reflected in patterns of sequence evolution in <it>Buchnera</it> and other endosymbionts [<abbr bid="B17">17</abbr>,<abbr bid="B28">28</abbr>,<abbr bid="B29">29</abbr>]. It is also expected to affect genome evolution, accelerating the loss of genes that are entirely superfluous as well as those that are beneficial but not essential.</p>
            <p>Small genome size itself is likely to reflect lack of selection for gene retention rather than direct selection for a compact genome [<abbr bid="B21">21</abbr>]. Especially in the case of <it>Buchnera,</it> which possesses 50-200 chromosomes per cell [<abbr bid="B30">30</abbr>] and in which nonfunctional pseudogenes can persist for long periods [<abbr bid="B31">31</abbr>], selection for reduced DNA content seems an untenable explanation for its small genome. The polyploidy itself may be a consequence of the loss, through genetic drift, of one or more loci involved in regulating and resolving chromosome replication during the cell cycle.</p>
            <p>One pattern that has emerged from comparative analyses of fully sequenced genomes is that, although different pathogen and symbiont lineages approach the same minimal genome size, the inventories of genes retained by these organisms are extremely different [<abbr bid="B3">3</abbr>,<abbr bid="B32">32</abbr>,<abbr bid="B33">33</abbr>,<abbr bid="B34">34</abbr>,<abbr bid="B35">35</abbr>,<abbr bid="B36">36</abbr>]. Relatively few genes are universally distributed, and universal cellular processes depend in part on nonorthologous genes. The divergence in gene inventories among small-genome bacteria implies redundancy in ancestral genomes underlying central cell processes, including DNA processing, transcription and translation. For the most part, this redundancy does not arise from the presence of paralogous genes, which are relatively few in bacteria. An examination of the process of genome reduction, as exemplified by <it>Buchnera,</it> could yield insight into why gene inventories differ among small genomes.</p>
         </sec>
         <sec>
            <st>
               <p>Deletion sizes in the evolution of <it>Buchnera</it></p>
            </st>
            <p>The comparison of the <it>Buchnera</it> genome to the reconstructed ancestral genome suggests that a considerable part of genome reduction occurred through large deletions accompanying chromosome rearrangements. One indicator that supports gene loss partly through large deletions spanning multiple genes is the distribution of sizes of these regions in the ancestor (Figure <figr fid="F7">7</figr>, bottom). If genes were lost one by one, the expectation is that retained genes would be randomly mixed with lost genes within the ancestral genome. In the observed distribution, lost genes are aggregated. Although part of this clustering might be attributed to the linkage of functionally related genes, this would not account for the larger segments. Another indicator of gene loss through large deletions is the lack of positive relationship between spacer length and number of genes lost for gene deletions that occurred within syntenic fragments (Figure <figr fid="F8">8</figr>). Thus, the most plausible explanation for much of the gene loss occurring in the evolution of <it>Buchnera</it> is the fixation of single large deletions spanning many genes, although it is also clear that some genes were eliminated through smaller deletions and gradual erosion. Large deletions spanning multiple genes are possible only early in the reductive process, when many genes are nonessential. The content of initial large deletions will determine the degree of selection on remaining loci and thus will govern the ultimate composition of the reduced genome.</p>
         </sec>
         <sec>
            <st>
               <p>Loss of RNA genes</p>
            </st>
            <p>One distinctive aspect of the <it>Buchnera</it> genome is the presence of only one copy each of the genes encoding 16S, 23S and 5S rRNA, and the separation of these genes into two transcriptional units with the 16S rRNA gene (<it>rrs</it>) apart from the others (<it>rrf</it> and <it>rrl</it>) [<abbr bid="B37">37</abbr>]. Typically, bacteria possess multiple rRNA operons, each containing all three genes; the <it>Buchnera</it> ancestor is inferred to have possessed at least five operons and modern <it>E. coli</it> has seven. The remaining rRNA genes descend from two different ancestral operons, with other rRNA operons lost entirely. From flanking genes, it is evident that the two retained transcription units correspond to <it>rrsH</it> and <it>rrfD-rrlD</it> of <it>E. coli.</it> The missing parts of these operons (<it>rrfH, rrlH,</it> and <it>rrsD</it>) were lost in deletions within syntenic fragments. All other rRNA operons were lost in entirety as part of regions occurring between syntenic fragments. Thus, the modern number and arrangement of rRNA genes is the result of the loss of large regions, possibly in the course of rearrangements, as well as the elimination of individual genes within operons.</p>
            <p><it>E. coli</it> has 86 tRNAs, and most of these appear to be present in the ancestor (the parsimony criterion for presence of a tRNA in the ancestor, based on distribution in sequenced genomes, is somewhat unreliable because of sequence homology among different tRNA genes). <it>Buchnera</it> has only 32 tRNAs, with most amino acids having only one. Assuming that the ancestor had the same complement of tRNAs as <it>E. coli,</it> 15 were lost from within syntenic fragments and 39 were lost in regions between syntenic fragments. Selection enforces the retention of at least one tRNA for each amino acid, as observed in modern <it>Buchnera</it>, so the early fixation of large deletions containing some tRNAs would have created a selective requirement for the retention of others.</p>
         </sec>
         <sec>
            <st>
               <p>Loss of DNA repair pathways</p>
            </st>
            <p>One example of a functional category that is routinely reduced in small-genome bacteria is that consisting of genes for DNA repair and recombination. As is typical for small genome bacteria, <it>Buchnera</it> has lost a large number of repair genes (Table <tblr tid="T1">1</tblr>). In this functional category, as in others, the set of retained genes differs among reduced genomes [<abbr bid="B15">15</abbr>]. For example, <it>Buchnera</it> is unique among sequenced genomes in having lost <it>recA,</it> which functions in homologous recombination and repair. In contrast, <it>Buchnera</it> retains <it>recBCD,</it> which has been lost by several other small genomes [<abbr bid="B8">8</abbr>,<abbr bid="B12">12</abbr>,<abbr bid="B15">15</abbr>]. The reconstructed events underlying the loss of individual repair genes include many large deletions, encompassing many genes (Table <tblr tid="T1">1</tblr>). For example, under the reconstruction, <it>recA</it> is part of a contiguous deleted region of about 10 kb containing ten genes. Also unusual in <it>Buchnera</it> is the lack of <it>uvrA, uvrB</it> and <it>uvrC</it> (encoding an excision nuclease involved in repair of UV damage to DNA). The <it>uvrB</it> and <it>uvrC</it> genes fall in separate large deleted regions between syntenic fragments, whereas <it>uvrA</it> is one of six genes deleted within a syntenic region, with the corresponding position in <it>Buchnera</it> occupied by a 618 bp spacer (Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Characteristics of deleted regions containing DNA repair loci that have been lost during the evolution of <it>Buchnera</it></p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Deleted gene</p>
                     </c>
                     <c ca="left">
                        <p>Functional role</p>
                     </c>
                     <c ca="center">
                        <p>Within/between syntenic fragments</p>
                     </c>
                     <c ca="center">
                        <p>Size of deleted region (nucleotides)</p>
                     </c>
                     <c ca="center">
                        <p>Number of genes in deleted region</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ada</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Direct damage reversal</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>25,426</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>ogt</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Direct damage reversal</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>88,327</p>
                     </c>
                     <c ca="center">
                        <p>83</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>tag</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Base excision repair</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>33,065</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>mutM</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Base excision repair</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>809</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>mutH</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mismatch repair</p>
                     </c>
                     <c ca="center">
                        <p>Within</p>
                     </c>
                     <c ca="center">
                        <p>9,601</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>recJ</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mismatch repair</p>
                     </c>
                     <c ca="center">
                        <p>Within</p>
                     </c>
                     <c ca="center">
                        <p>4,533</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>uvrD</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Mismatch repair</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>11,325</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>recA</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Recombinase pathway</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>9,884</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>recF</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Recombinase pathway</p>
                     </c>
                     <c ca="center">
                        <p>Within</p>
                     </c>
                     <c ca="center">
                        <p>1,073</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>recN</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Recombinase pathway</p>
                     </c>
                     <c ca="center">
                        <p>Within</p>
                     </c>
                     <c ca="center">
                        <p>1,661</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>uvrA</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>UV excision repair</p>
                     </c>
                     <c ca="center">
                        <p>Within</p>
                     </c>
                     <c ca="center">
                        <p>6,579</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>uvrB</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>UV excision repair</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>60,515</p>
                     </c>
                     <c ca="center">
                        <p>64</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>uvrC</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>UV excision repair</p>
                     </c>
                     <c ca="center">
                        <p>Between</p>
                     </c>
                     <c ca="center">
                        <p>27,558</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The most plausible interpretation of this pattern is that some of these repair functions were initially lost as the result of large deletions, sometimes occurring in combination with chromosomal rearrangements. These deletions were followed by gradual loss of other genes in the same pathways. The process of fixation of these large deletions reflects not only selection on the repair functions but also selection on other genes on the deleted fragments. A gene flanked by required loci is less likely to be lost in an early large deletion than a gene flanked by nonessential loci. For example, if the 10 kb segment containing <it>recA</it> was lost as a single deletion, then the loss of <it>recA</it> was dependent on the fact that the neighboring genes included in the deletion were not essential for survival. The retention of <it>recBCD</it> might have been promoted by <it>Buchnera</it>'s requirement for the flanking gene, <it>argA,</it> which encodes an enzyme for biosynthesis of arginine. Other small-genome bacteria lack <it>argA</it> and other genes underlying amino-acid biosynthesis, whereas <it>Buchnera</it> retains all genes required for biosynthesis of essential amino acids, which are needed by the host.</p>
            <p>Many genes were lost singly, implying insufficient selection for conservation of individual genes. A consequence is the correlation in presence/absence among genes in the same pathway, even if they occupy separate locations on the chromosome: if pathway function is lost, all genes in the pathway are invariably lost or degraded. An example in the case of the recombinase genes is the loss of <it>recF,</it> which is required for some <it>recA</it> functions [<abbr bid="B38">38</abbr>] and which was deleted individually with both flanking genes retained (Table <tblr tid="T1">1</tblr>). A plausible scenario for the loss of this pathway is that, once <it>recA</it> was eliminated in the context of a large deletion, there was no selection for <it>recF</it> retention and it was subject to successive small deletions and substitutions. In both <it>Buchnera</it> of <it>A. pisum</it> and <it>Buchnera</it> of <it>Schizaphis graminum, recF</it> is replaced by 127-138 nucleotides, with base composition 87-89% A+T [<abbr bid="B8">8</abbr>,<abbr bid="B39">39</abbr>]. The process of shrinkage and A+T accumulation could result strictly from mutational patterns, which are biased towards deletions [<abbr bid="B21">21</abbr>] and nucleotide substitutions ending in A or T [<abbr bid="B31">31</abbr>].</p>
         </sec>
         <sec>
            <st>
               <p>Large deletions in other bacterial genomes</p>
            </st>
            <p>Genome comparisons across other groups of related bacteria suggest that deletion events encompassing multiple loci are frequent in bacterial evolution. For example, deletions of over 20 kb and 20 loci have occurred in natural isolates of <it>E. coli</it> that are very closely related on the basis of sequence homology [<abbr bid="B40">40</abbr>]. On the basis of comparison with its close relative <it>Mycobacterium tuberculosis, M. leprae</it> shows a pattern of genome reduction that includes both loss of large fragments and also loss of gene function through slight changes in sequence [<abbr bid="B41">41</abbr>]. Different strains of <it>M. tuberculosis</it> have been found to contain at least 25 long deletions, with one event removing as many as 16 ORFs [<abbr bid="B42">42</abbr>].</p>
         </sec>
         <sec>
            <st>
               <p>Consequences of genome reduction for gene regulation</p>
            </st>
            <p>The changes in sequences underlying initiation of both transcription and translation, as well as the loss of regulatory proteins [<abbr bid="B8">8</abbr>], give an impression of genome-wide degeneration of regulatory functions in <it>Buchnera.</it> The hypothesized elimination of promoters and genes seems to have produced newly formed polycistronic regions (Figure <figr fid="F9">9</figr>). The fusion of genes into single transcriptional units has been interpreted as the result of selection favoring small genome size [<abbr bid="B43">43</abbr>] or favoring efficiency in transcription or translation [<abbr bid="B44">44</abbr>]. In <it>Buchnera,</it> however, the hypothesized loss of promoters suggests a general genomic decay influencing transcriptional regulation; this is supported by the observation that SD sequences are also degenerate, and by the finding that some of the most frequently used codons in the genome do not have corresponding tRNAs because they have been deleted throughout <it>Buchnera</it>'s evolution. Experiments on gene expression patterns are needed to test the potential deterioration of regulatory capabilities.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p><it>Buchnera</it> provides the first case of a fully sequenced, highly reduced genome for which closely related large genomes exist and have been sequenced, allowing reconstruction of the steps in genome reduction. The most plausible interpretation of the deletion of large contiguous regions during the evolution of <it>Buchnera</it> is that the transition to the endosymbiotic lifestyle was accompanied, or soon followed, by large deletions. The location and content of early deletions might well have shaped the ultimate gene inventory of the fully reduced genome found in modern <it>Buchnera.</it></p>
         <p>The similarity in genome size across <it>Buchnera</it> species [<abbr bid="B45">45</abbr>] suggests that most genome reduction occurred in the shared ancestral lineage and that the ancestor of modern <it>Buchnera</it> already had reached a near-minimum size. The extremely stable genome content of modern <it>Buchnera</it> is also indicated by comparison of the full genome sequences of <it>Buchnera</it> of <it>A. pisum</it> and <it>Buchnera</it> of <it>Schizaphis graminum</it> (I. Tamas <it>et al.</it> unpublished results).</p>
         <p>The ability to examine further why some genes are retained and some are lost will be improved as additional closely related genomes of different sizes are sequenced and annotated. The gamma-3 Proteobacteria contain free-living bacteria with large genomes and also numerous symbiotic lineages showing varying degrees of reduction in genome size [<abbr bid="B46">46</abbr>,<abbr bid="B47">47</abbr>]. With improved ability to reconstruct ancestral genomes, this group will provide an excellent opportunity to determine the role of chance and selection in the determination of genome content and in the evolution of gene regulatory systems.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Reconstruction of the ancestor</p>
            </st>
            <p>Phylogenetic studies based on 16S rDNA sequences indicate that <it>Buchnera</it> belongs to the gamma-3 Proteobacteria and is closely related to <it>E. coli</it> (Figure <figr fid="F1">1</figr>). The precise phylogenetic position of <it>Buchnera</it> has varied depending on the method of analysis and the taxa included in the analysis. This instability is partially the result of the accelerated evolution and AT bias that affect all <it>Buchnera</it> genes including the 16S rDNA [<abbr bid="B17">17</abbr>]. Most studies place <it>Buchnera</it> either within or just outside the Enterobacteriaceae, suggesting that it diverged near the time of the common ancestor of this clade [<abbr bid="B48">48</abbr>,<abbr bid="B49">49</abbr>,<abbr bid="B50">50</abbr>,<abbr bid="B51">51</abbr>]. Thus, the reconstructed ancestor of the clade corresponding to Enterobacteriaceae (A in Figure <figr fid="F1">1</figr>) was considered to constitute a close approximation of the free-living ancestor that gave rise to <it>Buchnera.</it></p>
            <p>The genes absent from <it>Buchnera</it> and present in <it>E. coli</it> include many that are present in other related bacteria such as <it>Haemophilus influenzae</it> and <it>V. cholerae.</it> This supports the view that <it>Buchnera</it> was derived through loss of genes from an ancestor similar to <it>E. coli,</it> a conclusion also reached by Shigenobu <it>et al.</it> [<abbr bid="B8">8</abbr>]. However, some genes were acquired recently by <it>E. coli,</it> after divergence from the <it>Buchnera</it> lineage [<abbr bid="B52">52</abbr>]. The ancestral genome reconstruction was based on the <it>E. coli</it> MCl655 genome [<abbr bid="B14">14</abbr>], following the removal of genes that seemed to have been acquired after the <it>E. coli-Buchnera</it> divergence. The rules for removing a gene were based on phylogenetic distribution of orthologs among the unannotated genomes of three serovars of <it>Salmonella enterica</it> (sv. Typhi, sv. Paratyphi and sv. Typhimurium), <it>Klebsiella pneumoniae</it> and <it>Yersinia pestis</it> and the annotated genome of <it>V. cholerae</it> (GenBank accession number NC 002505) [<abbr bid="B53">53</abbr>]. The unpublished sequence data were produced by the <it>Salmonella, Klebsiella</it> and <it>Yersinia</it> Sequencing Groups at the Sanger Centre (for <it>S. enterica</it> sv. Typhi, <it>K. pneumoniae</it> and <it>Y. pestis</it>) [<abbr bid="B54">54</abbr>] and the Washington University Genome Sequencing Center (for <it>S. enterica</it> sv. Typhimurium and <it>S. enterica</it> sv. Paratyphi) [<abbr bid="B55">55</abbr>]. Determination of phylogenetic distribution was based on the 'Pip-maker' analyses [<abbr bid="B56">56</abbr>] for viewing orthology among enteric and related bacteria as produced by McClelland <it>et al.</it> [<abbr bid="B57">57</abbr>,<abbr bid="B58">58</abbr>]. To qualify as an ortholog to an <it>E. coli</it> gene, a sequence had to show at least 40% amino-acid identity for at least half of the <it>E. coli</it> gene.</p>
            <p>Inclusion of a gene in the ancestor was not dependent on its presence in all the descendant taxa, as a gene could be ancestral but lost from some lineages or acquired after the ancestor but before divergence of some descendant taxa. Therefore, to be retained in the ancestor, an ortholog had to be present in at least one of the close relatives of <it>E. coli</it> (<it>S. enterica</it> sv. Paratyphi, Typhi or Typhimurium, or <it>K. pneumoniae</it>) and also in at least one of the more distant relatives, either <it>Y. pestis</it> (representing a more basal branch of the Enterobacteriaceae) or <it>V. cholerae</it> (branching just outside the Enterobacteriaceae). These requirements effectively impose a parsimony criterion, minimizing the number of gains and losses of genes. This criterion is likely to be valid for most genes across this relatively shallow phylogeny. Sequences most likely to be misrepresented in the ancestor by this reconstruction are phage genes or insertion elements, which may move in and out of genomes too frequently for ancestral states to be reconstructed using parsimony. <it>Buchnera</it> lacks phage or insertion sequences [<abbr bid="B8">8</abbr>].</p>
            <p>The ancestor was assigned the gene order of modern <it>E. coli.</it> Although this order is almost certainly not entirely correct, comparison with other bacteria indicates that it is true for a majority of the borders between the 306 fragments (including those lost and those retained). For example, the same linkages occur in <it>E. coli</it> and <it>V. cholerae</it> and/or <it>E. coli</it> and <it>Y. pestis</it> for 219 of the junctions between fragments in <it>Buchnera</it> (example in Figure <figr fid="F5">5</figr>). This implies that most of the <it>E. coli</it> arrangements were present in the ancestor of <it>Buchnera</it> and <it>E. coli</it> and that the ends of syntenic fragments mostly represent sites at which rearrangements occurred in the evolution of <it>Buchnera.</it> A concentration of rearrangements in the <it>Buchnera</it> lineage is also suggested by the much longer average length of spacers that occur between syntenic fragments as compared to spacers that have the same flanking genes in the ancestor and in <it>E. coli</it> (see above). This longer spacer length can be interpreted as arising from remnants of genes that were rendered functionless when rearrangements occurred. When the relevant genomes currently in progress are complete and annotated, a more accurate reconstruction of ancestral gene order maybe possible.</p>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank Howard Ochman for comments on the manuscript. Financial support was from a US National Science Foundation grant (DEB-9978518) to N.M.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The diverse and dynamic structure of bacterial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Casjens</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>1998</pubdate>
            <volume>32</volume>
            <fpage>339</fpage>
            <lpage>377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.32.1.339</pubid>
                  <pubid idtype="pmpid" link="fulltext">9928484</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Bacterial evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Microbiol Rev</source>
            <pubdate>1987</pubdate>
            <volume>51</volume>
            <fpage>221</fpage>
            <lpage>271</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2439888</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The minimal cell genome: "On being the right size".</p>
            </title>
            <aug>
               <au>
                  <snm>Maniloff</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>10004</fpage>
            <lpage>10006</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">38325</pubid>
                  <pubid idtype="pmpid" link="fulltext">8816738</pubid>
                  <pubid idtype="doi">10.1073/pnas.93.19.10004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Comparative analysis of the genomes of the bacteria <it>Mycoplasma pneumoniae</it> and <it>Mycoplasma genitalium.</it></p>
            </title>
            <aug>
               <au>
                  <snm>Himmelreich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Plagens</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hilbert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reiner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Herrmann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>701</fpage>
            <lpage>712</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146517</pubid>
                  <pubid idtype="pmpid" link="fulltext">9016618</pubid>
                  <pubid idtype="doi">10.1093/nar/25.4.701</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Reductive evolution of resident genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SGE</fnm>
               </au>
               <au>
                  <snm>Kurland</snm>
                  <fnm>CG</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>6</volume>
            <fpage>263</fpage>
            <lpage>268</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0966-842X(98)01312-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9717214</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Insights into the evolutionary process of genome degradation.</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SGE</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>664</fpage>
            <lpage>671</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(99)00024-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607609</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Genome degradation is an ongoing process in <it>Rickettsia</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SGE</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>1178</fpage>
            <lpage>1191</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10486973</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Genome sequence of the endocellular bacterial symbiont of aphids <it>Buchnera</it> sp. APS.</p>
            </title>
            <aug>
               <au>
                  <snm>Shigenobu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Watanabe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <fpage>81</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9002(97)01373-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10993077</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Pseudogenes, junk DNA, and the dynamics of <it>Rickettsia</it> genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SGE</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>829</fpage>
            <lpage>839</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11319266</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The minimal gene complement of <it>Mycoplasma genitalium</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>397</fpage>
            <lpage>403</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7569993</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The complete sequence of the mucosal pathogen <it>Ureaplasma urealyticum</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Glass</snm>
                  <fnm>JI</fnm>
               </au>
               <au>
                  <snm>Lefkowitz</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Heiner</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>EY</fnm>
               </au>
               <au>
                  <snm>Cassell</snm>
                  <fnm>GH</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>407</volume>
            <fpage>757</fpage>
            <lpage>762</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35037619</pubid>
                  <pubid idtype="pmpid" link="fulltext">11048724</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The genome sequence of <it>Rickettsia prowazekii</it> and the origin of mitochondria.</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SGE</fnm>
               </au>
               <au>
                  <snm>Zomorodipour</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Sicheritz-Ponten</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Alsmark</snm>
                  <fnm>UC</fnm>
               </au>
               <au>
                  <snm>Podowski</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Naslund</snm>
                  <fnm>AK</fnm>
               </au>
               <au>
                  <snm>Eriksson</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Winkler</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Kurland</snm>
                  <fnm>CG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>396</volume>
            <fpage>133</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/24094</pubid>
                  <pubid idtype="pmpid" link="fulltext">9823893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genome sequences of <it>Chlamydia trachomatis</it> MoPn and <it>Chlamydia pneumoniae</it> AR39.</p>
            </title>
            <aug>
               <au>
                  <snm>Read</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Brunham</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>1397</fpage>
            <lpage>1406</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">111046</pubid>
                  <pubid idtype="pmpid" link="fulltext">10684935</pubid>
                  <pubid idtype="doi">10.1093/nar/28.6.1397</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The complete genome sequence of <it>Escherichia coli</it> K-12.</p>
            </title>
            <aug>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bloch</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>ColladoVides</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Rode</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>277</volume>
            <fpage>1453</fpage>
            <lpage>1474</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.277.5331.1453</pubid>
                  <pubid idtype="pmpid" link="fulltext">9278503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Are mutualism and parasitism irreversible evolutionary alternatives for endosymbiotic bacteria? Insights from molecular phylogenetics and genomics.</p>
            </title>
            <aug>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2000</pubdate>
            <volume>15</volume>
            <fpage>321</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0169-5347(00)01902-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">10884696</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Mutualists and parasites: how to paint yourself into a (metabolic) corner.</p>
            </title>
            <aug>
               <au>
                  <snm>Tamas</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Klasson</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Sandstr&#246;m</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>SGE</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2001</pubdate>
            <volume>498</volume>
            <fpage>135</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(01)02459-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">11412844</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Accelerated evolution and Muller's ratchet in endosymbiotic bacteria.</p>
            </title>
            <aug>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>2873</fpage>
            <lpage>2878</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">39726</pubid>
                  <pubid idtype="pmpid" link="fulltext">8610134</pubid>
                  <pubid idtype="doi">10.1073/pnas.93.7.2873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Evolution of prokaryotic gene order: genome rearrangements in closely related species.</p>
            </title>
            <aug>
               <au>
                  <snm>Suyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>10</fpage>
            <lpage>13</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02159-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11163906</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Genome rearrangement by replication-directed translocation.</p>
            </title>
            <aug>
               <au>
                  <snm>Tillier</snm>
                  <fnm>ERM</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>26</volume>
            <fpage>195</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/79918</pubid>
                  <pubid idtype="pmpid" link="fulltext">11017076</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Evidence for symmetric chromosomal inversions around the replication origin in bacteria.</p>
            </title>
            <aug>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <fpage>research0011.1</fpage>
            <lpage>0011.9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">16139</pubid>
                  <pubid idtype="pmpid" link="fulltext">11178265 </pubid>
                  <pubid idtype="doi">10.1186/gb-2000-1-6-research0011</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Deletional bias and the evolution of bacterial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Mira</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>589</fpage>
            <lpage>596</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(01)02447-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">11585665 </pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Sequencing and analysis of bacterial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Rudd</snm>
                  <fnm>KE</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>1996</pubdate>
            <volume>6</volume>
            <fpage>404</fpage>
            <lpage>416</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8723345</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Complete genome sequence of <it>Pseudomonas aeruginosa</it> PAO1, an opportunistic pathogen.</p>
            </title>
            <aug>
               <au>
                  <snm>Stover</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Pham</snm>
                  <fnm>XQ</fnm>
               </au>
               <au>
                  <snm>Erwin</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Mizoguchi</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Warrener</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Brinkman</snm>
                  <fnm>FSL</fnm>
               </au>
               <au>
                  <snm>Hufnagle</snm>
                  <fnm>WO</fnm>
               </au>
               <au>
                  <snm>Kowalik</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Lagrou</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>406</volume>
            <fpage>959</fpage>
            <lpage>964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35023079</pubid>
                  <pubid idtype="pmpid" link="fulltext">10984043</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Compilation and analysis of <it>Escherichia coli</it> promoter DNA sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Hawley</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>McClure</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1983</pubdate>
            <volume>11</volume>
            <fpage>2237</fpage>
            <lpage>2255</lpage>
            <xrefbib>
               <pubid idtype="pmpid">6344016 </pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Eubacterial sigma-factors.</p>
            </title>
            <aug>
               <au>
                  <snm>Wosten</snm>
                  <fnm>MMSM</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Rev</source>
            <pubdate>1998</pubdate>
            <volume>22</volume>
            <fpage>127</fpage>
            <lpage>150</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-6445(98)00011-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9818380</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Mutations of bacteriophage T7 that affect the initiation of synthesis of gene 0.3 protein.</p>
            </title>
            <aug>
               <au>
                  <snm>Dunn</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Buzash-Pollert</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Studier</snm>
                  <fnm>FW</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1978</pubdate>
            <volume>75</volume>
            <fpage>2741</fpage>
            <lpage>2745</lpage>
            <xrefbib>
               <pubid idtype="pmpid">275843</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Initiation of translation in prokaryotes and eukaryotes.</p>
            </title>
            <aug>
               <au>
                  <snm>Kozak</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1999</pubdate>
            <volume>234</volume>
            <fpage>187</fpage>
            <lpage>208</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(99)00210-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">10395892</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Evidence for genetic drift in endosymbionts: analyses of protein-coding genes.</p>
            </title>
            <aug>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <fpage>83</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10331254</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Psyllid endosymbionts exhibit patterns of co-speciation with hosts and destabilizing substitutions in ribosomal RNA.</p>
            </title>
            <aug>
               <au>
                  <snm>Spaulding</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>von Dohlen</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Insect Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>10</volume>
            <fpage>57</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2583.2001.00231.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11240637</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Intracellular symbionts of aphids possess many genome copies per bacterium.</p>
            </title>
            <aug>
               <au>
                  <snm>Komaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>48</volume>
            <fpage>717</fpage>
            <lpage>722</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10229576</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Decay of mutualistic potential in aphid endosymbionts through silencing of biosynthetic loci: <it>Buchnera</it> of <it>Diuraphis</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>Proc R Soc Lond B</source>
            <pubdate>2000</pubdate>
            <volume>267</volume>
            <fpage>1423</fpage>
            <lpage>1431</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rspb.2000.1159</pubid>
                  <pubid idtype="pmpid" link="fulltext">10983826</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A minimal gene set for cellular life derived by comparison of complete bacterial genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>10268</fpage>
            <lpage>10273</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">38373</pubid>
                  <pubid idtype="pmpid" link="fulltext">8816789 </pubid>
                  <pubid idtype="doi">10.1073/pnas.93.19.10268</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The minimal genome concept.</p>
            </title>
            <aug>
               <au>
                  <snm>Mushegian</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1999</pubdate>
            <volume>9</volume>
            <fpage>709</fpage>
            <lpage>714</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(99)00023-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">10607608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Global transposon mutagenesis and a minimal <it>Mycoplasma</it> genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Hutchison</snm>
                  <fnm>CN</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Cline</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>2165</fpage>
            <lpage>2169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.286.5447.2165</pubid>
                  <pubid idtype="pmpid" link="fulltext">10591650</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>How many genes can make a cell: the minimal-gene-set concept.</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Annu Rev Genom Human Genet</source>
            <pubdate>2000</pubdate>
            <volume>1</volume>
            <fpage>99</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genom.1.1.99</pubid>
                  <pubid idtype="pmpid" link="fulltext">11701626 </pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Measuring genome evolution.</p>
            </title>
            <aug>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>5849</fpage>
            <lpage>5856</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">34486</pubid>
                  <pubid idtype="pmpid" link="fulltext">9600883 </pubid>
                  <pubid idtype="doi">10.1073/pnas.95.11.5849</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Pea aphid symbiont relationships established by analysis of 16S rRNAs.</p>
            </title>
            <aug>
               <au>
                  <snm>Unterman</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McLean</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1989</pubdate>
            <volume>171</volume>
            <fpage>2970</fpage>
            <lpage>2974</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">210002</pubid>
                  <pubid idtype="pmpid">2470724</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Assembly of RecA-like recombinases: Distinct roles for mediator proteins in mitosis and meiosis.</p>
            </title>
            <aug>
               <au>
                  <snm>Gasior</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Olivares</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ear</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Hari</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Weichselbaum</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bishop</snm>
                  <fnm>DK</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>8411</fpage>
            <lpage>8418</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">37451</pubid>
                  <pubid idtype="pmpid" link="fulltext">11459983</pubid>
                  <pubid idtype="doi">10.1073/pnas.121046198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Sequence analysis of a 34.7-kb DNA segment from the genome of <it>Buchnera aphidicola</it> (endosymbiont of aphids) containing <it>groEL</it>, <it>dnaA</it>, the atp operon, <it>gidA</it>, and <it>rho</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Curr Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>36</volume>
            <fpage>158</fpage>
            <lpage>163</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9516544</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Evolutionary dynamics of full genome content in <it>Escherichia coli</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>IB</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2000</pubdate>
            <volume>19</volume>
            <fpage>6637</fpage>
            <lpage>6643</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/emboj/19.24.6637</pubid>
                  <pubid idtype="pmpid" link="fulltext">11118198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Massive gene decay in the leprosy bacillus.</p>
            </title>
            <aug>
               <au>
                  <snm>Cole</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Eiglmeier</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Thomson</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Honore</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Garnier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>1007</fpage>
            <lpage>1011</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35059006</pubid>
                  <pubid idtype="pmpid" link="fulltext">11234002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Comparing genomes within the species <it>Mycobacterium tuberculosis.</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kato-Maeda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Salamon</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Smittipat</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Small</snm>
                  <fnm>PM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>547</fpage>
            <lpage>554</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.166401</pubid>
                  <pubid idtype="pmpid" link="fulltext">11282970</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Reducing the genome size of organelles favors gene transfer to the nucleus.</p>
            </title>
            <aug>
               <au>
                  <snm>Selosse</snm>
                  <fnm>M-A</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Godelle</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2001</pubdate>
            <volume>16</volume>
            <fpage>135</fpage>
            <lpage>141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0169-5347(00)02084-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">11179577</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Overlapping genes in bacterial and phage genomes.</p>
            </title>
            <aug>
               <au>
                  <snm>Scherbakov</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Garber</snm>
                  <fnm>MB</fnm>
               </au>
            </aug>
            <source>Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>34</volume>
            <fpage>485</fpage>
            <lpage>495</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11042850 </pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The decoupling of genome size and sequence divergence in a symbiotic bacterium.</p>
            </title>
            <aug>
               <au>
                  <snm>Wernegreen</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2000</pubdate>
            <volume>182</volume>
            <fpage>3867</fpage>
            <lpage>3869</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">94565</pubid>
                  <pubid idtype="pmpid" link="fulltext">10851009</pubid>
                  <pubid idtype="doi">10.1128/JB.182.13.3867-3869.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>A novel application of gene arrays: <it>Escherichia coli</it> array provides insight into the biology of the obligate endosymbiont of tsetse flies.</p>
            </title>
            <aug>
               <au>
                  <snm>Akman</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Aksoy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>7546</fpage>
            <lpage>7551</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">34705</pubid>
                  <pubid idtype="pmpid" link="fulltext">11404467</pubid>
                  <pubid idtype="doi">10.1073/pnas.131057498</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Genome size determination and coding capacity of <it>Sodalis glossinidius</it>, an enteric symbiont of tsetse flies, as revealed by hybridization to <it>Escherichia coli</it> gene arrays.</p>
            </title>
            <aug>
               <au>
                  <snm>Akman</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rio</snm>
                  <fnm>RVM</fnm>
               </au>
               <au>
                  <snm>Beard</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Aksoy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <fpage>4517</fpage>
            <lpage>4525</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">95346</pubid>
                  <pubid idtype="pmpid" link="fulltext">11443086</pubid>
                  <pubid idtype="doi">10.1128/JB.183.15.4517-4525.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>A molecular clock in endosymbiotic bacteria is calibrated using the insect hosts.</p>
            </title>
            <aug>
               <au>
                  <snm>Moran</snm>
                  <fnm>NA</fnm>
               </au>
               <au>
                  <snm>Munson</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Baumann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc R Soc Lond B</source>
            <pubdate>1993</pubdate>
            <volume>253</volume>
            <fpage>167</fpage>
            <lpage>171</lpage>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Putative evolutionary origin of plasmids carrying the genes involved in leucine biosynthesis in <it>Buchnera aphidicola</it> (endosymbiont of aphids).</p>
            </title>
            <aug>
               <au>
                  <snm>van Ham</snm>
                  <fnm>RCHJ</fnm>
               </au>
               <au>
                  <snm>Moya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Latorre</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <fpage>4768</fpage>
            <lpage>4777</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">179323</pubid>
                  <pubid idtype="pmpid" link="fulltext">9244264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Phylogenetic characterization and molecular evolution of bacterial endosymbionts in psyllids (Hemiptera: Sternorrhyncha).</p>
            </title>
            <aug>
               <au>
                  <snm>Spaulding</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>von Dohlen</snm>
                  <fnm>CD</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>1506</fpage>
            <lpage>1513</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12572614</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Systematic relationships and cospeciation of bacterial endosymbionts and their carpenter ant host species: proposal of the new taxon Candidatus <it>Blochmannia</it> gen. nov.</p>
            </title>
            <aug>
               <au>
                  <snm>Sauer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stackebrandt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gadau</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holldobler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>50</volume>
            <fpage>1877</fpage>
            <lpage>1886</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11034499</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Molecular archaeology of the <it>Escherichia coli</it> genome.</p>
            </title>
            <aug>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>9413</fpage>
            <lpage>9417</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21352</pubid>
                  <pubid idtype="pmpid" link="fulltext">9689094</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.16.9413</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>DNA sequence of both chromosomes of the cholera pathogen <it>Vibrio cholerae</it>.</p>
            </title>
            <aug>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>L</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>406</volume>
            <fpage>477</fpage>
            <lpage>483</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/172459</pubid>
                  <pubid idtype="pmpid" link="fulltext">10952301</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Sanger Centre: microbial genomes</p>
            </title>
            <url>http://www.sanger.ac.uk/Projects/Microbes/</url>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Washington University Genome Sequencing Center: bacterial genomes</p>
            </title>
            <url>http://genome.wustl.edu/gsc/Projects/bacteria.shtml</url>
         </bibl>
         <bibl id="B56">
            <title>
               <p>PipMaker: A web server for aligning two genomic DNA sequences.</p>
            </title>
            <aug>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Frazer</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Smit</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Riemer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bouck</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hardison</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>577</fpage>
            <lpage>586</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.10.4.577</pubid>
                  <pubid idtype="pmpid" link="fulltext">10779500</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Comparison of the <it>Escherichia coli</it> K-12 genome with sampled genomes of a <it>Klebsiella pneumoniae</it> and three <it>Salmonella enterica</it> serovars, Typhimurium, Typhi and Paratyphi.</p>
            </title>
            <aug>
               <au>
                  <snm>McClelland</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Florea</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Clifton</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Dougan</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>4974</fpage>
            <lpage>4986</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">115240</pubid>
                  <pubid idtype="pmpid" link="fulltext">11121489</pubid>
                  <pubid idtype="doi">10.1093/nar/28.24.4974</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>The Enteric server</p>
            </title>
            <url>http://galapagos.cse.psu.edu/enterix/enteric/enteric.html</url>
         </bibl>
      </refgrp>
   </bm>
</art>
