<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2000-1-6-research0011</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl rating="0">
         <title>
            <p>Evidence for symmetric chromosomal inversions around the replication origin in bacteria</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="no" pa="no" da="no">
               <snm>Eisen</snm>
               <mi>A</mi>
               <fnm>Jonathan</fnm>
               <insr iid="I1"/>
               <email>jeisen@tigr.org</email>
            </au>
            <au id="A2" ca="no" ce="no" pa="no" da="no">
               <snm>Heidelberg</snm>
               <mi>F</mi>
               <fnm>John</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A3" ca="no" ce="no" pa="no" da="no">
               <snm>White</snm>
               <fnm>Owen</fnm>
               <insr iid="I1"/>
            </au>
            <au id="A4" ca="no" ce="no" pa="no" da="no">
               <snm>Salzberg</snm>
               <mi>L</mi>
               <fnm>Steven</fnm>
               <insr iid="I1"/>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>The Institute for Genomic Research, 9712 Medical Center Drive, Rockville, MD 20850, USA</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2000</pubdate>
         <volume>1</volume>
         <issue>6</issue>
         <fpage>research0011.1</fpage>
         <lpage>research0011.9</lpage>
         <url>http://genomebiology.com/2000/1/6/research/0011</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi" link="fulltext">10.1186/gb-2000-1-6-research0011</pubid>
               <pubid idtype="pmpid" link="fulltext">11178265</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>7</day>
               <month>8</month>
               <year>2000</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>25</day>
               <month>9</month>
               <year>2000</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>19</day>
               <month>10</month>
               <year>2000</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>4</day>
               <month>12</month>
               <year>2000</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2000</year>
         <collab>GenomeBiology.com</collab>
      </cpyrt>
      <shortabs>
         <p>By comparing recently completed bacterial genome sequences to those of closely related species an unusual and previously unobserved feature of bacterial genome structure has been identified. The distance of a particular feature from the replication origin is conserved between closely related pairs of species. This suggests that chromosomal inversions around the replication origin are a common feature of bacterial genome evolution.</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Whole-genome comparisons can provide great insight into many aspects of biology. Until recently, however, comparisons were mainly possible only between distantly related species. Complete genome sequences are now becoming available from multiple sets of closely related strains or species.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>By comparing the recently completed genome sequences of <it>Vibrio cholerae</it>,						<it>Streptococcus pneumoniae</it> and <it>Mycobacterium tuberculosis</it> to those of closely related species - <it>Escherichia coli, Streptococcus pyogenes</it> and <it>Mycobacterium leprae,</it> respectively - we have identified an unusual and previously unobserved feature of bacterial genome structure. Scatterplots of the conserved sequences (both DNA and protein) between each pair of species produce a distinct X-shaped pattern, which we call an X-alignment. The key feature of these alignments is that they have symmetry around the replication origin and terminus; that is, the distance of a particular conserved feature (DNA or protein) from the replication origin (or terminus) is conserved between closely related pairs of species. Statistically significant X-alignments are also found within some genomes, indicating that there is symmetry about the replication origin for paralogous features as well.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>The most likely mechanism of generation of X-alignments involves large chromosomal inversions that reverse the genomic sequence symmetrically around the origin of replication. The finding of these X-alignments between many pairs of species suggests that chromosomal inversions around the origin are a common feature of bacterial genome evolution.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Large-scale genomic rearrangements and duplications are important in the evolution of species. Previously, these large-scale genome-changing events were studied through genetic or cytological studies. With the availability of many complete genome sequences it is now possible to study such events through comparative genomics. The publication of the yeast genome has led to much better insight into the duplication events that have occurred in fungal and eukaryotic evolution (for example, see [<abbr bid="B1">1</abbr>]). Large chromosomal duplications have also been found from analysis of completed chromosomes of <it>Arabidopsis thaliana</it> [<abbr bid="B2">2</abbr>,<abbr bid="B3">3</abbr>]. The ability to detect large-scale genomic changes is dependent in large part on which genomes are available. Such studies in bacteria, for example, have been limited by the availability of genomes only from distantly related sets of species. Recently, however, the genomes of sets of closely related bacterial species have become available. We have compared these closely related bacterial genomes and have discovered an unusual phenomenon - alignments of whole genomes that show an X-shaped pattern (which we refer to as X-alignments). Here we present the evidence for these X-alignments and discuss mechanisms that might have produced them.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Whole-genome X-alignments between species at the DNA level</p>
            </st>
            <p>We compared the DNA sequences of the two chromosomes of <it>Vibrio cholerae</it> [<abbr bid="B4">4</abbr>] with the sequence of the <it>Escherichia coli</it> chromosome [<abbr bid="B5">5</abbr>] using a suffix tree alignment algorithm [<abbr bid="B6">6</abbr>]. The analysis revealed a significant alignment at the DNA level between the <it>V. cholerae</it> large chromosome (chrI) [<abbr bid="B4">4</abbr>] and the <it>E. coli</it> chromosome [<abbr bid="B5">5</abbr>] spanning the entire length of these chromosomes (Figure <figr fid="F1">1a</figr>). Analysis of the reverse complement of <it>V. cholerae</it> chrI with <it>E. coli</it> also produced a significant alignment (Figure <figr fid="F1">1b</figr>). When superimposed, the two alignments produce a clear 'X' shape (Figure <figr fid="F1">1c</figr>) that is symmetric about the origin of replication of both genomes. This symmetry indicates that matching sequences tend to occur at the same distance from the origin but not necessarily on the same side of the origin. The X-alignment between <it>V. cholerae</it> and <it>E. coli</it> was found to be statistically significant using a test based on the number of matches found in diagonal strips in the alignment (see the Materials and methods section). Specifically, when <it>V. cholerae</it> chrI is aligned in the forward direction against <it>E. coli</it>, there are 459 maximal unique matching subsequences (MUMs; see the Materials and methods section), of which 177 occurred in a diagonal strip covering 10% of the total area (compared to the expected value of 46). The probability of observing this high a number of MUMs by chance is 4.7 &#215; 10<sup>-59</sup>. The alignment of <it>V. cholerae</it> chrI in the reverse direction against <it>E. coli</it> (which corresponds to the MUMs on the anti-diagonal) has a probability of 1.8 &#215; 10<sup>-90</sup>. As a control, we compared the genomes of distantly related species, such as <it>E. coli</it> and <it>Mycobacterium tuberculosis.</it> These do not show a significant X-alignment (Table <tblr tid="T1">1</tblr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Between-species whole-genome DNA alignments</p>
               </caption>
               <text>
                  <p>Between-species whole-genome DNA alignments. Plots of maximally unique matching subsequences (MUMs) between genomes as identified by the MUMmer program. <b>(a)</b><it>V. cholerae</it> chrI forward strand versus <it>E. coli</it> forward strand. <b>(b)</b><it>E. coli</it> forward versus <it>V. cholerae</it> chrI reverse. <b>(c)</b><it>V. cholerae</it> chrI versus <it>E. coli</it>, forward and reverse overlaid. <b>(d)</b><it>S. pneumoniae</it> forward versus <it>S. pyogenes</it> forward and reverse overlaid. <b>(e)</b><it>M. tuberculosis</it> forward versus <it>M. leprae</it> forward and reverse overlaid. A point (<it>x, y</it>) indicates a DNA sequence that occurs once within each genome, at location <it>x</it> in one genome and at location <it>y</it> in the other genome. The matching sequences may occur on either the forward or the reverse strand; in either case, the locations indicate the 5' end of the sequences. The point (0,0) corresponds to the origin of replication for each genome.</p>
               </text>
               <graphic file="gb-2000-1-6-research0011-1"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Whole-genome DNA alignments using MUMmer</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Organism 1</p>
                     </c>
                     <c ca="left">
                        <p>Organism 2</p>
                     </c>
                     <c ca="center">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>Matches</p>
                     </c>
                     <c ca="center">
                        <p>Probability<sup>*</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>MUMs</p>
                     </c>
                     <c ca="center">
                        <p>within</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>diagonal</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>(10%)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>459</p>
                     </c>
                     <c ca="center">
                        <p>177</p>
                     </c>
                     <c ca="center">
                        <p>4.7 &#177; 10<sup>-59</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>V. cholerae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>467</p>
                     </c>
                     <c ca="center">
                        <p>217</p>
                     </c>
                     <c ca="center">
                        <p>1.8 &#177; 10<sup>-90</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>V. cholerae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>342</p>
                     </c>
                     <c ca="center">
                        <p>86</p>
                     </c>
                     <c ca="center">
                        <p>8.2 &#177; 10<sup>-16</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>E. coli</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,128</p>
                     </c>
                     <c ca="center">
                        <p>225</p>
                     </c>
                     <c ca="center">
                        <p>1.5 &#177; 10<sup>-23</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>S. pyogenes</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>S. pneumoniae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>706</p>
                     </c>
                     <c ca="center">
                        <p>259</p>
                     </c>
                     <c ca="center">
                        <p>4.5 &#177; 10<sup>-80</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>S. pyogenes</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>S. pneumoniae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>626</p>
                     </c>
                     <c ca="center">
                        <p>255</p>
                     </c>
                     <c ca="center">
                        <p>2.3 &#177; 10<sup>-90</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>S. pyogenes</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>S. pyogenes</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>367</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                     <c ca="center">
                        <p>1.1 &#177; 10<sup>-18</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>S. pneumoniae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>S. pneumoniae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,054</p>
                     </c>
                     <c ca="center">
                        <p>154</p>
                     </c>
                     <c ca="center">
                        <p>1.5 &#177; 10<sup>-6</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>M. leprae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>M. leprae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>449</p>
                     </c>
                     <c ca="center">
                        <p>89</p>
                     </c>
                     <c ca="center">
                        <p>3.5 &#177; 10<sup>-10</sup></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>M. tuberculosis</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>M. tuberculosis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>2,476</p>
                     </c>
                     <c ca="center">
                        <p>268</p>
                     </c>
                     <c ca="center">
                        <p>0.092</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>M. tuberculosis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>81</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>0.06</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>E. coli</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>M. tuberculosis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>0.84</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>*</sup>Statistical significance was estimated as described in the text; rev, reverse complement sequence.</p>
               </tblfn>
            </tbl>
            <p>We have found that X-alignments of whole genomes are not limited to the <it>V. cholerae</it> versus <it>E. coli</it> comparison. For example, a whole-genome comparison of two bacteria in the genus <it>Streptococcus - S. pyogenes</it> [<abbr bid="B7">7</abbr>] and <it>S. pneumoniae</it> (H. Tettelin, personal communication) - reveals a global X-alignment similar to that of <it>V. cholerae</it> versus <it>E. coli</it> (Figure <figr fid="F1">1d</figr>) which is also statistically significant (Table <tblr tid="T1">1</tblr>). In addition, an X-alignment is found between two species in the genus <it>Mycobacterium - M. tuberculosis</it> [<abbr bid="B8">8</abbr>] and <it>M. leprae</it> [<abbr bid="B9">9</abbr>] (Figure <figr fid="F1">1e</figr>) - as well as between two strains of <it>Helicobacter pylori</it> (data not shown). The X-alignments observed between any two pairs of genomes are not identical in every aspect. For example, in the alignment between the two <it>Mycobacterium</it> species, each conserved region is much longer than in the other genome pairs. We believe this is due to different numbers of evolutionary events between the species (see below). Whole-genome X-alignments were not found between any other pairs of species, although a related pattern was seen between some of the chlamydial species (see below).</p>
         </sec>
         <sec>
            <st>
               <p>Whole-genome X-alignments between species are also found at the proteome level</p>
            </st>
            <p>To test whether the X-alignments found in the DNA analysis could also be found at the level of whole proteomes, we conducted comparisons of homologous proteins between species (see the Materials and methods section). Figure <figr fid="F2">2a</figr> shows a scatterplot of chromosome positions of all proteins homologous between <it>V. cholerae</it> chrI and <it>E. coli.</it> The presence of many large gene families causes a great deal of noise in this comparison. This noise can be reduced by considering only the best matching homolog for each open reading frame (ORF), rather than all protein homologs (Figure <figr fid="F2">2b</figr>). This filtered protein comparison results in an X-alignment that is statistically significant (Table <tblr tid="T2">2</tblr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Whole-genome proteome alignments</p>
               </caption>
               <text>
                  <p>Whole-genome proteome alignments. Plots show the chromosome locations of pairs of predicted proteins that have significant similarity (on the basis of fasta3 comparisons). <b>(a)</b><it>V. cholerae</it> chrI versus <it>E. coli</it>. All significant matches for each <it>V. cholerae</it> ORF are shown. <b>(b)</b><it>V. cholerae</it> chrI versus <it>E. coli</it>, top matches. Only the best match for each <it>V. cholerae</it> ORF is shown. The filtering for top matches for each ORF removes noise due to the presence of many large multigene families. This X-alignment is highly statistically significant (Table <tblr tid="T2">2</tblr>). <b>(c)</b><it>Chlamydia trachomatis</it> versus <it>C. pneumoniae</it>, top matches. Only the best match for each <it>C. trachomatis</it> ORF is shown. The position of the origins (R) and termini (T) of replication are slightly shifted to see the inversions better. This pattern is consistent with the occurrence a small number of inversions around the origin and terminus in the two lineages since their divergence from a common ancestor (see Figure <figr fid="F4">4</figr>). <b>(d)</b><it>V. cholerae</it> chrI versus <it>V. cholerae</it> chrI, self-alignment. Only recently duplicated pairs of genes are shown. Recent duplications were operationally defined as those genes that were more similar to another gene in <it>V. cholerae</it> than to any gene in any other complete genome sequence. The faint X-alignment is statistically significant (Table <tblr tid="T2">2</tblr>). No significant X-alignment was detected when all pairs of paralogs were included.</p>
               </text>
               <graphic file="gb-2000-1-6-research0011-2"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Whole-genome protein-level comparisons</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Organism 1</p>
                     </c>
                     <c ca="left">
                        <p>Organism 2</p>
                     </c>
                     <c ca="center">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>Matches</p>
                     </c>
                     <c ca="center">
                        <p>Probability<sup>*</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>matches</p>
                     </c>
                     <c ca="center">
                        <p>within</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>10%</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>diagonal</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Top matches</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,797</p>
                     </c>
                     <c ca="center">
                        <p>369</p>
                     </c>
                     <c ca="center">
                        <p>3.2 &#177; 10<sup>-40</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>V. cholerae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,797</p>
                     </c>
                     <c ca="center">
                        <p>441</p>
                     </c>
                     <c ca="center">
                        <p>2.3 &#177; 10<sup>-70</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>701</p>
                     </c>
                     <c ca="center">
                        <p>145</p>
                     </c>
                     <c ca="center">
                        <p>3.6 &#177; 10<sup>-17</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>V. cholerae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>701</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                     <c ca="center">
                        <p>0.52</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,985</p>
                     </c>
                     <c ca="center">
                        <p>286</p>
                     </c>
                     <c ca="center">
                        <p>3.6 &#177; 10<sup>-10</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>E. coli</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>E. coli</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1,985</p>
                     </c>
                     <c ca="center">
                        <p>210</p>
                     </c>
                     <c ca="center">
                        <p>0.20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Recent duplications<sup>&#8224;</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>195</p>
                     </c>
                     <c ca="center">
                        <p>60</p>
                     </c>
                     <c ca="center">
                        <p>1.0 &#177; 10<sup>-15</sup></p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p><it>V. cholerae</it> (rev)</p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>V. cholerae</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>195</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>8.4 &#177; 10<sup>-4</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>*</sup>Statistical significance was estimated as described in the text. <sup>&#8224;</sup>Best match to another <it>V. cholerae</it> ORF versus any other complete genome.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Whole-genome X-alignments within species</p>
            </st>
            <p>The finding of the X-alignment pattern between species led us to search for similar patterns within species; that is, global alignments of a genome with its own reverse complement. Of the genomes for which we found between-species X-alignments (<it>M. tuberculosis, M. leprae, S. pyogenes, S. pneumoniae, E. coli</it> and <it>V. cholerae),</it> statistically significant self-alignments are detected for all except <it>M. tuberculosis</it> (Figure <figr fid="F3">3</figr>; probabilities shown in Table <tblr tid="T1">1</tblr>). Interestingly, these self-alignments are not as strong as those between species. Proteome analysis also shows an X-alignment within species (shown for <it>V. cholerae</it> chrI in Figure <figr fid="F2">2d</figr>; probabilities shown in Table <tblr tid="T2">2</tblr>). The X-alignment of proteins within <it>V. cholerae</it> chrI is statistically significant only for recently duplicated-genes, but disappears when all paralogs are included. The importance of filtering for recent duplications is discussed below.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Within-genome DNA alignments</p>
               </caption>
               <text>
                  <p>Within-genome DNA alignments. Plots of exactly matching sequences within four genomes as identified by the MUMmer program: <b>(a)</b><it>V. cholerae</it>; <b>(b)</b><it>E. coli</it>; <b>(c)</b><it>S. pyogenes</it>; <b>(d)</b><it>S. pneumoniae</it>. A point (<it>x,y</it>) indicates a DNA sequence that is repeated within the genome, occurring once at location <it>x</it> and again at location <it>y</it>. Points near the diagonal (<it>y</it> = <it>x</it>) correspond to tandem repeats. Points near the anti-diagonal (<it>y</it> = <it>L</it> - <it>x</it>, where <it>L</it> is genome length) correspond to repeats that occur at symmetric locations about the origin of location. The point (0,0) corresponds to the origin of replication in each plot. Statistically significant X-alignments occur in all four species (Table <tblr tid="T1">1</tblr>).</p>
               </text>
               <graphic file="gb-2000-1-6-research0011-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Model I: whole-genome inverted duplications</p>
            </st>
            <p>One possible explanation for an X-alignment within and between species is an ancestral inverted duplication of the whole genome, as has been suggested for <it>E. coli</it> [<abbr bid="B10">10</abbr>]. The weak or missing X-alignment within species could be explained by gene loss of one of the two duplicates of many of the pairs of genes in the different lineages. Gene loss has been found to follow large chromosomal or genome duplications [<abbr bid="B11">11</abbr>,<abbr bid="B12">12</abbr>,<abbr bid="B13">13</abbr>]. This gene loss is thought to stabilize large duplications by preventing recombination events between duplicate genes. If gene loss is responsible for the weak X-alignment within species, then to maintain the X-alignments between species, the member of the gene pair lost in a particular lineage should be essentially random. If an ancient inverted duplication followed by differential gene loss is the correct explanation for the observed X-alignments, one would expect the genes along one diagonal to be orthologous between species (related to each other by the speciation event), while the genes along the other diagonal should be paralogous (related to each other by the genome duplication event before the speciation of the two lineages). However, the evidence appears to contradict this model: likely orthologous gene pairs are equally distributed on each diagonal (data not shown).</p>
         </sec>
         <sec>
            <st>
               <p>Model II: chromosomal inversions about the origin and/or terminus</p>
            </st>
            <p>A second possible explanation for the X-alignments is that an underlying mechanism allows sections of DNA to move within the genome but maintains the distance of these sections from the origin and/or terminus. There are a variety of possible mechanisms for such movement, but we believe the most likely explanation is the occurrence of large chromosomal inversions that pivot around the replication origin and/or terminus. Large chromosomal inversions, including those that occur around the replication origin and terminus, have been shown to occur in <it>E. coli</it> and <it>Salmonella typhimurium</it> in the laboratory (see, for example, [<abbr bid="B14">14</abbr>,<abbr bid="B15">15</abbr>,<abbr bid="B16">16</abbr>,<abbr bid="B17">17</abbr>,<abbr bid="B18">18</abbr>]). The occurrence of such inversions over evolutionary time scales was first suggested by comparative analysis of the complete genomes of four strains in the genus <it>Chlamydia</it> [<abbr bid="B19">19</abbr>]. In that study, we found that the major chromosomal differences between <it>C. pneumoniae</it> and <it>C. trachomatis</it> (shown in Figure <figr fid="F2">2c</figr>) were consistent with the occurrence of large inversions that pivoted around the origin and terminus (including multiple inversions of different sizes). In Figure <figr fid="F4">4</figr> we present a hypothetical model showing how a small number of inversions centered around the origin or terminus could produce patterns very similar to those seen in the <it>Chlamydia, Mycobacterium</it> and <it>Helicobacter</it> comparisons. The continued occurrence of such inversion over longer time scales would result in an X-alignment similar to that seen in the <it>V. cholerae</it> versus <it>E. coli</it> and <it>S. pneumoniae</it> versus <it>S. pyogenes</it> comparisons. Thus the different between-species X-alignments could be the result of different numbers of inversions between particular pairs of species.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Schematic model of genome inversions</p>
               </caption>
               <text>
                  <p>Schematic model of genome inversions. The model shows an initial speciation event, followed by a series of inversions in the different lineages (A and B). Inversions occur between the asterisks (<sup>*</sup>). Numbers on the chromosome refer to hypothetical genes 1-32. At time point 1, the genomes of the two species are still co-linear (as indicated in the scatterplot of A1 versus B1). Between time point 1 and time point 2, each species (A and B) undergoes a large inversion about the terminus (as indicated in the scatterplots of A1 versus A2 and B1 versus B2). This results in the between-species scatterplot looking as if there have been two nested inversions (A2 versus B2), similar to that seen for <it>C. trachomatis</it> versus <it>C. pneumoniae</it> (see Figure <figr fid="F2">2</figr>). Between time point 2 and time point 3 each species undergoes an additional inversion (as indicated in the scatterplots of B2 versus B3 and A2 versus A3). This results in the between-species scatterplots beginning to resemble an X-alignment, similar to that seen in <it>M. tuberculosis</it> versus <it>M. leprae</it> (see Figure <figr fid="F2">2</figr>).</p>
               </text>
               <graphic file="gb-2000-1-6-research0011-4"/>
            </fig>
            <p>Inversions about the origin and terminus could also produce an X-alignment within species, through the splitting of tandemly duplicated sequence. Many sets of tandemly duplicated genes are found in most bacterial genomes [<abbr bid="B19">19</abbr>,<abbr bid="B20">20</abbr>] (also see Figure <figr fid="F3">3a</figr>,<figr fid="F3">c</figr>). As tandem duplications are inherently unstable (one of the duplicates can be rapidly eliminated by slippage and/or recombination events [<abbr bid="B21">21</abbr>]), the fact that many tandem pairs are present within each genome suggests that tandem duplications occur frequently. Thus, it is reasonable to assume that occasionally a large inversion will split a pair of tandemly duplicated genes. An inversion that pivots about the origin and also splits a tandem duplication will result in a pair of paralogous genes spaced symmetrically on opposite sides of the origin.</p>
            <p>If our inversion model is correct, then the genes along both diagonals in the <it>between</it>-species alignments should be orthologous, which is the case (see above). In contrast, genes along the anti-diagonal in the <it>within</it>-species X-alignments should be recent tandem duplicates that have been separated by inversions. This also appears to be the case - in the within-species analysis of <it>V. cholerae</it> chrI ORFs, the X-alignment shows up best when only recent duplicates are analyzed (Figure <figr fid="F2">2d</figr>). The splitting of tandem duplicates by inversions may be a general mechanism to stabilize the coexistence of duplicated genes, as it will prevent their elimination by unequal crossing-over or replication slippage events.</p>
            <p>What could cause inversions that pivot around the origin and terminus of the genome to occur more frequently than other inversions? One possibility is that many inversions occur, but there is selection against those that change the distance of a gene from the origin or terminus. Such a possibility has been suggested by experimental work in <it>E. coli</it> [<abbr bid="B14">14</abbr>,<abbr bid="B15">15</abbr>]. Additional studies have, however, suggested that there is little selective difference between inversions and that instead there may be certain regions that are more prone to inversion than others [<abbr bid="B16">16</abbr>,<abbr bid="B17">17</abbr>,<abbr bid="B18">18</abbr>,<abbr bid="B22">22</abbr>,<abbr bid="B23">23</abbr>]. Alternatively, the inversion events could be linked to replication, as has been suggested for small local inversion events [<abbr bid="B24">24</abbr>]. Whatever the mechanisms, the fact that we find evidence for such inversions between many pairs of species suggests that they are a common feature of bacterial evolution. Many aspects of the X-alignments require further exploration. For example, to split a tandem duplication, an inversion must fall precisely on the boundary between two duplicated genes. This would appear to be unlikely, requiring a large number of inversions in order to generate a sufficient number of split gene pairs. If the mechanisms of gene duplication are somehow related to the mechanisms of inversion, however, then this model is more plausible. The process of duplicating a gene, if it occurs during replication, might promote a recombination event within the bacterial chromosome that inverts the sequence from the origin up to that point. As with inversion events, recombination and replication have been found to be tightly coupled [<abbr bid="B25">25</abbr>].</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>We present here a novel observation regarding the conservation between bacterial species of the distance of particular genes from the replication origin or terminus. The initial observation was only possible due to the availability of complete genome sequences from pairs of moderately closely related species (for example, <it>V. cholerae</it> and <it>E. coli</it>). This shows the importance of having genome pairs from many levels of evolutionary relatedness. Comparisons of distantly related species enable the determination of universal features of life as well as of events that occur very rarely. Comparison of very closely related species allows the identification of frequent events such as transitional changes at third codon positions or tandem duplications. To elucidate all other events in the history of life, genome pairs covering all the intermediate levels of evolutionary relatedness will be needed.</p>
      </sec>
      <sec>
         <st>
            <p>Materials and methods</p>
         </st>
         <sec>
            <st>
               <p>Genomes analyzed</p>
            </st>
            <p>Complete published genome sequences were obtained from the National Center for Biotechnology Information website [<abbr bid="B26">26</abbr>] or from the TIGR Comprehensive Microbial Resource [<abbr bid="B27">27</abbr>]. These included <it>Aeropyrum pernix</it> [<abbr bid="B28">28</abbr>], <it>Aquifex aeolicus</it> [<abbr bid="B29">29</abbr>], <it>Archaeoglobus fulgidus</it> [<abbr bid="B30">30</abbr>], <it>Bacillus subtilis</it> [<abbr bid="B31">31</abbr>], <it>Borrelia burgdorferi</it> [<abbr bid="B32">32</abbr>], <it>Campylobacter jejuni</it> [<abbr bid="B33">33</abbr>], <it>Chlamydia pneumoniae</it> AR39 [<abbr bid="B19">19</abbr>], <it>Chlamydia pneumoniae</it> CWL029 [<abbr bid="B34">34</abbr>], <it>Chlamydia trachomatis</it> (D/UW-3/Cx) [<abbr bid="B35">35</abbr>], <it>Chlamydia trachomatis</it> MoPn [<abbr bid="B19">19</abbr>], <it>Deinococcus radiodurans</it> [<abbr bid="B36">36</abbr>], <it>Escherichia coli</it> [<abbr bid="B5">5</abbr>], <it>Haemophilus influenzae</it> [<abbr bid="B37">37</abbr>], <it>Helicobacter pylori</it> [<abbr bid="B38">38</abbr>], <it>Helicobacter pylori</it> J99 [<abbr bid="B39">39</abbr>], <it>Methanobacterium thermoautotrophicum</it> [<abbr bid="B40">40</abbr>], <it>Methanococcus jannaschii</it> [<abbr bid="B41">41</abbr>], <it>Mycobacterium tuberculosis</it> [<abbr bid="B8">8</abbr>], <it>Mycoplasma genitalium</it> [<abbr bid="B42">42</abbr>], <it>Mycoplasma pneumoniae</it> [<abbr bid="B43">43</abbr>], <it>Neisseria meningitidis</it> MC58 [<abbr bid="B20">20</abbr>], <it>Neisseria meningitidis</it> serogroup A strain Z2491 [<abbr bid="B44">44</abbr>], <it>Pyrococcus horikoshii</it> [<abbr bid="B45">45</abbr>], <it>Rickettsia prowazekii</it> [<abbr bid="B46">46</abbr>], <it>Synechocystis</it> sp. [<abbr bid="B47">47</abbr>], <it>Thermotoga maritima</it> [<abbr bid="B48">48</abbr>], <it>Treponema pallidum</it> [<abbr bid="B49">49</abbr>], and <it>Vibrio cholerae</it> [<abbr bid="B4">4</abbr>]. In addition, a few unpublished genomes were analyzed: <it>Streptococcus pyogenes</it> (obtained from the Oklahoma University Genome Center website [<abbr bid="B7">7</abbr>]), <it>Streptococcus pneumoniae</it> (H. Tettelin, personal communication), and <it>Mycobacterium leprae</it> (obtained from the Sanger Centre Pathogen Sequencing Group website [<abbr bid="B9">9</abbr>]).</p>
         </sec>
         <sec>
            <st>
               <p>Whole-genome DNA alignments</p>
            </st>
            <p>DNA alignments of the complete genomic sequences of all bacteria used in this study were accomplished with the MUMmer program [<abbr bid="B6">6</abbr>]. This program uses an efficient suffix tree construction algorithm to rapidly compute alignments of entire genomes. The algorithm identifies all exact matches of nucleotide subsequences that are contained in both input sequences; these exact matches must be longer than a specified minimum length, which was set to 20 base pairs for this comparison. To search for genome-scale alignments within species, complete bacterial and archaeal genomes (25 in total including all published genomes) were aligned with their own reverse complements. To search for between-species alignments, all genomes were aligned against all others in both orientations.</p>
         </sec>
         <sec>
            <st>
               <p>Whole-genome protein comparisons</p>
            </st>
            <p>The predicted proteome of each complete genome sequence (all predicted proteins in the genome) was compared to the proteomes of all complete genome sequences (including itself) using the fasta3 program [<abbr bid="B50">50</abbr>]. Matches with an expected score (e-value) of 10<sup>-5</sup> or less were considered significant.</p>
         </sec>
         <sec>
            <st>
               <p>Statistical significance of X-alignments</p>
            </st>
            <p>To calculate the statistical significance of the X-alignments, the maximal unique matching subsequences (MUMs) for unrelated genomes were examined and found to be uniformly distributed [<abbr bid="B6">6</abbr>]. With a uniform background, the expected density of MUMs in any region of an alignment plot is a simple proportion of the area of that region to the entire plot. In particular, in an alignment with <it>N</it> total MUMs, the probability (Pr) of observing at least <it>m</it> matches in a region with area <it>p</it> can be computed using the binomial distribution in Equation 1:</p>
            <graphic file="gb-2000-1-6-research0011-I1.gif"/>
            <p>The alignment of <it>V. cholerae</it> chrI (both forward and reverse strands) versus <it>E. coli</it> contains 926 MUMs. The MUMs forming X-alignments appear along the diagonal (<it>y</it> = <it>x</it>) and the anti-diagonal (<it>y</it> = <it>L -x,</it> where <it>L</it> is the genome length). To estimate the significance of the alignments in both directions, diagonal strips were sampled along each of the diagonals. The width of each strip was set at 10% of the plot area and significance values were calculated (Table <tblr tid="T1">1</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Identification of origins of replication</p>
            </st>
            <p>The origins of replication for the bacterial genomes have been characterized by a variety of methods. For <it>E. coli, M. tuberculosis</it> and <it>M. leprae,</it> the origins have been well-characterized by laboratory studies [<abbr bid="B51">51</abbr>,<abbr bid="B52">52</abbr>]. The origins and termini of <it>C. trachomatis, C. pneumoniae</it> and <it>V. cholerae</it> were identified by GC-skew [<abbr bid="B53">53</abbr>] and by characteristic genes in the region of the origin [<abbr bid="B4">4</abbr>,<abbr bid="B19">19</abbr>]. GC-skew uses the function (G-C)/(G+C) computed on 2,000 bp windows across the genome, which exhibits a clear tendency in many bacterial genomes to be positive for the leading strand and negative for the lagging strand. The origin of <it>H. pylori</it> was determined by oligomer skew [<abbr bid="B54">54</abbr>] and confirmed by GC-skew. The origins and termini of <it>S. pneumoniae</it> and <it>S. pyogenes</it> were determined by the authors of the present study using GC-skew analysis and the locations of characteristic genes, particularly the chromosome replication initiator gene <it>dnaA.</it></p>
         </sec>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank S. Eddy, M.A. Riley, T. Read, A. Stoltzfus, M-I Benito and I. Paulsen for helpful comments, suggestions and discussions. S.L.S. was supported in part by NSF grant IIS-9902923 and NIH grant R01 LM06845. S.L.S. and J.A.E were supported in part by NSF grant KDI-9980088. Data for all published complete genome sequences were obtained from the NCBI genomes database [<abbr bid="B26">26</abbr>] or from The Institute for Genomic Research (TIGR) Microbial Genome Database [<abbr bid="B27">27</abbr>]. The sequences of <it>V. cholerae</it>, <it>S. pneumoniae,</it> and <it>M. tuberculosis</it> (CDC 1551) were determined at TIGR with support from NIH and the NIAID. The <it>M. leprae</it> sequence data were produced by the Pathogen Sequencing Group at the Sanger Centre. Sequencing of <it>M. leprae</it> is funded by the Heiser Program for Research in Leprosy and Tuberculosis of The New York Community Trust and by L'Association Raoul Follereau. The <it>M. tuberculosis</it> CDC 1551 genome sequence was obtained from TIGR. The source of the <it>S. pyogenes</it> genome sequence was the Streptococcal Genome Sequencing Project funded by USPHS/NIH grant AI38406, and was kindly made available by B. A. Roe, S.P. Linn, L. Song, X. Yuan, S. Clifton, R.E. McLaughlin, M. McShan and J. Ferretti, and can be obtained from the website of the Oklahoma University Genome Center [<abbr bid="B7">7</abbr>].</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1" rating="0">
            <title>
               <p>Updated map of duplicated regions in the yeast genome.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Seoighe</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1999</pubdate>
            <volume>238</volume>
            <fpage>253</fpage>
            <lpage>261</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1016/S0378-1119(99)00319-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">10571001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2" rating="0">
            <title>
               <p>Sequence and analysis of chromosome 2 of the plant <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lin</snm>
                  <fnm>X</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kaul</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rounsley</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shea</snm>
                  <fnm>TP</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Benito</snm>
                  <fnm>MI</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Town</snm>
                  <fnm>CD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fujii</snm>
                  <fnm>CY</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mason</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bowman</snm>
                  <fnm>CL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Barnstead</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>402</volume>
            <fpage>761</fpage>
            <lpage>768</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/45471</pubid>
                  <pubid idtype="pmpid" link="fulltext">10617197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3" rating="0">
            <title>
               <p>Sequence and analysis of chromosome 4 of the plant <it>Arabidopsis thaliana</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mayer</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schuller</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wambutt</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Murphy</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Volckaert</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Pohl</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dusterhoft</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Stiekema</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Entian</snm>
                  <fnm>KD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Terryn</snm>
                  <fnm>N</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>402</volume>
            <fpage>769</fpage>
            <lpage>777</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/47134</pubid>
                  <pubid idtype="pmpid" link="fulltext">10617198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4" rating="0">
            <title>
               <p>The genome sequence of <it>Vibrio cholerae</it>, the aetiologic agent of cholera.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Umayam</snm>
                  <fnm>L</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>406</volume>
            <fpage>477</fpage>
            <lpage>483</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1086/172459</pubid>
                  <pubid idtype="pmpid" link="fulltext">10952301</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5" rating="0">
            <title>
               <p>The complete genome sequence of <it>Escherichia coli</it> K-12.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Plunkett</snm>
                  <fnm>GI</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bloch</snm>
                  <fnm>CA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Collado-Vides</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rode</snm>
                  <fnm>CK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1997</pubdate>
            <volume>277</volume>
            <fpage>1453</fpage>
            <lpage>1462</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1126/science.277.5331.1453</pubid>
                  <pubid idtype="pmpid" link="fulltext">9278503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6" rating="0">
            <title>
               <p>Alignment of whole genomes.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Delcher</snm>
                  <fnm>AL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kasif</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>2369</fpage>
            <lpage>2376</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">148804</pubid>
                  <pubid idtype="pmpid" link="fulltext">10325427</pubid>
                  <pubid idtype="doi" link="fulltext">10.1093/nar/27.11.2369</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7" rating="0">
            <title>
               <p>Oklahoma University Genome Center</p>
            </title>
            <url>http://www.genome.ou.edu/strep.html</url>
         </bibl>
         <bibl id="B8" rating="0">
            <title>
               <p>Deciphering the biology of <it>Mycobacterium tuberculosis</it> from the complete genome sequence.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cole</snm>
                  <fnm>ST</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brosch</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Garnier</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gordon</snm>
                  <fnm>SV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eiglmeier</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gas</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Barry</snm>
                  <fnm>CE</fnm>
                  <suf>III</suf>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>393</volume>
            <fpage>537</fpage>
            <lpage>544</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/31159</pubid>
                  <pubid idtype="pmpid" link="fulltext">9634230</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9" rating="0">
            <title>
               <p>Sanger Centre Pathogen Sequencing Group</p>
            </title>
            <url>ftp://ftp.sanger.ac.uk/pub/pathogens/leprae</url>
         </bibl>
         <bibl id="B10" rating="0">
            <title>
               <p>Proposal concerning mechanism of evolution of the genome of <it>Escherichia coli</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zipkas</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1975</pubdate>
            <volume>72</volume>
            <fpage>1354</fpage>
            <lpage>1358</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1093179</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11" rating="0">
            <title>
               <p>The fate of duplicated genes: loss or new function?</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>BioEssays</source>
            <pubdate>1998</pubdate>
            <volume>20</volume>
            <fpage>785</fpage>
            <lpage>788</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1002/(SICI)1521-1878(199810)20:10&lt;785::AID-BIES2>3.0.CO;2-M</pubid>
                  <pubid idtype="pmpid" link="fulltext">10200118</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12" rating="0">
            <title>
               <p>The probability of duplicate gene preservation by subfunctionalization.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Force</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>154</volume>
            <fpage>459</fpage>
            <lpage>473</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10629003</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13" rating="0">
            <title>
               <p>Comparable rates of gene loss and functional divergence after genome duplications early in vertebrate evolution.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nadeau</snm>
                  <fnm>JH</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sankoff</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1997</pubdate>
            <volume>147</volume>
            <fpage>1259</fpage>
            <lpage>1266</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9383068</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14" rating="0">
            <title>
               <p>Constraints in chromosomal inversions in <it>Escherichia coli</it> are not explained by replication pausing at inverted terminator-like sequences.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Francois</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Louarn</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Patte</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rebollo</snm>
                  <fnm>JE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Louarn</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1990</pubdate>
            <volume>4</volume>
            <fpage>537</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2191180</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15" rating="0">
            <title>
               <p>Detection and possible role of two large nondivisible zones on the <it>Escherichia coli</it> chromosome.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rebollo</snm>
                  <fnm>JE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Francois</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Louarn</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1988</pubdate>
            <volume>85</volume>
            <fpage>9391</fpage>
            <lpage>9395</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3059345</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16" rating="0">
            <title>
               <p>Rearrangement of the bacterial chromosome: forbidden inversions.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Segall</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mahan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1988</pubdate>
            <volume>241</volume>
            <fpage>1314</fpage>
            <lpage>1318</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3045970</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17" rating="0">
            <title>
               <p>Ability of a bacterial chromosome segment to invert is dictated by included material rather than flanking sequence.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mahan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1991</pubdate>
            <volume>129</volume>
            <fpage>1021</fpage>
            <lpage>1032</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1783289</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18" rating="0">
            <title>
               <p>Recombination between homologies in direct and inverse orientation in the chromosome of <it>Salmonella</it> : intervals which are nonpermissive for inversion formation.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Segall</snm>
                  <fnm>AM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1989</pubdate>
            <volume>122</volume>
            <fpage>737</fpage>
            <lpage>747</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2547692</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19" rating="0">
            <title>
               <p>Genome sequences of <it>Chlamydia trachomatis</it> MoPn and <it>Chlamydia pneumoniae</it> AR39.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Read</snm>
                  <fnm>TD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brunham</snm>
                  <fnm>RC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shen</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Utterback</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Berry</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>1397</fpage>
            <lpage>1406</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">111046</pubid>
                  <pubid idtype="pmpid" link="fulltext">10684935</pubid>
                  <pubid idtype="doi" link="fulltext">10.1093/nar/28.6.1397</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20" rating="0">
            <title>
               <p>Complete genome sequence of <it>Neisseria meningitidis</it> serogroup B strain MC58.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Saunders</snm>
                  <fnm>NJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Heidelberg</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jeffries</snm>
                  <fnm>AC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hood</snm>
                  <fnm>DW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peden</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <fpage>1809</fpage>
            <lpage>1815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1126/science.287.5459.1809</pubid>
                  <pubid idtype="pmpid" link="fulltext">10710307</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21" rating="0">
            <title>
               <p>Preservation of duplicate genes by complementary, degenerative mutations.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Force</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lynch</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Pickett</snm>
                  <fnm>FB</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Amores</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yan</snm>
                  <fnm>YL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Postlethwait</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1999</pubdate>
            <volume>151</volume>
            <fpage>1531</fpage>
            <lpage>1545</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10101175</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22" rating="0">
            <title>
               <p>Selection and endpoint distribution of bacterial inversion mutations.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schmid</snm>
                  <fnm>MB</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1983</pubdate>
            <volume>105</volume>
            <fpage>539</fpage>
            <lpage>557</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">6357944</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23" rating="0">
            <title>
               <p>Reciprocality of recombination events that rearrange the chromosome.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mahan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Roth</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1988</pubdate>
            <volume>120</volume>
            <fpage>23</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3065138</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24" rating="0">
            <title>
               <p>Inversions with deletions and duplications.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gordon</snm>
                  <fnm>AJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Halliday</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1995</pubdate>
            <volume>140</volume>
            <fpage>411</fpage>
            <lpage>414</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7635304</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25" rating="0">
            <title>
               <p>Recombination enhancement by replication (RER) in <it>Rhizobium etli</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Valencia-Morales</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Romero</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>154</volume>
            <fpage>971</fpage>
            <lpage>983</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10757747</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26" rating="0">
            <title>
               <p>National Center for Biotechnology Information, Entrez Genomes</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Genome</url>
         </bibl>
         <bibl id="B27" rating="0">
            <title>
               <p>The Institute for Genomic Research Microbial Genome Database</p>
            </title>
            <url>http://www.tigr.org/tdb/mdb/mdb.html</url>
         </bibl>
         <bibl id="B28" rating="0">
            <title>
               <p>Complete genome sequence of an aerobic hyper-thermophilic crenarchaeon, <it>Aeropyrum pernix</it> K1.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hino</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yamazaki</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Haikawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jin-no</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Takahashi</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sekine</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Baba</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ankai</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>DNA Res</source>
            <pubdate>1999</pubdate>
            <volume>6</volume>
            <fpage>83</fpage>
            <lpage>101</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10382966</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29" rating="0">
            <title>
               <p>The complete genome of the hyperthemophilic bacterium <it>Aquifex aeolicus</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Deckert</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Warren</snm>
                  <fnm>PV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Young</snm>
                  <fnm>WG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lenox</snm>
                  <fnm>AL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Grahams</snm>
                  <fnm>DE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Overbeek</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Snead</snm>
                  <fnm>MA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Keller</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Aujay</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>392</volume>
            <fpage>353</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/32831</pubid>
                  <pubid idtype="pmpid" link="fulltext">9537320</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30" rating="0">
            <title>
               <p>The complete genomic sequence of the hyperthermophilic, sulfate-reducing archaeon <it>Archaeoglobus fulgidus</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Klenk</snm>
                  <fnm>H-P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tomb</snm>
                  <fnm>J-F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelsen</snm>
                  <fnm>KE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gwinn</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>390</volume>
            <fpage>364</fpage>
            <lpage>370</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/37052</pubid>
                  <pubid idtype="pmpid" link="fulltext">9389475</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31" rating="0">
            <title>
               <p>The complete genome sequence of the Gram-positive bacterium <it>Bacillus subtilis</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kunst</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Moszer</snm>
                  <fnm>I</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Albertini</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Alloni</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Azevedo</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bertero</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bessieres</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bolotin</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Borchert</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>390</volume>
            <fpage>249</fpage>
            <lpage>256</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/36786</pubid>
                  <pubid idtype="pmpid" link="fulltext">9384377</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32" rating="0">
            <title>
               <p>Genomic sequence of a Lyme disease spirochaete, <it>Borrelia burgdorferi</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Norris</snm>
                  <fnm>SJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gwinn</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>390</volume>
            <fpage>580</fpage>
            <lpage>586</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/37551</pubid>
                  <pubid idtype="pmpid" link="fulltext">9403685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33" rating="0">
            <title>
               <p>The genome sequence of the food-borne pathogen <it>Campylobacter jejuni</it> reveals hypervariable sequences.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ketley</snm>
                  <fnm>JM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Davies</snm>
                  <fnm>RM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>665</fpage>
            <lpage>668</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/35001088</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34" rating="0">
            <title>
               <p>Comparative genomes of <it>Chlamydia pneumoniae</it> and <it>C. trachomatis</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kalman</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mitchell</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Marathe</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lammel</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fan</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hyman</snm>
                  <fnm>RW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Olinger</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Grimwood</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Stephens</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>1999</pubdate>
            <volume>21</volume>
            <fpage>385</fpage>
            <lpage>389</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/7716</pubid>
                  <pubid idtype="pmpid" link="fulltext">10192388</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35" rating="0">
            <title>
               <p>Genome sequence of an obligate intracellular pathogen of humans: <it>Chlamydia trachomatis</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Stephens</snm>
                  <fnm>RS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kalman</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lammel</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fan</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Marathe</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mitchell</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Olinger</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhao</snm>
                  <fnm>Q</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <fpage>754</fpage>
            <lpage>759</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1126/science.282.5389.754</pubid>
                  <pubid idtype="pmpid" link="fulltext">9784136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36" rating="0">
            <title>
               <p>Genome sequence of the radioresistant bacterium <it>Deinococcus radiodurans</it> R1.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Richardson</snm>
                  <fnm>DL</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>286</volume>
            <fpage>1571</fpage>
            <lpage>1577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1126/science.286.5444.1571</pubid>
                  <pubid idtype="pmpid" link="fulltext">10567266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37" rating="0">
            <title>
               <p>Whole-genome random sequencing and assembly of <it>Haemophilus influenzae</it> Rd.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kirkness</snm>
                  <fnm>EF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Merrick</snm>
                  <fnm>JM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>269</volume>
            <fpage>496</fpage>
            <lpage>512</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7542800</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38" rating="0">
            <title>
               <p>The complete genome sequence of the gastric pathogen <it>Helicobacter pylori</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gill</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>388</volume>
            <fpage>539</fpage>
            <lpage>547</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/41483</pubid>
                  <pubid idtype="pmpid" link="fulltext">9252185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39" rating="0">
            <title>
               <p>Genomic-sequence comparison of two unrelated isolates of the human gastric pathogen <it>Helicobacter pylori</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Alm</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ling</snm>
                  <fnm>LS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Moir</snm>
                  <fnm>DT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>King</snm>
                  <fnm>BL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brown</snm>
                  <fnm>ED</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Doig</snm>
                  <fnm>PC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Noonan</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Guild</snm>
                  <fnm>BC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>deJonge</snm>
                  <fnm>BL</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>397</volume>
            <fpage>176</fpage>
            <lpage>180</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/16495</pubid>
                  <pubid idtype="pmpid" link="fulltext">9923682</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40" rating="0">
            <title>
               <p>Complete genome sequence of <it>Methanobacterium thermoautotrophicum</it> DH: functional analysis and comparative genomics.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Smith</snm>
                  <fnm>DR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Doucette-Stamm</snm>
                  <fnm>LA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Deloughery</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lee</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dubois</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Aldredge</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bashirzadeh</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blakely</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cook</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gilbert</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1996</pubdate>
            <volume>179</volume>
            <fpage>7135</fpage>
            <lpage>7155</lpage>
         </bibl>
         <bibl id="B41" rating="0">
            <title>
               <p>Complete genome sequence of the methanogenic archaeon, <it>Methanococcus jannaschii</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Olsen</snm>
                  <fnm>GJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhou</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fitzgerald</snm>
                  <fnm>LM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>273</volume>
            <fpage>1058</fpage>
            <lpage>1073</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8688087</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42" rating="0">
            <title>
               <p>The minimal gene complement of <it>Mycoplasma genitalium</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sutton</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kelley</snm>
                  <fnm>JM</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>397</fpage>
            <lpage>403</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">7569993</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43" rating="0">
            <title>
               <p>Complete sequence analysis of the genome of the bacterium <it>Mycoplasma pneumoniae</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Himmelreich</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hilbert</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Plagens</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Pirkl</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Li</snm>
                  <fnm>BC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Herrmann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1996</pubdate>
            <volume>24</volume>
            <fpage>4420</fpage>
            <lpage>4449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">146264</pubid>
                  <pubid idtype="pmpid" link="fulltext">8948633</pubid>
                  <pubid idtype="doi" link="fulltext">10.1093/nar/24.22.4420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44" rating="0">
            <title>
               <p>Complete DNA sequence of a serogroup A strain of <it>Neisseria meningitidis</it> Z2491.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Achtman</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>James</snm>
                  <fnm>KD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bentley</snm>
                  <fnm>SD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Klee</snm>
                  <fnm>SR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Morelli</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brown</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>404</volume>
            <fpage>502</fpage>
            <lpage>506</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/35006655</pubid>
                  <pubid idtype="pmpid" link="fulltext">10761919</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45" rating="0">
            <title>
               <p>Complete sequence and gene organization of the genome of a hyperthermophilic archaebacterium, <it>Pyrococcus horikoshii</it> OT3.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sawada</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Haikawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hino</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yamamoto</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sekine</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Baba</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kosugi</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hosoyama</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>DNA Res</source>
            <pubdate>1998</pubdate>
            <volume>5</volume>
            <fpage>55</fpage>
            <lpage>76</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9679194</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46" rating="0">
            <title>
               <p>The genome sequence of <it>Rickettsia prowazekii</it> and the origin of mitochondria.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Andersson</snm>
                  <fnm>SG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zomorodipour</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sicheritz-Ponten</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Alsmark</snm>
                  <fnm>UC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Podowski</snm>
                  <fnm>RM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Naslund</snm>
                  <fnm>AK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eriksson</snm>
                  <fnm>AS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Winkler</snm>
                  <fnm>HH</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kurland</snm>
                  <fnm>CG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>396</volume>
            <fpage>133</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/24094</pubid>
                  <pubid idtype="pmpid" link="fulltext">9823893</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47" rating="0">
            <title>
               <p>Sequence analysis of the genome of the unicellular cyanobacterium <it>Synechocystis</it> sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kaneko</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sato</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kotani</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tanaka</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Asamizu</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nakamura</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Miyajima</snm>
                  <fnm>N</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hirosawa</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sugiura</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sasamoto</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>DNA Res</source>
            <pubdate>1996</pubdate>
            <volume>3</volume>
            <fpage>109</fpage>
            <lpage>136</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8905231</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B48" rating="0">
            <title>
               <p>Evidence for lateral gene transfer between Archaea and bacteria from genome sequence of <it>Thermotoga maritima</it>.</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <s