<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>gb-2008-9-2-r42</ui>
   <ji>GBJ</ji>
   <fm>
      <dochead>Research</dochead>
      <bibl>
         <title>
            <p>Visualization of pseudogenes in intracellular bacteria reveals the different tracks to gene destruction</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Fuxelius</snm>
               <fnm>Hans-Henrik</fnm>
               <insr iid="I1"/>
               <email>Hans-Henrik.Fuxelius@ebc.uu.se</email>
            </au>
            <au id="A2">
               <snm>Darby</snm>
               <mi>C</mi>
               <fnm>Alistair</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>Alistar.darby@liverpool.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Cho</snm>
               <fnm>Nam-Huyk</fnm>
               <insr iid="I2"/>
               <email>chonh@snu.ac.kr</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Andersson</snm>
               <mi>GE</mi>
               <fnm>Siv</fnm>
               <insr iid="I1"/>
               <email>Siv.Andersson@ebc.uu.se</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular Evolution, Evolutionary Biology Center, Uppsala University, Norbyv&#228;gen 18C, S-752 36 Uppsala, Sweden</p>
            </ins>
            <ins id="I2">
               <p>Department of Microbiology and Immunology, College of Medicine and Institute of Endemic Diseases, Seoul National University Medical Research Center and Bundang hospital, 28 Yongon-Dong, Chongno-Gu, Seoul 110-799, Republic of Korea</p>
            </ins>
            <ins id="I3">
               <p>Vector Group, Liverpool School of Tropical Medicine, Pembroke Place, Liverpool L3 5QA, UK</p>
            </ins>
         </insg>
         <source>Genome Biology</source>
         <issn>1465-6906</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>2</issue>
         <fpage>R42</fpage>
         <url>http://genomebiology.com/2008/9/2/R42</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18302730</pubid>
               <pubid idtype="doi">10.1186/gb-2008-9-2-r42</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>10</day>
               <month>12</month>
               <year>2007</year>
            </date>
         </rec>
         <revrec>
            <date>
               <day>13</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </revrec>
         <acc>
            <date>
               <day>26</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>26</day>
               <month>02</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Fuxelius et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <shorttitle>
         <p>Gene destruction in Rickettsia </p>
      </shorttitle>
      <shortabs>
         <p>Variably present genes and pseudogenes in Rickettsia species tend to have been acquired more recently and to be more divergent from the genes conserved across all species</p>
      </shortabs>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Pseudogenes reveal ancestral gene functions. Some obligate intracellular bacteria, such as <it>Mycobacterium leprae </it>and <it>Rickettsia </it>spp., carry substantial fractions of pseudogenes. Until recently, horizontal gene transfers were considered to be rare events in obligate host-associated bacteria.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present a visualization tool that displays the relationships and positions of degraded and partially overlapping gene sequences in multiple genomes. With this tool we explore the origin and deterioration patterns of the <it>Rickettsia </it>pseudogenes and find that variably present genes and pseudogenes tend to have been acquired more recently, are more divergent in sequence, and exhibit a different functional profile compared with genes conserved across all species. Overall, the origin of only one-quarter of the variable genes and pseudogenes can be traced back to the common ancestor of <it>Rickettsia </it>and the outgroup genera <it>Orientia </it>and <it>Wolbachia</it>. These sequences contain only a few disruptive mutations and show a broad functional distribution profile, much like the core genes. The remaining genes and pseudogenes are extensively degraded or solely present in a single species. Their functional profile was heavily biased toward the mobile gene pool and genes for components of the cell wall and the lipopolysaccharide.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Reductive evolution of the vertically inherited genomic core accounts for 25% of the predicted genes in the variable segments of the <it>Rickettsia </it>genomes, whereas 75% stems from the flux of the mobile gene pool along with genes for cell surface structures. Thus, most of the variably present genes and pseudogenes in <it>Rickettsia </it>have arisen from recent acquisitions.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
         <classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Pseudogenes represent a heterogeneous collection of sequences, ranging from genes with an internal stop codon or frameshift mutation to extensively degraded genes. Pseudogenes and noncoding DNA were originally considered to be rare in bacteria. However, a recent genomic survey identified 7,000 pseudogenes in 64 bacterial genomes, a large fraction of which had arisen from 'failed' horizontal gene transfers <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Recently evolved pathogens in particular have many pseudogenes <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, and the genomes of intracellular bacteria such as <it>Rickettsia </it>and <it>Mycobacteria </it>have exceptionally high fractions of noncoding DNA and pseudogenes (>25%) <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. This has been accounted for by reductive genome evolution and small effective population sizes <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Also, increased exploitation of host metabolites and reduced selective pressure for rapid growth in the nutritionally rich eukaryotic cytoplasm may allow mutations to accumulate in essential bacterial genes. Furthermore, it was suggested that the reduced threat of genetic parasites in the protected intracellular environment has lowered the genomic deletion rate, making pseudogene elimination a slower process <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. However, this model was based on the assumption that horizontal gene transfers are rare in intracellular bacterial populations.</p>
         <p>As more and more genomes are sequenced, it is becoming increasingly clear that obligate host-associated bacteria are not immune to the spread of genetic parasites. All kinds of mobile elements, plasmids, integrated conjugative elements, prophages and transposons have been identified in one or another species of intracellular bacteria <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B8">8</abbr></abbrgrp>. In fact, the most highly repeated bacterial genome identified to date is that of an obligate intracellular pathogen, namely <it>Orientia tsutsugamushi </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. This genome contains about 37% repetitive sequences (>200 bp), most of which represent clusters of deteriorating genes for conjugative transfer systems and eukaryotic-like proteins putatively involved in host cell adaptation processes <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The intracellular arena hypothesis posits that the transfer of mobile genetic elements occurs in these populations but is restricted to intracellular bacterial communities that infect the same hosts <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. One expectation from this hypothesis is that the circulating pool of mobile elements may be different for free-living and intracellular bacterial populations. Another prediction is that the recent evolutionary history of mobile elements in intracellular bacteria follows host specialization patterns rather than the phylogeny of the bacterial core genes.</p>
         <p>With the growing realization that mobile genetic elements are circulating among obligate host-associated bacteria, it is time to revisit the source of the many pseudogenes in these bacterial populations. The genus <it>Rickettsia </it>represents an excellent model system for such studies; genome sizes are small while pseudogene contents are high. Furthermore, the availability of genomic data from multiple <it>Rickettsia </it>spp. <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>, now also including the closely related outgroup species <it>O. tsutsugamushi </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, and the more distantly related outgroup <it>Wolbachia pipientis </it>from <it>Drosophila melanogaster </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and <it>Brugia malayi </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, provides all of the raw material needed for such a study. Because we wished to study the deterioration process over time, we placed the analyses within a phylogenetic context, with the underlying species tree essentially as outlined previously <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp> with three main groups: the spotted fever group (SFG: <it>Rickettsia conorii</it>, <it>Rickettsia sibirica</it>, and <it>Rickettsia rickettsii</it>), the transitional group (<it>Rickettsia akari </it>and <it>Rickettsia felis</it>), and the typhus group (TG: <it>Rickettsia prowazekii </it>and <it>Rickettsia typhi</it>). <it>Rickettsia bellii </it>is the earliest diverging lineage in the genus and is a member of the ancestral group <it>Rickettsia</it>.</p>
         <p>Previous studies of pseudogenes have either traced the degradation pathway of a few individual genes <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp> or identified pseudogenes <it>en masse</it>, ignoring the various stages of the degradation process <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. With the aid of our recently developed software for visualization and comparison of closely related genomes, we have performed a large-scale analysis of the deterioration process, in order to investigate the source and the ancestral function of the variable segments in <it>Rickettsia</it>. This study was implemented to provide a general model for the evolution of host-adapted bacterial genomes, including lineage-specific expansion and deterioration of host-interaction genes. We also sought to understand better the connection between the load of pseudogenes and the spread of selfish genetic elements.</p>
         <p>The results suggest that the variability among <it>Rickettsia </it>genomes is due to a slow and steady accumulation of mutations in essential genes, as well as to a more rapid degradation of genes acquired by horizontal gene transfer at the base of the <it>Rickettsia </it>lineage. In addition, the circulation of genetic parasites across the modern species continues to generate variability.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Identification of positional orthologs</p>
            </st>
            <p>We developed GenComp for the visualization of gene order structures and pseudogene relationships across multiple closely related genomes. The program was applied to a comparative analysis of eight <it>Rickettsia </it>genomes and a closely related outgroup, <it>O. tsutsugamushi</it>, plus the more distantly related outgroup <it>W. pipientis</it>. We first predicted open reading frames (ORFs) with the aid of Glimmer <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> using similar settings for all genomes. Homolog identification across the seven most closely related <it>Rickettsia </it>genomes (excluding <it>R. bellii</it>) was accomplished by basic local alignment search tool (BLAST) searches <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> followed by clustering with the aid of Tribe-MCL <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. A total of 9,450 Glimmer-predicted ORFs were clustered into 2,940 homologous gene families when applying the length ratio criteria 0.80 for homologous groups, with up to 59 genes in each family, including 359 single-gene families.</p>
            <p>Information about gene location was considered with the aid of the visualization component of GenComp to identify the final set of positional homologs. This resolved many of the clusters with large numbers of homologs into groups of true orthologs that are conserved in two to six species or, in the full set, seven species. The conserved orthologs were fused into 86 metaclusters (segments with conserved gene order structures). These represent from 84% (<it>R. felis</it>) to 93% (<it>R. prowazekii</it>) of the <it>Rickettsia </it>genomes. Present in the metaclusters were 665 gene families with a cluster size of seven, representing single-copy genes that are conserved in sequence across all seven taxa, using 80% as the length cut-off value. We refer to the positional orthologs present in all seven species as the 'R7 core genes'. In total, this set comprises 688 genes, which accounts for 62% of the TG genomes and 47% to 56% of the SFG genomes. The mean size of the R7 core genes in <it>Rickettsia </it>was 1,006 to 1,010 bp (median size 850 to 860 bp) per species (typical of bacterial genes; Table <tblr tid="T1">1</tblr>).</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Gene sizes of the R7 core genes and R2 to R6 strain-variable ORFs located in the variable segments</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>RNumber</p>
                     </c>
                     <c ca="left">
                        <p>Number of ORFs</p>
                     </c>
                     <c ca="left">
                        <p>Mean size (bp)</p>
                     </c>
                     <c ca="left">
                        <p>Median size (bp)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R7</p>
                     </c>
                     <c ca="left">
                        <p>4,816</p>
                     </c>
                     <c ca="left">
                        <p>1,009</p>
                     </c>
                     <c ca="left">
                        <p>856</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R6</p>
                     </c>
                     <c ca="left">
                        <p>366</p>
                     </c>
                     <c ca="left">
                        <p>976</p>
                     </c>
                     <c ca="left">
                        <p>744</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R5</p>
                     </c>
                     <c ca="left">
                        <p>225</p>
                     </c>
                     <c ca="left">
                        <p>1006</p>
                     </c>
                     <c ca="left">
                        <p>579</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R4</p>
                     </c>
                     <c ca="left">
                        <p>304</p>
                     </c>
                     <c ca="left">
                        <p>567</p>
                     </c>
                     <c ca="left">
                        <p>330</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R3</p>
                     </c>
                     <c ca="left">
                        <p>558</p>
                     </c>
                     <c ca="left">
                        <p>404</p>
                     </c>
                     <c ca="left">
                        <p>237</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R2</p>
                     </c>
                     <c ca="left">
                        <p>696</p>
                     </c>
                     <c ca="left">
                        <p>332</p>
                     </c>
                     <c ca="left">
                        <p>219</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>'Rnumber' refers to the number of species carrying open reading frame (ORF) homologs. bp, base pairs.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Identification of <it>Rickettsia </it>variable segments</p>
            </st>
            <p>We identified a combined total of 658 <it>Rickettsia </it>intergenic segments, defined as sequences flanked by orthologs in the R7 core gene set. These segments represent from 30% of the TG genomes to 38% of the <it>R. felis </it>genome. Glimmer-predicted ORFs located inside <it>Rickettsia </it>variable segments (RVSs) and present in one to six species, or present in all seven species but differing by more than 20% in size, are here referred to as the 'strain-variable ORFs', irrespectively of their origin, size, and functional status. An ORF-cluster was defined as a set of positional homologs conserved across two or more species. Paralogs located in different genomic regions were manually sorted into separate ORF-clusters. In total, 1,160 ORF-clusters were predicted in 304 of the 658 RVSs; these were used as the starting point for all subsequent analyses, as schematically outlined in Figure <figr fid="F1">1</figr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Identification of strain-variable ORFs</p>
               </caption>
               <text>
                  <p>Identification of strain-variable ORFs. Presented is a schematic illustration of the process whereby strain-variable open reading frames ORFs located in variable segments of the <it>Rickettsia </it>genomes were identified and analyzed. RVS, <it>Rickettsia </it>variable segment.</p>
               </text>
               <graphic file="gb-2008-9-2-r42-1"/>
            </fig>
            <p>The ORF-containing segments ranged in size from a mean of 510 bp to 854 bp per species (median sizes: 143 bp to 186 bp) and contained on average 3.82 ORF-clusters per RVS over the seven species. The longest RVS (reg_id 546) was 19,665 kilobases and contained 29 ORF-clusters in <it>R. felis</it>. The sizes of the strain-variable ORFs were found to be roughly proportional to the prevalence of the ORF across the various species. Thus, strain-variable ORFs identified in six members exhibited a mean gene size of 976 bp (median 744 bp), whereas the mean size of strain-variable ORFs solely present in two species was only 332 bp (median 219 bp), which is only 30% of the R7 core gene size (Table <tblr tid="T1">1</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p>Rapid sequence evolution of strain-variable ORFs</p>
            </st>
            <p>The nonsynonymous substitution frequency (dN) ranged from 0.5 to 6.3 &#215; 10<sup>-2 </sup>substitutions per site for the R7 core genes (Figure <figr fid="F2">2a</figr>). The synonymous substitution frequency (dS) values were more than tenfold higher than the dN values in all pair-wise comparisons, which is indicative of purifying selection. In comparison, the dN values for the strain-variable ORFs were much more variable, ranging from 0.5 to 18 &#215; 10<sup>-2 </sup>nonsynymous substitutions per site (Figure <figr fid="F2">2a</figr>). Median dN values were inversely related to the prevalence of the ORF across the seven species and approached the dS values in some pair-wise comparisons that included only two or three species. A difference between the R7 core genes and the strain-variable ORFs in the seven-ortholog clusters was observed even if only ORFs that are more than 1 kilobase in size were included (Figure <figr fid="F2">2b</figr>). The smaller size and higher substitution frequency suggests that many strain-variable ORFs, particularly those present in a limited set of species, have evolved as pseudogenes.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Substitution frequency at nonsynonymous and synonymous sites plotted by genes present in different numbers of species</p>
               </caption>
               <text>
                  <p>Substitution frequency at nonsynonymous and synonymous sites plotted by genes present in different numbers of species. The nonsynonymous substitution frequency (dN) values were plotted against the synonymous substitution frequency (dS) values for <b>(a) </b>R7 core genes and strain-variable open reading frames (ORFs) present in two to seven species and <b>(b) </b>core genes and strain-variable ORFs present in seven species that are longer than one kilobase in size.</p>
               </text>
               <graphic file="gb-2008-9-2-r42-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Graphical display of the gene fragmentation process</p>
            </st>
            <p>With the aid of the visualization component of GenComp, we produced graphical images of the positions and relative sizes of all strain-variable ORFs in the 304 ORF-containing RVSs (Figure <figr fid="F3">3</figr>; see Additional date file 1 for graphical images of all RVSs). Note that many of the individual strain-variable ORFs represent fragments of the same pseudogene, broken up by indels and stop codons into multiple short ORFs. To follow the gene deterioration process in detail, we implemented a digital code to track sequence similarity across the strain-variable ORFs, with the first digits being identical for homologs within and across genomes and the latter two numbers representing a size index such that ORFs that are more than 80% similar in length are given the same number. A total of 482 of the 1,160 ORF-clusters produced significant hits (E &lt; e<sup>-10</sup>) to genes in the National Center for Biotechnology Information (NCBI) database other than of the seven <it>Rickettsia </it>genomes used to identify the variable segments. We selected all <it>Rickettsia </it>ORFs for which multiple gene alignments with homologs in other species could be created for an in-depth analysis; this amounted to 469 of the 482 strain-variable ORF-clusters and 681 of the 688 R7 core genes.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Visualization of variable segments using the GenComp visualization tool</p>
               </caption>
               <text>
                  <p>Visualization of variable segments using the GenComp visualization tool. Segments are shown that contain variable open reading frames (ORFs) that are present in <b>(a) </b>all seven species, <b>(b) </b>in most but not all species, <b>(c) </b>in members of the spotted fever group (SFG) <it>Rickettsia</it>, and <b>(d) </b>in a single species. The visualization tool displays (in blue) the location of positional orthologs that are conserved across all seven <it>Rickettsia </it>spp. and differ by less than 20% in size. Interspersed among these are segments with strain-variable ORFs shown (in green) that differ in sizes and are normally present in only a subset of the <it>Rickettsia </it>spp. Vertical lines show positional orthologs and horizontal lines indicate the six frames (+1, +2, +3, -1, -2, and -3, in that order). Each set of six lines represents a species, with <it>R. prowazekii </it>(Rp), <it>R. typhi </it>(Rt), <it>R. felis </it>(Rf), <it>R. akari </it>(Ra), <it>R. conorii </it>(Rc), <it>R. sibirica </it>(Rs), and <it>R. rickettsii </it>(Rr) shown from the top to the bottom, in accordance with their phylogenetic relationships. Numbers inside boxes show ORF numbers, and designations above boxes show gene annotations. The first digits in the numbers below the boxes indicate homologous strain-variable ORFs that are members of the same ORF-cluster, and the last two digits indicate ORFs in the ORF-clusters that differ by less than 80% in size. Arrows illustrate the fragmentation process for sequences that are similar across species.</p>
               </text>
               <graphic file="gb-2008-9-2-r42-3"/>
            </fig>
            <p>To illustrate the fragmentation patterns, we sorted the strain-variable ORFs into four sets depending on the different species distribution profiles (Table <tblr tid="T2">2</tblr>). One set of 75 ORF-clusters contained homologs across all seven species, as exemplified in Figure <figr fid="F3">3a</figr>. Many strain-variable ORFs in this set were only weakly mutated (slightly shorter with only a few indels or internal termination codons compared with their full-length homologs in other species), suggesting that they may encode functional or semifunctional gene products in some species. Another set encompassed 31 strain-variable ORFs in 18 segments (Figure <figr fid="F3">3b</figr>), including 27 ORF-clusters present in the TG plus some but not all members of the SFG, as well as four ORF-clusters uniquely present in the TG. A third set of clusters included strain-variable ORFs that are present in members of the SFG but not in the TG (Figure <figr fid="F3">3c</figr>). This was the largest set, including 215 ORF-clusters, 76 of which have homologs in all members of the SFG. Another 28 strain-variable ORFs were solely present in <it>R. felis </it>and <it>R. akari</it>, and 26 ORFs only in <it>R. conori, R. sibirica</it>, and <it>R. rickettsii</it>. Visual inspection of the erosion patterns in this set provides many examples of how a long ORF in one species, typically <it>R. felis</it>, has been disrupted into numerous short sequence fragments in the other species. The final set included 148 strain-variable ORFs identified in only a single <it>Rickettsia </it>sp., 115 of which were solely present in <it>R. felis</it>; many of these encoded transposons and other mobile elements (Figure <figr fid="F3">3d</figr>).</p>
            <tbl id="T2" hint_layout="single">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Classification of strain-variableORFs into species sets and phylogroups</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Species profile</p>
                     </c>
                     <c cspan="5" ca="center">
                        <p>Phylogroup</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>ROW*</p>
                     </c>
                     <c ca="left">
                        <p>RO</p>
                     </c>
                     <c ca="left">
                        <p>R8</p>
                     </c>
                     <c ca="left">
                        <p>R7</p>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set 1: all species</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>75</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set 2: TG + SFG</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 4, 5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 4, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 4, 5, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 4, 5, 7</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 3, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 4</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 4, 5, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 2, 4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 3, 4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 3, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1, 4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>2, 3, 4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>2, 3, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set 3: SFG only</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>28</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4, 5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>76</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4, 5, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 4, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 5, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 5, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>4, 5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>4, 5, 6</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>4, 5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>5, 6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>5, 6, 7</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>5, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>6, 7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>152</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>215</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Set 4: single species</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>69</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>115</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>90</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>148</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Grand total</p>
                     </c>
                     <c ca="left">
                        <p>89</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>288</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>469</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Number abbreviations for species are as follows: 1, <it>R. prowazekii</it>; 2, <it>R. typhi</it>; 3, <it>R. felis</it>; 4, <it>R. akari</it>; 5, <it>R. conorii</it>; 6, <it>R. sibirica</it>; and 7, <it>R. rickettsii</it>. The phylogroup ROW* is a summary of ROW, RW, W, and OW. ORF, open reading frame; SFG, spotted fever group; TG, typhus group.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Strain-variable ORFs and core genes are associated with different phylogroups</p>
            </st>
            <p>The strain-variable ORFs were then mapped onto the species phylogenetic tree to estimate the node at which it had been vertically inherited (Figure <figr fid="F4">4</figr>). To this end, core genes and strain-variable ORFs were separately grouped into different phylogroups. The R7 class contained ORFs without homologs in <it>R. bellii</it>, <it>O. tsutsugamushi</it>, or <it>W. pipientis</it>; the R8 class had homologs in <it>R. bellii </it>only; the RO class also in <it>O. tsutsugamushi</it>; and the ROW* class in <it>O. tsutsugamushi </it>and/or <it>W. pipientis </it>(E &lt; e<sup>-10</sup>; see Additional data file 2). The results of this categorization revealed a dramatic difference between strain-variable ORFs and core genes, as summarized in Figure <figr fid="F4">4</figr>. Thus, 72% of the strain-variable ORFs were placed in the R7 and R8 classes, 9% in the RO class, and only 19% belonged to the ROW* class. The converse pattern was observed for the core genes; only 21% belonged to the R7 and R8 classes, and 62% traced back to the ROW* ancestor.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Number of R7 core genes and strain-variable ORFs placed at different nodes of the species tree</p>
               </caption>
               <text>
                  <p>Number of R7 core genes and strain-variable ORFs placed at different nodes of the species tree. The relative proportions of R7 core genes (open boxes) and strain-variable open reading frames (ORFs; black boxes) are indicated at each node of the tree. Arrows show the nodes that contain the majority of core genes (ROW*) and strain-variable ORFs (R8). Species abbreviations are as in Figure 3, plus Ot (<it>O. tsutsugamushi</it>), wMel (<it>W. pipientis </it>[<it>Drosophila melanogaster</it>]), and wBm (<it>W. pipientis </it>[<it>Brugia malayia</it>]). The underlying phylogenetic tree was constructed using the maximum likelihood method from a concatenated alignment of adenylate kinase, SecY, and ribosomal proteins S3, S8, S10, S11, S13, S14, S19 and L2, L3, L4, L5, L6, L14, L16, L18, L22, L23, L24 and L29.</p>
               </text>
               <graphic file="gb-2008-9-2-r42-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Strain-variable ORFs in the R7 and R8 phylogroups are present in fewer <it>Rickettsia </it>spp. and are more degraded than strain-variable ORFs in the ROW* phylogroup</p>
            </st>
            <p>The extent of degradation was different for strain-variable ORFs placed at different nodes in the tree (Table <tblr tid="T2">2</tblr>). In brief, strain-variable ORFs in the ROW* class were often weakly mutated and normally present in all or most <it>Rickettsia </it>spp., whereas strain-variable ORFs in the R7 and R8 classes tended to be heavily degraded or only present in a single <it>Rickettsia </it>sp. For example, 32% of the 89 strain-variable ORFs in the ROW* class were present in all seven <it>Rickettsia </it>spp., whereas only one strain-variable ORF in the R7 phylogroup had homologs in all species. The RO class also contained a high proportion, 34%, of strain-variable ORFs with homologs in all <it>Rickettsia </it>spp., whereas strain-variable ORFs placed in the R8 phylogroup exhibited a more scattered species distribution pattern. Taken together, 71% of the strain-variable ORFs solely present in the SFG belonged to the R8 class, although they represent no more than 46% of the ORFs overall. The R7 class was even more biased in that 52% of the 48 strain-variable ORFs in the R7 phylogroup were members of a single species and another 40% were present solely in the SFG.</p>
         </sec>
         <sec>
            <st>
               <p>Strain-variable ORFs in the R7 and R8 phylogroups are associated with different functional categories than strain-variable ORFs in the ROW* phylogroup</p>
            </st>
            <p>With the aid of a classification scheme based on clusters of orthologous groups of proteins (COGs)-based classification scheme, we analyzed the distribution of functional categories for genes and pseudogenes belonging to different phylogroups (Tables <tblr tid="T3">3</tblr> and <tblr tid="T4">4</tblr>). The 89 strain-variable ORFs in the ROW* class exhibited a broad functional distribution profile according to COG classification (Table <tblr tid="T3">3</tblr>), much like the core genes inherited from the ROW* ancestor (Table <tblr tid="T4">4</tblr>). Both strain-variable ORFs and core genes in the ROW* phylogroup exhibited a relatively high abundance of genes in categories such as translation, replication, and energy production, as observed previously <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
            <tbl id="T3" hint_layout="double">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Classification of strain-variableORFs into functional categories and phylogroups</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Category</p>
                     </c>
                     <c cspan="5" ca="center">
                        <p>Phylogroup</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>ROW*</p>
                     </c>
                     <c ca="left">
                        <p>RO</p>
                     </c>
                     <c ca="left">
                        <p>R8</p>
                     </c>
                     <c ca="left">
                        <p>R7</p>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Information</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>J Translation</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>K Transcription</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>L Replication</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>41</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>67</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cellular processess</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>D Cell cycle</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>V Defense mechanisms</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>T Signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>M Cell wall/membrane</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>U Intracellular trafficking</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>O Post-translational modification</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>41</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>68</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Metabolism</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>C Energy production</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>G Carbohydrate transport</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>E Amino acid transport</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>F Nucleotide transport</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>H Coenzyme transport</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>I Lipid transport</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>P Inorganic ion transport</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Q Secondary metabolites</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>52</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Poorly characterized</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>R General function</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>52</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>S Function unknown</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>51</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Unclassified</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>139</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>179</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>198</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>282</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Grand total</p>
                     </c>
                     <c ca="left">
                        <p>89</p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>288</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>469</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The phylogroup ROW* is a summary of ROW, RW, W and OW. ORF, open reading frame</p>
               </tblfn>
            </tbl>
            <tbl id="T4" hint_layout="double">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Classification of core genes into functional categories and phylogroups</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Category</p>
                     </c>
                     <c cspan="5" ca="center">
                        <p>Phylogroup</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>ROW*</p>
                     </c>
                     <c ca="left">
                        <p>RO</p>
                     </c>
                     <c ca="left">
                        <p>R8</p>
                     </c>
                     <c ca="left">
                        <p>R7</p>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Information</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>J Translation</p>
                     </c>
                     <c ca="left">
                        <p>68</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>85</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>K Transcription</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>L Replication</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>119</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>146</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cellular processess</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>D Cell cycle</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>V Defense mechanisms</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>T Signal transduction</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>M Cell wall/membrane</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>59</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>N Cell motility</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>U Intracellular trafficking</p>
                     </c>
                     <c ca="left">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>O Post-translational modification</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>97</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>159</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Metabolism</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>C Energy production</p>
                     </c>
                     <c ca="left">
                        <p>45</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>59</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>G Carbohydrate transport</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>E Amino acid transport</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>F Nucleotide transport</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>H Coenzyme transport</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>I Lipid transport</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>P Inorganic ion transport</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Q Secondary metabolites</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>116</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>156</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Poorly characterized</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>R General function</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>42</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>S Function unknown</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Unclassified</p>
                     </c>
                     <c ca="left">
                        <p>60</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>146</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>96</p>
                     </c>
                     <c ca="left">
                        <p>45</p>
                     </c>
                     <c ca="left">
                        <p>76</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>220</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Grand total</p>
                     </c>
                     <c ca="left">
                        <p>428</p>
                     </c>
                     <c ca="left">
                        <p>104</p>
                     </c>
                     <c ca="left">
                        <p>146</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>681</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The phylogroup ROW* is a summary of ROW, RW, W and OW.</p>
               </tblfn>
            </tbl>
            <p>The R7 and R8 phylogroups showed strong functional bias. Interestingly, the large majority of the strain-variable ORFs (69%) and the core genes (54%) in the R8 phylogroup represented unknown or poorly characterized genes. Although these hypothetical ORFs may include gene prediction errors, a similar fraction of unknowns was observed in the core genes, suggesting that they represent current or ancestral genes of unknown function. Three functional categories - 'cell wall biosynthesis', 'replication', and 'defense mechanisms' - dominated among the rest of ORFs in these phylogroups. For example, 32 of the 146 core genes in the R8 class represent components of the cell wall that were possibly shed from <it>Orientia </it>and <it>Wolbachia </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. A total of 20 strain-variable ORFs belonged to this category, ten of which were placed in the R8 and three in the R7 phylogroup. These include genes for lipopolysaccharide biosynthesis, which may have been acquired at the base of the <it>Rickettsia </it>lineage.</p>
         </sec>
         <sec>
            <st>
               <p>Strain-variable ORFs in the R7 and R8 phylogroups contain an over-representation of mobile genetic elements</p>
            </st>
            <p>The large majority of strain-variable ORFs in the R7 and R8 classes represents mobile genetic elements and their associated genes, many of which were classified into the COG categories 'replication and repair' or 'defence mechanisms'. Altogether, these two categories contained 44 strain-variable ORFs, which account for 30% to 50% of strain-variable ORFs with a COG-based functional assignment in the R7 and R8 classes, respectively. These genes include transposases, phage genes, plasmid genes and genes for DNA helicases, RNA helicases, and different types of DNA restriction-modification enzymes. Many more mobile genes were identified in the <it>Rickettsia </it>genomes, but were either unclassified or represented poorly characterized genes, or were categorized in the RO or ROW* classes because of the presence of distant homologs in <it>O. tsutsugamushi </it>and <it>W. pipientis</it>. The latter include numerous strain-variable ORFs for ankyrin repeat and TPR repeat proteins that are normally associated with conjugative transfer elements in <it>O. tsutsugamushi </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and phage genes in <it>W. pipientis </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
            <p>To search systematically for remnants of mobile genetic elements, we extracted already identified phage and conjugative transfer genes in the individual genomes and BLASTed these against all of the other genomes. Genes for conjugative transfer encoded by the <it>tra </it>operon have been identified on the <it>R. felis </it>plasmid as well as on the chromosomes of <it>R. felis</it>, <it>R. bellii</it>, and <it>O. tsutsugamushi</it>. Using the <it>tra </it>operon as the query, we analyzed each genome for remnants of such genes. No evidence of conjugative transfer genes or remnants of such genes was observed in any of the other <it>Rickettsia </it>genomes nor in <it>W. pipientis</it>. The explanation may be that the <it>tra </it>gene cluster has been horizontally transmitted into or across <it>R. felis</it>, <it>R. belli</it>, and <it>O. tsutsugamushi</it>. Indeed, a phylogenetic analysis of the <it>tra </it>genes present in <it>R. bellii</it>, <it>R. felis</it>, and <it>O. tsutsugamushi </it>showed that the order of divergence was different from the species divergence pattern (data not shown), as was also observed in a recent survey of the <it>Rickettsia massiliae </it>genome <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
            <p>We also observed phage-related genes of the HK97 family in a few RVS with multiple strain-variable ORFs. For example, flanking a gene encoding a HK97 phage portal protein in RVS-308 were five duplicated genes putatively encoding cell surface antigens in <it>R. felis</it>. In RVS-552 we identified a long stretch of 13 ORFs in <it>R. felis</it>, all of which have homologs in <it>R. bellii </it>and six of which (including a copy of the gene for the HK97 phage major capsid protein) have homologs also in <it>O. tsutsugamushi </it>and <it>W. pipientis</it>. Interestingly, not only the sequence but also the order of genes was preserved. None of these six genes, or remnants thereof, was present in any of the other <it>Rickettsia </it>spp. Although no intact prophage was found, the identification of phage genes in the variable <it>Rickettsia </it>segments suggests that bacteriophages are circulating in the <it>Rickettsia </it>population.</p>
            <p>Finally, we examined in greater detail the 48 strain-variable ORFs identified in the R7 class, which represents putative horizontal gene transfers into individual <it>Rickettsia </it>spp. and clades, although it cannot be excluded that they were acquired at the base of the <it>Rickettsia </it>lineage followed by species-specific losses. These include a long stretch of genes in RVS-422 that were solely identified in <it>R. prowazekii </it>and encode resolvase-like proteins, transposases, and ankyrin-repeat proteins. Also present in the TG were multiple genes encoding proteins with glycosyltransferase domains in RVS-626, one of which represents the 5' half of a longer gene with two such domains in the other <it>Rickettsia </it>spp. None of these have a close homolog in the Rickettsiales, but were related to glycosyltransferases in distantly related species such as <it>Geobacter </it>spp. and <it>Vibrio cholerae</it>. The genes RP336/RT0325 and RP337/RT0326 have adjacent homologs in <it>V. cholerae</it>.</p>
            <p>The SFG and transitional group clades shared gene remnants for aminoglucoside phosphotransferase and acyltransferases. In addition, these clades had unique strain-variable ORFs associated with mobile elements, which encoded products such as lyase, DNA-damage inducible proteins, type I restriction-modification enzyme, transposase, and plasmid stabilization proteins. Twenty strain-variable ORFs in the R7 group were only present in <it>R. felis</it>; these included an exochitinase, a biotin synthase gene <it>bioB</it>, and mobile elements such as a mutator-type transposase (similarity to <it>Psychrobacter</it>), which was located in six different RVS fragments. The only strain-variable ORF with remnants in all species was the <it>metK </it>gene, which encodes S-adenosylmethionine synthetase.</p>
         </sec>
         <sec>
            <st>
               <p>Strain-variable ORFs and core genes in the R7 and R8 phylogroups have different proportions of closest proteobacterial relatives</p>
            </st>
            <p>To investigate how the differences in functionality relate to the putative source of the sequences in the R7 and R8 phylogroups, we inferred the closest relatives from the taxonomic classification of the most similar sequences, excluding <it>Rickettsia</it>, <it>Orientia</it>, and <it>Wolbachia </it>(Table <tblr tid="T5">5</tblr>). The R7 class was dominated by proteobacteria-like sequences, with 17 out of 35 ORFs showing similarity to &#947;-proteobacteria. Overall, the proteobacteria-like sequences accounted for 73% of strain-variable ORFs in the R7 class, and 11% to 12% of strain-variable ORFs and core genes in the R8 phylogroups. The lower proportion of proteobacteria-like sequences in the R8 classes is due to the high numbers of strain-variable ORFs (60%) and core genes (38%) that are specific to <it>Rickettsia </it>with no identifiable homolog outside the genus. A total of 80 strain-variable ORFs and 80 core genes in the R8 class exhibited sequence similarities to proteobacteria. However, the ratio of ORFs with the highest sequence similarity of &#945;-proteobacteria versus &#947;-proteobacteria was strikingly different: only 0.6 for the strain-variable ORFs versus 3.2 for the core genes.</p>
            <tbl id="T5" hint_layout="single">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Sequence similarity to bacterial subdivisions</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>R7</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>R8</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Strain variable</p>
                     </c>
                     <c ca="center">
                        <p>Strain variable</p>
                     </c>
                     <c ca="center">
                        <p>Core</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>R8-specific</p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="left">
                        <p>173</p>
                     </c>
                     <c ca="left">
                        <p>56</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Proteobacteria</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>&#945;</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>52</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>&#946;</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>&#948;</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>&#949;</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>&#947;</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Actinobacteria</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Aquificae</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bacilli</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bacteroidetes</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Chlamydia</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Clostridia</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cyanobacteria</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Deinococcus</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fibrobacter</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Fusobacteria</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mollicutes</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Spirochaetes</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Euryarchaeota</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>288</p>
                     </c>
                     <c ca="left">
                        <p>146</p>
                     </c>
                  </r>
               </tblbdy>
   