<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-182</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>New criteria for selecting the origin of DNA replication in <it>Wolbachia </it>and closely related bacteria</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Ioannidis</snm>
               <fnm>Panagiotis</fnm>
               <insr iid="I1"/>
               <email>panioan@cc.uoi.gr</email>
            </au>
            <au id="A2">
               <snm>Hotopp</snm>
               <mnm>C Dunning</mnm>
               <fnm>Julie</fnm>
               <insr iid="I2"/>
               <email>jdunning@jcvi.org</email>
            </au>
            <au id="A3">
               <snm>Sapountzis</snm>
               <fnm>Panagiotis</fnm>
               <insr iid="I1"/>
               <email>psapount@cc.uoi.gr</email>
            </au>
            <au id="A4">
               <snm>Siozios</snm>
               <fnm>Stefanos</fnm>
               <insr iid="I1"/>
               <email>ssiozios@cc.uoi.gr</email>
            </au>
            <au id="A5">
               <snm>Tsiamis</snm>
               <fnm>Georgios</fnm>
               <insr iid="I1"/>
               <email>gtsiamis@cc.uoi.gr</email>
            </au>
            <au id="A6">
               <snm>Bordenstein</snm>
               <mi>R</mi>
               <fnm>Seth</fnm>
               <insr iid="I3"/>
               <email>sbordenstein@mbl.edu</email>
            </au>
            <au id="A7">
               <snm>Baldo</snm>
               <fnm>Laura</fnm>
               <insr iid="I4"/>
               <email>laurab@ucr.edu</email>
            </au>
            <au id="A8">
               <snm>Werren</snm>
               <mi>H</mi>
               <fnm>John</fnm>
               <insr iid="I5"/>
               <email>werr@mail.rochester.edu</email>
            </au>
            <au id="A9" ca="yes">
               <snm>Bourtzis</snm>
               <fnm>Kostas</fnm>
               <insr iid="I1"/>
               <email>kbourtz@uoi.gr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Environmental and Natural Resources Management, University of Ioannina, 2 Seferi Street, Agrinio 30100, Greece</p>
            </ins>
            <ins id="I2">
               <p>The Institute for Genomic Research, J. Craig Venter Institute, 9712 Medical Center Drive, Rockville, MD 20850, USA</p>
            </ins>
            <ins id="I3">
               <p>Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, The Marine Biological Laboratory, Woods Hole, MA, 02543, USA</p>
            </ins>
            <ins id="I4">
               <p>Department of Biology, University of California, Riverside, USA</p>
            </ins>
            <ins id="I5">
               <p>Department of Biology, University of Rochester, Rochester, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>182</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/182</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17584494</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-182</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>03</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>20</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>20</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Ioannidis et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The annotated genomes of two closely related strains of the intracellular bacterium <it>Wolbachia pipientis </it>have been reported without the identifications of the putative origin of replication (<it>ori</it>). Identifying the <it>ori </it>of these bacteria and related alpha-Proteobacteria as well as their patterns of sequence evolution will aid studies of cell replication and cell density, as well as the potential genetic manipulation of these widespread intracellular bacteria.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using features that have been previously experimentally verified in the alpha-Proteobacterium <it>Caulobacter crescentus</it>, the origin of DNA replication (<it>ori</it>) regions were identified <it>in silico </it>for <it>Wolbachia </it>strains and eleven other related bacteria belonging to <it>Ehrlichia</it>, <it>Anaplasma</it>, and <it>Rickettsia </it>genera. These features include DnaA-, CtrA- and IHF-binding sites as well as the flanking genes in <it>C. crescentus</it>. The <it>Wolbachia ori </it>boundary genes were found to be <it>hemE </it>and COG1253 protein (CBS domain protein). Comparisons of the putative <it>ori </it>region among related <it>Wolbachia </it>strains showed higher conservation of bases within binding sites.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The sequences of the <it>ori </it>regions described here are only similar among closely related bacteria while fundamental characteristics like presence of DnaA and IHF binding sites as well as the boundary genes are more widely conserved. The relative paucity of CtrA binding sites in the <it>ori </it>regions, as well as the absence of key enzymes associated with DNA replication in the respective genomes, suggest that several of these obligate intracellular bacteria may have altered replication mechanisms. Based on these analyses, criteria are set forth for identifying the <it>ori </it>region in genome sequencing projects.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p><it>Wolbachia </it>are Gram-negative, intracellular &#945;-Proteobacteria that infect many invertebrates including terrestrial crustaceans, mites, spiders and filarial nematodes <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Much of the success of <it>Wolbachia </it>can be attributed to the diverse phenotypes they induce in hosts. These range from classical mutualism to reproductive parasitism as characterized by the ability to override chromosomal sex determination, induce parthenogenesis, selectively kill males and induce cytoplasmic incompatibility in early embryos <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. The unique biology of <it>Wolbachia </it>has attracted a growing number of researchers interested in questions ranging from the evolutionary implications of infection to the use of this agent for pest and disease control <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>These endosymbiotic bacteria are typically transmitted through the eggs of their hosts and their replication rate is regulated to avoid overgrowth prior to host reproduction <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. The replication control mechanisms are not known for this growth limitation. The intensity of <it>Wolbachia</it>'s effects has often been correlated with bacterial copy number, as reported in different host species <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. A factor that may influence bacterial proliferation is the organization of the <it>ori </it>region. Identifying the <it>ori </it>of these bacteria is a key step in understanding the mechanisms of bacterial replication and for developing methods for genetic manipulation of these bacteria. Recently, the closed and annotated genomes of two <it>Wolbachia </it>strains have been reported <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. However, neither of these studies identified the putative origin of replication (<it>ori</it>) for <it>Wolbachia</it>.</p>
         <p>DNA replication in bacteria takes place by uncoiling the double stranded helix and breaking the hydrogen bonds between the complementary strands at a specific chromosomal locus, the <it>ori </it>region. Early events in DNA replication are subdivided into the following three steps: (i) binding of the initiator proteins to sites located within the <it>ori </it>region; (ii) local unwinding of the <it>ori </it>region; and (iii) loading of the DNA helicase and other proteins required to form the Y-shaped replication forks <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Typically, bacteria have a single <it>ori </it>region <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, although in some prokaryotes two chromosomal <it>ori </it>regions were experimentally identified, probably reflecting a temporal mode of DNA replication <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>.</p>
         <p>Chromosomal replication initiates at <it>ori</it>, proceeds bidirectionally and terminates when the replication forks reach the termination site, <it>terC </it>(in the case of circular chromosomes) or the chromosome ends (in the case of linear chromosomes) <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The initiation of bacterial chromosome replication is mediated by the DnaA protein, which binds to specific 9-mer cis-regulatory elements called DnaA boxes located in the <it>ori </it>region. Usually, about ten to twenty DnaA molecules bind to five DnaA boxes and promote unwinding of the AT-rich <it>ori </it>region. The sequences of <it>ori </it>are conserved only among closely related microorganisms and vary greatly in size (from 200&#8211;1000 bp) <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. A common feature is the presence of several DnaA boxes and an AT-rich region; a cluster of four or more DnaA boxes is indicative of a functional origin of replication <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. In &#947;-Proteobacteria, the <it>ori </it>region is frequently located within the <it>rnpA-rmpH-dnaA-dnaN-recF-gyrB </it>gene cluster and usually next to the <it>dnaA </it>gene <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, with the <it>Escherichia coli ori </it>being the most thoroughly studied <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
         <p>The location of the <it>ori </it>region can be diverse across the bacterial lineages <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Among the &#945;-Proteobacteria, which includes <it>Wolbachia</it>, only the <it>ori </it>region of <it>C. crescentus </it>(<it>Cori</it>) has been experimentally identified <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Independent methods have provided a consistent location for <it>Cori </it>between <it>hemE </it>(CC3763) and a gene encoding a conserved hypothetical protein (CC0001) (in the present study, this gene is referred to by its NCBI COG number, COG1806) <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The <it>hemE </it>gene encodes for uroporphyrinogen decarboxylase &#8211; a component in heme biosynthesis, and the COG1806 gene has no known function but contains the conserved domain of unknown function, DUF299. The <it>hemE</it>/COG1806 boundary genes of <it>Cori </it>are present in the sequenced genomes of several other &#945;-Proteobacteria <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. While <it>Cori </it>shows some apparent similarities with the <it>E. coli ori </it>such as a 40 bp AT-rich region, presence of DnaA boxes and an IHF (integration host factor) binding site, it has an additional regulatory protein, CtrA. CtrA is a global cell cycle regulator that controls 26% of the transcripts that vary during the cell cycle <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. <it>In vitro </it>footprint experiments revealed five CtrA binding sites within <it>Cori</it>, centered over the consensus TTAA-N<sub>7</sub>-TTAA <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. CtrA binding sites appear to be strategically organized spanning the entire length of <it>Cori </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Phosphorylated CtrA binds two perfect or imperfect halves of the recognition sequence, probably as a dimer and represses chromosomal replication.</p>
         <p>Comparison of over 30 bacterial genomes with <it>Cori </it>reveals a noticeable conservation of binding sites and flanking genes, although in some bacteria, chromosomal rearrangements appear to have taken place (unpublished observations). GC-skew analysis has also been used to predict the origin of DNA replication <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>, but it is not a good universal predictor of the <it>ori </it><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, and was not predictive for the two annotated <it>Wolbachia </it>genomes <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. In contrast, for the closely related <it>Ehrlichia</it>, <it>Neorickettsia</it>, and <it>Anaplasma</it>, GC-skew has been used to approximate the origin position in the chromosome more reliably <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. However, the actual shift can occur over a fairly large region and thus this feature alone cannot reveal the precise location for the ori region.</p>
         <p>Given that GC-skew analysis was insufficient at predicting the <it>ori </it>region in several &#945;-Proteobacteria species, a different approach was developed. The origin of replication in <it>C. crescentus </it>is located between orthologs of <it>hemE </it>(CC_3763) and COG1806 (CC_0001), and it contains five DnaA boxes, a single binding site for IHF and five CtrA binding sites <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. In <it>Rickettsia prowazekii</it>, the <it>ori </it>region is also located between the <it>hemE </it>(RP_885) and COG1806 orthologs <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>.</p>
         <p>The guidelines used by Brassinga et al. <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> for detecting <it>Cori </it>were applied in the present study to identify the origin of DNA replication in <it>Wolbachia </it>and in ten closely related &#945;-Proteobacteria. Computational analyses indicate that the origin of DNA replication in the <it>w</it>Mel and <it>w</it>Bm <it>Wolbachia </it>strains <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp> lies between a gene encoding a cystathione-&#946;-synthase (CBS) domain protein and the <it>Wolbachia hemE </it>gene. The evidence relies mainly on boundary gene recognition as well as DnaA-, CtrA- and IHF-binding site identification, as described by <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Analysis of the corresponding sequences from an additional fifty-one <it>Wolbachia </it>strains supports the predicted <it>ori </it>as well as identifying frequent recombination at the edges of the sequences. Using the same guidelines, the <it>ori </it>region was also identified in the sequenced representatives of the closely related <it>Anaplasma</it>, <it>Ehrlichia</it>, <it>Neorickettsia </it>and <it>Rickettsia</it>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p><it>In silico </it>prediction of the <it>Wolbachia </it>origin of replication</p>
            </st>
            <p>The origin of replication of <it>C. crescentus </it>is located between orthologs of <it>hemE </it>(CC_3763) and a conserved hypothetical protein, COG1806, and contains DnaA, CtrA and IHF binding sites <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>: In <it>R. prowazekii</it>, the <it>ori </it>region is also located between the <it>hemE </it>(RP_885) and COG1806 orthologs.</p>
            <p>Based on a slightly modified approach and findings of Brassinga et al. <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>, the <it>Wolbachia ori </it>was predicted <it>in silico</it>. Our approach was based on four criteria: (a) position near either <it>hemE </it>or COG1806 orthologs; (b) an intergenic region (that is, the putative <it>ori </it>region) containing an appropriate number of binding sites for the DnaA, CtrA and IHF factors; (c) genome-wide searches confirming that no other appropriately sized intergenic region (>300 bp) contains a significant number of these three characteristic binding sites and (d) the AT-content of the predicted origins that is higher than the average for the respective genome.</p>
            <p>First, the <it>Wolbachia </it>orthologs of <it>hemE </it>and COG1806 were searched in the recently published <it>Wolbachia w</it>Mel genome <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and identified as the loci WD_1028 and WD_0341, respectively. The great distance between the two genes in <it>Wolbachia </it>(>600 kb) indicated no conservation of the <it>hemE</it>-COG1806 region in <it>Wolbachia</it>. The lack of conservation is most likely due to a single chromosomal rearrangement resulting in two possible locations for <it>ori</it>: WD_0340-WD_0341 or WD_1027-WD_1028. The first region is between the heme exporter WD_0340 (<it>ccmC</it>) and the COG1806 homolog (WD_0341). The second region is between the CBS domain protein WD_1027 (referred to in the present study after its NCBI COG number, COG1253) and uroporphyrinogen decarboxylase WD_1028 (<it>hemE</it>).</p>
            <p>Both regions were examined for the presence of DnaA-, CtrA-, and IHF-binding sites, essential components of <it>ori </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Only the intergenic region (IGR) between the COG1253 and <it>hemE </it>gene (position 988364&#8211;988765, 402 bp) was found to have all three characteristic binding sites. In addition, out of all 110 IGRs of greater than 300 bp, this region has the highest number of CtrA, DnaA and IHF binding sites (eight total, p = 1/110 = 0.009), the highest density of total binding domains per IGR bp (0.020 per bp). The next closest is the IGR between WD_0248-WD_0249, which has a much lower total number of binding sites (six) and binding site density (0.013 per bp). However, it lacks putative DnaA boxes, a prerequisite for an <it>ori </it>region. The putative <it>ori </it>region is one of only two IGRs > 300 bp that have all three binding site types (p = 2/110 = 0.018). However, the IGR between WD_0100-WD_0102 has only 4 total binding domains (compared to 8) and, due to it's larger size (545 bp) a binding site density 1/3 that of the putative ori (0.007 <it>vs </it>0.020). These data, combined with the association of the WD_1027-WD_1028 IGR with the same genes shown to be flanking the <it>ori </it>region in <it>C. crescentus </it>and <it>R. prowazekii</it>, provides compelling evidence that this is most likely, the origin of replication. More specifically, there are 3 DnaA, 4 CtrA and 1 IHF binding sites in the 404 bp region between COG1253 and <it>hemE</it>. In contrast, the region between <it>ccmC </it>and COG1806 has only a single IHF binding site and is only 130 bp long, which is rather short compared to the <it>ori </it>of related bacteria (e.g. <it>R. prowazekii </it>and <it>C. crescentus</it>, have replication origins >400 bp <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>).</p>
            <p>Another feature of origins is that they typically contain an AT-rich region to facilitate dissociation of the DNA during replication initiation <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. As a result, the overall AT-content of <it>ori </it>is higher when compared to the respective genome. Consistent with this feature, the putative <it>w</it>Mel origin is 76% AT-rich compared to the 65% average AT-content for the genome. In addition, the <it>ori </it>region is significantly more AT-rich than the average found for other intergenic regions of >300 bp. Of 110 intergenic regions >300 bp, only 6 have AT-content greater or equal to the <it>ori </it>(p = 6/110 = 0.055). Indeed, the AT-contents of all origins detected in this study were higher than their respective genome averages (Table <tblr tid="T1">1</tblr>). It has been proposed in <it>E. coli </it>that this exceptionally AT-rich DNA is the place where the initiator factor of DNA replication, the DnaA protein, first unwinds the origin <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. These AT-rich sequences are also conspicuous because they are tandem repeats. For example, the <it>E. coli </it>oriC region is composed of three imperfect 13-mer sequences with the consensus sequence GATCTNTTNTTTT <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Similar tandem repeats were detected in all bacterial species and strains studied except in <it>A. marginale</it>, <it>E. canis </it>and <it>N. sennetsu</it>. In most cases the repeats were 10&#8211;12 bp long and appeared in two copies (data not shown).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Bacterial species/strains whose partial and/or complete genome sequence was downloaded from various sources</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="left">
                        <p>Organism</p>
                     </c>
                     <c ca="center">
                        <p>References</p>
                     </c>
                     <c ca="center">
                        <p>Closed-Curated annotation</p>
                     </c>
                     <c ca="center">
                        <p>Genbank</p>
                     </c>
                     <c ca="center">
                        <p>Coordinates (length)</p>
                     </c>
                     <c ca="center">
                        <p>Flank One</p>
                     </c>
                     <c ca="center">
                        <p>Flank Two</p>
                     </c>
                     <c ca="center">
                        <p>% genome AT content</p>
                     </c>
                     <c ca="center">
                        <p>% <it>ori </it>AT content</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia pipientis w</it>Mel_A</p>
                     </c>
                     <c ca="center">
                        <p>[15]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_002978">NC_002978</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>988364..988765 (402 bp)</p>
                     </c>
                     <c ca="center">
                        <p>WD_1027/<it>COG1253 </it>(CBS)</p>
                     </c>
                     <c ca="center">
                        <p>WD_1028/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>64.77</p>
                     </c>
                     <c ca="center">
                        <p>75.87</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia pipientis w</it>Bm_D</p>
                     </c>
                     <c ca="center">
                        <p>[14]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_006833">NC_006833</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1079740..110 (455 bp)</p>
                     </c>
                     <c ca="center">
                        <p>Wbm_0809/<it>COG1253 </it>(CBS)</p>
                     </c>
                     <c ca="center">
                        <p>Wbm_0001/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>65.82</p>
                     </c>
                     <c ca="center">
                        <p>69.89</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia pipientis w</it>Ana_A</p>
                     </c>
                     <c ca="center">
                        <p>[43]</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NZ_AAGB00000000">NZ_AAGB00000000</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N/A (402 bp)</p>
                     </c>
                     <c ca="center">
                        <p>WwAna0355/<it>COG1253 </it>(CBS)</p>
                     </c>
                     <c ca="center">
                        <p>WwAna0353/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>75.87</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia pipientis w</it>Sim_A</p>
                     </c>
                     <c ca="center">
                        <p>[43]</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NZ_AAGC00000000">NZ_AAGC00000000</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N/A (396 bp)</p>
                     </c>
                     <c ca="center">
                        <p>WwSim0491/<it>COG1253 </it>(CBS)</p>
                     </c>
                     <c ca="center">
                        <p>WwSim0492/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>75.51</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia pipientis w</it>Wil_A</p>
                     </c>
                     <c ca="center">
                        <p>[44]</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>N/A (402 bp)</p>
                     </c>
                     <c ca="center">
                        <p><it>COG1253 </it>(CBS)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>hemE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>75.49</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Wolbachia pipientis w</it>Pip_B</p>
                     </c>
                     <c ca="center">
                        <p>[45]</p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>N/A (402 bp)</p>
                     </c>
                     <c ca="center">
                        <p><it>COG1253 </it>(CBS)</p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>hemE</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N/A</p>
                     </c>
                     <c ca="center">
                        <p>79.17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Ehrlichia chaffeensis </it>Arkansas</p>
                     </c>
                     <c ca="center">
                        <p>[36]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000236">CP000236</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>26939..27361 (423 bp)</p>
                     </c>
                     <c ca="center">
                        <p>ECH_0030/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>ECH_0031/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>69.90</p>
                     </c>
                     <c ca="center">
                        <p>74.70</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Ehrlichia ruminantium </it>Welgevonden</p>
                     </c>
                     <c ca="center">
                        <p>[35]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_005295">NC_005295</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>23003..23424 (422 bp)</p>
                     </c>
                     <c ca="center">
                        <p>Erum_0180/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>Erum_0190/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>72.52</p>
                     </c>
                     <c ca="center">
                        <p>75.19</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Ehrlichia ruminantium </it>Gardel</p>
                     </c>
                     <c ca="center">
                        <p>[88]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_006831">NC_006831</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4149..4564 (416 bp)</p>
                     </c>
                     <c ca="center">
                        <p>ERGA_CDS_00050/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>ERGA_CDS_00060/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>72.49</p>
                     </c>
                     <c ca="center">
                        <p>75.48</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Ehrlichia canis </it>Jake</p>
                     </c>
                     <c ca="center">
                        <p>NC_007354</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_007354">NC_007354</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>11395..11819 (425 bp)</p>
                     </c>
                     <c ca="center">
                        <p>Ecaj_0010/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>Ecaj_0011/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>71.04</p>
                     </c>
                     <c ca="center">
                        <p>73.88</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Anaplasma marginale </it>St. Maries</p>
                     </c>
                     <c ca="center">
                        <p>[34]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_004842">NC_004842</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1147944..1148563 (620 bp)</p>
                     </c>
                     <c ca="center">
                        <p>AM_1301/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>AM_1303/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>50.24</p>
                     </c>
                     <c ca="center">
                        <p>63.71</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Anaplasma phagocytophilum </it>HZ</p>
                     </c>
                     <c ca="center">
                        <p>[36]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000235">CP000235</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>20557..20941 (385 bp)</p>
                     </c>
                     <c ca="center">
                        <p>APH_0020/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>APH_0021/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>58.36</p>
                     </c>
                     <c ca="center">
                        <p>71.43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Neorickettsia sennetsu </it>Miyayama</p>
                     </c>
                     <c ca="center">
                        <p>[36]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000237">CP000237</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>855136..855541 (406 bp)</p>
                     </c>
                     <c ca="center">
                        <p>NSE_0967/phage uncharacterized protein</p>
                     </c>
                     <c ca="center">
                        <p>NSE_0968/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>58.92</p>
                     </c>
                     <c ca="center">
                        <p>71.92</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>[36]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CP000237">CP000237</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>267777..268329 (553 bp)</p>
                     </c>
                     <c ca="center">
                        <p>NSE_0324/hypothetical protein</p>
                     </c>
                     <c ca="center">
                        <p>NSE_0327/<it>COG1253</it></p>
                     </c>
                     <c ca="center">
                        <p>58.92</p>
                     </c>
                     <c ca="center">
                        <p>58.22</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Rickettsia typhi </it>Wilmington</p>
                     </c>
                     <c ca="center">
                        <p>[75]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_006142">NC_006142</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1111114..1111496 (383 bp)</p>
                     </c>
                     <c ca="center">
                        <p>RT_0877/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>RT_0001/<it>COG1806</it></p>
                     </c>
                     <c ca="center">
                        <p>71.08</p>
                     </c>
                     <c ca="center">
                        <p>82.25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Rickettsia prowazekii </it>Madrid E</p>
                     </c>
                     <c ca="center">
                        <p>[73]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_000963">NC_000963</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1111140..1111523 (384 bp)</p>
                     </c>
                     <c ca="center">
                        <p>RP_885/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>RP_001/<it>COG1806</it></p>
                     </c>
                     <c ca="center">
                        <p>71.00</p>
                     </c>
                     <c ca="center">
                        <p>82.55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Rickettsia conorii </it>Malish 7</p>
                     </c>
                     <c ca="center">
                        <p>[74]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_003103">NC_003103</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1268361..1268755 (395 bp)</p>
                     </c>
                     <c ca="center">
                        <p>RC_1374/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>RC_0001/<it>COG1806</it></p>
                     </c>
                     <c ca="center">
                        <p>67.56</p>
                     </c>
                     <c ca="center">
                        <p>81.52</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Rickettsia felis </it>URRWXCal2</p>
                     </c>
                     <c ca="center">
                        <p>[76]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_007109">NC_007109</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>1484753..1485148 (396 bp)</p>
                     </c>
                     <c ca="center">
                        <p>RF_1400/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>RF_0001/<it>COG1806</it></p>
                     </c>
                     <c ca="center">
                        <p>67.55</p>
                     </c>
                     <c ca="center">
                        <p>82.83</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Caulobacter crescentus </it>CB15</p>
                     </c>
                     <c ca="center">
                        <p>[89]</p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="NC_002696">NC_002696</ext-link>
                        </p>
                     </c>
                     <c ca="center">
                        <p>4016703..159 (404 bp)</p>
                     </c>
                     <c ca="center">
                        <p>CC_3763/<it>hemE</it></p>
                     </c>
                     <c ca="center">
                        <p>CC_0001/<it>COG1806</it></p>
                     </c>
                     <c ca="center">
                        <p>32.80</p>
                     </c>
                     <c ca="center">
                        <p>43.56</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Y, yes; N, no; N/A, not applicable.</p>
               </tblfn>
            </tbl>
            <p>These lines of <it>in silico </it>evidence suggest that <it>Wolbachia </it>has an origin of replication on its chromosome that lies between the COG1253 and <it>hemE </it>genes. One of the boundary genes, <it>hemE </it>(WD_1028), encodes for uroporphyrinogen decarboxylase &#8211; a component in heme biosynthesis. The other, COG1253 (WD_1027), encodes for a CBS domain protein. CBS domains have been shown to bind ligands with an adenosyl group such as AMP, ATP and S-AdoMet <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Using the publicly available HMMer program <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, it was found that this protein has a CBS domain pair and a CorC_HlyC transporter associated domain.</p>
            <p>The putative <it>ori </it>regions were also identified in other complete and/or partial publicly available <it>Wolbachia </it>genomes by using the above mentioned criteria. The orthologs of the COG1253 (WD_1027) and <it>hemE </it>(WD_1028) genes were identified by BLAST searches <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> of the genomes of the <it>w</it>Ana <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, <it>w</it>Sim <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, <it>w</it>Wil <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, <it>w</it>Pip <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> and <it>w</it>Bm <it>Wolbachia </it>strains <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> (see Table <tblr tid="T1">1</tblr>). Both flanking genes are highly conserved in position and in sequence. Based on criteria set forth by Zyskind et al. <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, the regions of the <it>w</it>Mel and <it>w</it>Bm strains for which their genome sequence is available <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, were compared indicating that: (a) 266 of 347, or 77%, of the amino acids are conserved, while 790 of 1044 (76%) of the nucleotides are conserved in the flanking gene <it>hemE </it>and (b) 216 of 279, or 77%, of the amino acids are conserved, while 609 of 819 (74%) of the nucleotides are conserved in the flanking gene COG1253. The <it>ori </it>regions of the two genomes present 73% identity at the nucleotide level. The putative <it>ori-</it>region-related binding sites for the <it>w</it>Ana, <it>w</it>Sim and <it>w</it>Wil strains (all members of the A-supergroup), <it>w</it>Pip strain (B-supergroup), and the <it>w</it>Bm strain (D-supergroup) have been depicted [see Additional Table <supplr sid="S3">3</supplr>]. A schematic representation is depicted in Figure <figr fid="F1">1</figr> while the length and genome coordinates are shown in Table <tblr tid="T1">1</tblr> (where applicable).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Schematic representation of representative origins of replication</p>
               </caption>
               <text>
                  <p><b>Schematic representation of representative origins of replication</b>. Schematic drawing of <it>ori </it>regions from <it>Wolbachia</it>, <it>Anaplasma</it>, <it>Ehrlichia </it>and <it>Rickettsia</it>. Inverted triangles denote DnaA boxes, circles denote CtrA binding sites and triangles indicate IHF binding sites. The flanking genes are fragmented. Some protein binding sites are located outside <it>ori </it>but within either of the boundary genes; however, the flanking genes were not fully sequenced from the <it>Wolbachia </it>strains of <it>F. candida</it>, <it>C. lectularius</it>, <it>C. scorpioides</it>, <it>P. hoffmeyeri</it>, and <it>A. encedon</it>. Note also that in <it>A. marginale </it>the <it>ori </it>region appears to be significantly longer due to the differently annotated <it>hemE </it>gene (see text for details). <it>w</it>Mel_A, <it>Wolbachia </it>of <it>Drosophila melanogaster w</it>Mel; <it>w</it>Mau_B, <it>Wolbachia </it>of <it>Drosophila mauritiana w</it>Mau; <it>w</it>Bm_D, <it>Wolbachia </it>of <it>Brugia malayi w</it>Bm; <it>w</it>Fcan_E, <it>Wolbachia </it>of <it>Folsomia candida w</it>FcanE; <it>w</it>Clec_F, <it>Wolbachia </it>of <it>Cimex lectularius w</it>ClecF; <it>w</it>Csco, <it>Wolbachia </it>of <it>Cordylochernes scorpioides w</it>Csco; <it>w</it>Phof_A, <it>Wolbachia </it>of <it>Pegoscapus hoffmeyeri w</it>PhofA; <it>w</it>Aenc_B, <it>Wolbachia </it>of <it>Acraea encedon w</it>AencB; Echaf, <it>Ehrlichia chaffeensis </it>Arkansas; ErumW, <it>E. ruminantium </it>Welgevonden; Ecan, <it>E. canis </it>Jake; Amar, <it>Anaplasma marginale </it>St Maries; Aphag, <it>A. phagocytophilum </it>HZ; Nsenn, <it>Neorickettsia sennetsu </it>Miyayama; Rtyp, <it>Rickettsia typhi </it>Wilmington; Rpro, <it>R. prowazekii </it>Madrid E; Rcon, <it>R. conorii </it>Malish 7; Rfel, <it>R. felis </it>URRWXCal2; Ccre, <it>Caulobacter crescentus </it>CB15.</p>
               </text>
               <graphic file="1471-2164-8-182-1"/>
            </fig>
            <p>Based on the consensus sequence of the flanking genes of the putative <it>ori </it>region in <it>Wolbachia </it>[see Additional Text File 1], a PCR strategy [see Additional Figure 1 and Additional Table <supplr sid="S2">2</supplr>] was developed to amplify and sequence the putative <it>ori </it>regions of other <it>Wolbachia </it>strains infecting different <it>Drosophila </it>species: <it>w</it>MelPop, <it>w</it>Ri, <it>w</it>Au, <it>w</it>Ha, <it>w</it>Yak, <it>w</it>Tei, <it>w</it>San, <it>w</it>No, <it>w</it>Ma and <it>w</it>Mau [see Additional Table <supplr sid="S1">1</supplr>] <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. Further sequencing of the putative <it>ori </it>region of an additional 37 <it>Wolbachia </it>strains from diverse hosts confirmed the overall characteristics of this region both in size and sequence [see Figure <figr fid="F1">1</figr> and Additional Table <supplr sid="S1">1</supplr>]. <it>Wolbachia </it>strains belong to eight supergroups A-H <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. The present study includes representative strains from supergroups A, B, D, E and F. All strains present three DnaA and a single IHF boxes while they differ in the number of CtrA binding sites (two to seven). The sequenced putative <it>ori </it>regions of some <it>Wolbachia </it>strains present a peculiar pattern of binding sites [see Figure <figr fid="F1">1</figr> and Additional Table <supplr sid="S1">1</supplr>]. It is worth noting that the pseudoscorpion <it>Wolbachia ori </it>does not have any CtrA binding motifs while it presents three DnaA boxes and a single IHF binding site suggesting either the existence of a very diverged CtrA binding motif (which could not be identified when compared to the consensus sequence used in the present study) or the CtrA factor may not play an important regulatory role in the DNA replication of this <it>Wolbachia </it>strain.</p>
            <p>We tested for selective constraints on the binding sites for DnaA, CtrA and IHF by comparing the frequencies of polymorphic base positions in binding domain sites versus non-binding domain sites within the same IGR in different clades containing closely related <it>Wolbachia </it>strains (B1, B2 and A1, figure <figr fid="F2">2</figr>). The proportion of positions showing polymorphism was lower in the binding domain sites than non-binding sites for all three clades (B1 0.08 <it>vs </it>0.19, p = 0.035; B2 0 <it>vs </it>0.12, p = 0.001; A1 0.02 <it>vs </it>0.08, p = 0.081; Fisher Exact Test), with two showing significant reductions in polymorphisms within domain positions. Combining the data give 25.3 percent lower proportion of positions with polymorphisms in the binding domains relative to flanking non-binding positions (p &lt; 0.0001 Fisher Exact Test), indicating some selective constraint on these positions.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p><it>ori </it>phylogeny based on 47 <it>Wolbachia </it>taxa</p>
               </caption>
               <text>
                  <p><b><it>ori </it>phylogeny based on 47 <it>Wolbachia </it>taxa</b>. Maximum Likelihood (ML) inferred phylogeny and ML bootstrap values based on the nucleotide sequences of the <it>ori </it>region from the A and B Wolbachia. Subgroups are denoted A1 and A2, and B1 and B2.</p>
               </text>
               <graphic file="1471-2164-8-182-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p><it>In silico </it>prediction of the origin of DNA replication in closely related bacteria</p>
            </st>
            <p>Applying the same criteria, the origin of DNA replication was predicted <it>in silico </it>for the closely related <it>A. marginale </it>St. Maries, <it>A. phagocytophilum </it>HZ, <it>E. ruminantium </it>Welgevonden, <it>E. ruminantium </it>Gardel, <it>E. canis </it>Jake and <it>E. chaffeensis </it>Arkansas (Table <tblr tid="T1">1</tblr>). Annotation, BLAST analysis and/or domain searches (using HMMer) were used to identify the two flanking genes of the putative <it>ori </it>regions (Table <tblr tid="T1">1</tblr>).</p>
            <p>The length of the predicted <it>ori </it>region differs markedly by 300 bp between the two <it>Anaplasma </it>spp <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B36">36</abbr></abbrgrp>. In <it>A. marginale</it>, the <it>ori </it>is 620 bp; in <it>A. phagocytophilum </it>it is just 385 bp (Table <tblr tid="T1">1</tblr>). Both species have a single CtrA binding site and a single IHF binding site that differ in both sequence and position. <it>A. marginale </it>has two DnaA boxes while <it>A. phagocytophilum </it>has four [see Figure <figr fid="F1">1</figr> and Additional Table <supplr sid="S1">1</supplr>].</p>
            <p>Origin of replication features of the different <it>Ehrlichia </it>spp. examined in this study are summarized in Table <tblr tid="T1">1</tblr>. Multiple possible IHF binding sites are found for each member (depicted in Figure <figr fid="F1">1</figr>). Of special interest is the absence of CtrA binding sites in <it>E. ruminantium</it>. However, if two mismatches are allowed two putative CtrA sites are found.</p>
            <p>The putative <it>ori </it>region of the four different <it>Rickettsia </it>spp. examined here was predicted by searching for the previously described <it>hemE</it>/COG1806 <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> [see Figure <figr fid="F1">1</figr>, Table <tblr tid="T1">1</tblr> and Additional Table <supplr sid="S1">1</supplr>]. Following the same type of analysis, two putative <it>ori </it>regions can be identified in <it>N. sennetsu </it>that are unlike other bacteria examined in this study [see Table <tblr tid="T1">1</tblr> and Additional Table <supplr sid="S1">1</supplr>]. The first region is located between COG1806 and COG1253, is 553 bp long, and has no DnaA, no CtrA and no IHF binding site. This intergenic region contains a hypothetical gene (92 amino acids long) and a predicted tRNA-Arg. The second region is located between <it>hemE </it>and an uncharacterized phage protein, is 408 bp long, and has no DnaA, no CtrA and two IHF binding sites. Although less clearly defined by the binding sites, this latter region is close to the shift in GC-skew <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> and its AT-content is higher when compared to the respective genome. Additionally, when it is compared with the first one, it appears to have more IHF binding sites in its immediate neighborhood. The above data indicate that the second region may be the putative <it>ori </it>region of <it>N. sennetsu </it>(Figure <figr fid="F1">1</figr>); however, it is possible that the proposed search criteria may not be appropriate for the identification of the origin of replication of <it>N. sennetsu</it>.</p>
            <p>Taken together, the <it>in silico </it>approach can predict the putative origin of replication of most bacterial species closely related to <it>Wolbachia </it>including <it>Ehrlichia</it>, <it>Anaplasma</it>, and <it>Rickettsia</it>. In addition, the Maximum Likelihood derived trees demonstrate a clear concordance of the <it>ori </it>region and 16S rRNA phylogenies for the <it>Rickettsia </it>genus, <it>Anaplasma/Ehrlichia </it>genera, and <it>Wolbachia </it>supergroups [see Additional Figure 2].</p>
         </sec>
         <sec>
            <st>
               <p>Recombination of the <it>Wolbachia ori </it>region</p>
            </st>
            <p>Significant recombination events at the <it>ori </it>region of <it>Wolbachia </it>were detected within both supergroup A and B and between the two supergroups (MaxChi, P &lt; 0.001). Most of the recombination breakpoints fell at one of the two edges of the intergenic region (see graph in Figure <figr fid="F3">3A</figr>). The majority of binding sites (excluding DnaA_2 and CtrA_2) occur within a region that experienced a similar low number of recombination events, suggesting that usually these sites, when recombining, are exchanged as a unique sequence tract. A likely recombination breakpoint occurred in the nucleotide region between IHF_1 and DnaA_2 (see arrow, Fig. <figr fid="F3">3A</figr>) indicating a potential shuffling among binding sites at this region.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Recombination within <it>Wolbachia ori </it>sequences</p>
               </caption>
               <text>
                  <p><b>Recombination within <it>Wolbachia ori </it>sequences</b>. Recombination results of the <it>Wolbachia ori </it>based on MaxChi (P &lt; 0.001). (A) pattern of distribution of recombination breakpoints along the nucleotide alignment of 38 strains belonging to supergroup A and B. The cumulative number of recombination events per site is given. The alignment includes the beginning of locus <it>COG1253 </it>(WD1027, 1&#8211;168 bp), the intergenic non-coding region (INT, 169&#8211;586 bp), and the beginning of locus <it>hemE </it>(WD1028, 587&#8211;673 bp). Most of the breakpoints fall at the two edges of the intergenic region. Location of binding sites in the intergenic region is also shown. CtrA_1: 225-239 bp; CtrA_2: 489-503 bp (complementary strand); CtrA_3: 251-265 bp; CtrA_4: 228-242 (complementary strand); DnaA_1: 406-414 bp; DnaA_2: 461-469 bp; DnaA_3: 246-254 bp (complementary strand); IHF_1: 430-442 bp. Arrow indicates a major breakpoint. (B) Two examples of recombination among strains within supergroup B and between supergroup A and B. Only polymorphic sites are shown. For each example, putative recombination breakpoints are predicted based on the difference of shared polymophisms among the three strains and are indicated with an arrow (corresponding position in the nucleotide alignment is given). The <it>w</it>Aenc_B <it>Wolbachia ori </it>sequence (belonging to supergroup B) is indicated as recombinant between the corresponding <it>ori </it>sequences of <it>w</it>Mel_A (A-supergroup) and <it>w</it>Ma_B (B-supergroup) strains. A similar recombination event, but among supergroup B strains, is observed in <it>w</it>Calt_B, which appears to be recombinant between <it>w</it>Osca_B and <it>w</it>Vul_B.</p>
               </text>
               <graphic file="1471-2164-8-182-3"/>
            </fig>
            <p>Most of the recombination breakpoints per sequence were single and thus involved recombination of sequences encompassing one of the two halves of the alignment. Clear examples of recombinant tracts can be visualized by simple examination of the shared nucleotide polymorphisms among triplets of sequences (see example in Figure <figr fid="F3">3B</figr>). For instance, the <it>ori </it>region of <it>w</it>Aenc_B, a strain from the host <it>A. encedon </it>belonging to supergroup B, shows a clear recombinant pattern between A- and B-type sequences. Specifically, the first 210 bp of the sequence from the <it>w</it>Aenc_B strain shares most of the polymorphisms (70) with the B strain <it>w</it>Ma_B, from <it>D. simulans</it>, while the remaining sequence portion shares most of polymorphisms (42) with the A strain <it>w</it>Mel-A, from <it>D. melanogaster</it>. As a result, <it>w</it>Aenc_B places as a deep branch of supergroup B, separated by a great distance from other B-strains (bootstrap value, P = 82; data not shown); this apparent phylogenetic divergence is actually due to recombination within the sequence. A similar case of recombination is found for the <it>w</it>Calt_B strain and involves B-type sequences. The first 500 bp of the <it>w</it>Calt_B sequence shares most of the polymorphisms with the <it>w</it>Vul sequence, while the following 173 bp share most polymorphisms with the <it>w</it>OscaB sequence. Other instances of recombination involve the <it>Wolbachia </it>A-strains from <it>A. albopictus, A. sparsa, C. pennsylvanicus </it>and S. <it>invicta</it>, denoted A2 in figure <figr fid="F2">2</figr>. These four <it>ori </it>sequences are recombinant between A and B-types sequences (MaxChi, P &lt; 0.001).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In the present study, we provide <it>in silico </it>evidence for the location of the origin of replication in <it>Wolbachia </it>and its close relatives: <it>Ehrlichia</it>, <it>Anaplasma</it>, <it>Rickettsia </it>and <it>Neorickettsia</it>. The analysis included fifty-three <it>Wolbachia </it>strains and ten additional strains from the other closely related bacterial species. All the origins predicted here are 383&#8211;620 bp long. The <it>Ehrlichia</it>, <it>Anaplasma</it>, <it>Neorickettsia</it>, and <it>Rickettsia </it>predicted origins herein do align appropriately with the shift of GC-skew found in these genomes <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. However, the actual shift occurs over a fairly large region (~20 kB) and thus is not sufficient for identifying the precise <it>ori</it>. The <it>Wolbachia </it>genomes have no clear shift in GC-skew which could indicate a putative origin of replication <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B17">17</abbr></abbrgrp>. The fact that <it>Wolbachia </it>genomes do not present a strong shift of GC-skew may be due to extensive intragenomic recombination events that in addition may have also eliminated the synteny between the genomes of the genes flanking the <it>ori</it>, another feature of the <it>Wolbachia </it>lineage. Recombination has been recently shown to be widespread across <it>Wolbachia </it>genomes <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Based on our analyses, it has clearly played a role in shaping the <it>ori </it>region of <it>Wolbachia </it>and potentially shuffled the binding sites, thus giving rise to chimeric sequences. Whether and how these DNA rearrangements have affected the replication performance in the recombinant strains remains a subject of future investigation.</p>
         <sec>
            <st>
               <p>Prediction of origins in genomes</p>
            </st>
            <p>As a non-coding region, the <it>ori </it>region is often overlooked and not annotated in genomes. However, this is a significant issue since it has an essential role in DNA replication. Therefore, we have established several criteria that should be used to identify the origin from genome sequencing data including: (a) boundary genes that are homologous to those of closely related bacteria, (b) an intergenic region 200&#8211;1000 bp in size, (c) presence of appropriate binding sites (primarily for DnaA and IHF), (d) an increased distribution of the appropriate binding sites when compared to other intergenic regions, (e) increased AT content relative to the genome, (f) increased homology to closely related sequences relative to the genome, and (g) a shift in GC-skew. Although all seven elements may not be present in all genomes, a significant combination of these elements should allow for a better prediction of this important region.</p>
         </sec>
         <sec>
            <st>
               <p>The origin of replication and the CtrA, DnaA and IHF binding sites</p>
            </st>
            <p>The presumed <it>ori </it>regions of the bacterial species and strains of the present study are characterized by the presence of DnaA, CtrA and IHF boxes (the exception being <it>N. sennetsu</it>). The DnaA and IHF boxes are present in all bacterial <it>ori </it>regions characterized. However, the CtrA boxes seem to be restricted, as yet, to the <it>ori </it>regions of the &#945;-Proteobacteria. In <it>C. crescentus</it>, CtrA is a global cell cycle regulator <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and more specifically, the response regulator protein of a two-component signal transduction signal. Upon phosphorylation by a sensor histidine kinase, CtrA binds to its corresponding binding sites and represses chromosomal replication <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. CtrA's binding sites are overlapping with a DnaA box and the IHF binding site. Thus, binding of CtrA prevents binding of DnaA and IHF <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. CtrA is degraded before the onset of the S phase by the protease ClpXP allowing DnaA and IHF binding <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp>. Despite <it>A. marginale </it>CtrA having 59% amino acid identity with its <it>w</it>Mel ortholog, <it>A. marginale </it>and <it>A. phagocytophilum ori </it>regions have only a single CtrA binding site, as opposed to four in <it>w</it>Mel. <it>E. ruminantium </it>has no CtrA binding site in its putative <it>ori </it>region while its CtrA protein is 80% identical to the <it>A. marginale </it>ortholog. Similarly, the putative <it>ori </it>regions of the pseudoscorpion <it>C. scorpioides Wolbachia </it>strain, <it>E. chaffeensis</it>, <it>E. canis</it>, <it>R. conorii </it>and the filarial nematode <it>B. malayi Wolbachia </it>strain (<it>w</it>Bm) have zero, one, three, two and two CtrA binding sites respectively. These observations suggest that CtrA may be dispensable in these bacteria. Interestingly, the bacteria with zero to three CtrA binding sites in their <it>ori </it>region are not associated with insects. In contrast, insect-associated <it>Wolbachia </it>and <it>Rickettsia </it>bacteria usually present high number of CtrA binding sites present in their <it>ori </it>region ranging from four to seven (the exception being <it>R. felis </it>which has three CtrA binding sites). Whether bacterial growth control is host-dependent and is regulated through the number of the CtrA binding sites in the origin of replication region awaits experimental confirmation.</p>
            <p>It is worth noting that there are orthologs to <it>dnaA</it>, <it>ctrA </it>and <it>ihfAB </it>in the bacterial genomes studied except <it>N. sennetsu </it>and <it>A. marginale</it>. They are both missing IHF-&#946; but have retained the IHF-&#945; subunit [see Additional Table <supplr sid="S3">3</supplr>]. Another interesting observation is that the filarial nematode <it>B. malayi Wolbachia </it>strain (<it>w</it>Bm) and <it>A. phagocytophilum </it>are missing both <it>parA </it>and <it>parB</it>, which are involved in partitioning the chromosomes <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B36">36</abbr></abbrgrp>. This suggests that <it>Wolbachia </it>and <it>Anaplasma </it>replication may be quite interesting; experimental work is needed in order to clarify the roles of replication-associated proteins in binding the <it>ori </it>and initiating DNA replication initiation process.</p>
            <p>IHF binding sites were found in all bacteria using the consensus WATCAN<sub>5</sub>WTR <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. In some <it>Wolbachia </it>strains, two to three such candidate sites were found within their putative <it>ori </it>regions. In these cases, only the common one was retained <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. In contrast, <it>A. marginale </it>and <it>A. phagocytophilum </it>had different IHF binding sites in both in sequence and in position. Two conserved putative IHF binding sites were detected in the putative <it>ori </it>region of <it>E. canis </it>and the two <it>E. ruminantium </it>strains, with all of them being similar both in sequence and position while a single putative IHF binding site was present in <it>E. chaffeensis </it>which was the only common IHF binding site found in all four <it>Ehrlichia </it>strains. IHF positioning varied between the four <it>Rickettsia </it>species. The only exception appears to be the conservation in position as well as in sequence between <it>R. prowazekii </it>and <it>R. typhi</it>. The IHF-&#946; subunit is missing from the genome of <it>A. marginale</it>. The only IHF identified, AM_006, is 43.2% identical (amino acid level) to the <it>w</it>Mel IHF-&#945; subunit (WD_0057). <it>A. phagocytophilum</it>, all four <it>Ehrlichia</it>, and all four <it>Rickettsia </it>species have both IHF subunits. <it>N. sennetsu </it>also lacks IHF-&#946; [see Additional Table <supplr sid="S3">3</supplr>]. Whether the absence of the IHF-&#946; subunit is somehow correlated with the divergence/absence of IHF binding sites in <it>Anaplasma </it>and <it>Neorickettsia </it>is not known.</p>
         </sec>
         <sec>
            <st>
               <p>The evolution of the origin of replication of <it>Wolbachia</it>, <it>Ehrlichia</it>, <it>Anaplasma</it>, <it>Rickettsia </it>and <it>Neorickettsia</it></p>
            </st>
            <p>The difference in the boundary genes observed between the closely related <it>Anaplasma</it>, <it>Ehrlichia</it>, <it>Wolbachia </it>and the other &#945;-Proteobacteria is likely due to chromosomal rearrangements that have taken place in the <it>ori </it>region. All of the origins examined have as boundaries either the COG1253 &#8211; <it>hemE </it>pair (<it>Wolbachia</it>, <it>Ehrlichia </it>and <it>Anaplasma</it>), or the <it>hemE </it>&#8211; COG1806 pair (<it>Caulobacter </it>and <it>Rickettsia</it>), or the <it>hemE </it>&#8211; uncharacterized phage protein pair (<it>N. sennetsu</it>). Overlaying these observations on the phylogeny of the &#945;-Proteobacteria <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, the most likely ancestral configuration is that found in <it>Caulobacter </it>and <it>Rickettsia</it>. Sometime after the branching of the <it>Anaplasmataceae</it>, a rearrangement took place that repositioned the COG1253 gene to the position of COG1806 orthologs in <it>Wolbachia</it>, <it>Ehrlichia</it>, and <it>Anaplasma </it>with a second rearrangement occurring in <it>N. sennetsu</it>. The exact nature of these rearrangements cannot be determined since the synteny of the chromosomes has been lost. As discussed earlier, the distribution of <it>ori</it>-specific binding sites (DnaA, CtrA and IHF) is consistent with the currently accepted phylogeny of <it>Wolbachia</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Growth control, infection levels and <it>Wolbachia</it>-induced phenotypes</p>
            </st>
            <p>It has been widely accepted that bacterial infection levels are positively correlated with the virulent and/or pathogenic properties of bacterial pathogens <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. Similarly, several studies have shown that <it>Wolbachia</it>'s ability to induce reproductive and/or virulent phenotypes is positively correlated with their intra-host infection levels <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B54">54</abbr><abbr bid="B67">67</abbr></abbrgrp>. Furthermore, intracellular bacteria that are transmitted through the eggs of their hosts must be "prudent replicators" in order to ensure their transmission to future generations by preventing overgrowth that could lead to host death prior to reproduction. This raises the question: is the DNA replication and growth of an obligate intracellular bacterium under bacterial control, host control, phage control or some combination? Overreplication by the "popcorn" strain of <it>Wolbachia </it>in <it>D. melanogaster </it>suggests that the bacterial strain has a strong influence on replication rates within the host <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, although effects of the host also occur <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and effects of phage have not been tested <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <p>The results of the present study clearly indicate that no major differences could be detected between the <it>ori </it>regions of <it>Wolbachia </it>strains differing in ability to induce CI, virulence, parthenogenesis or feminization. All <it>Wolbachia </it>strains presented the same organization in their putative origin of replication: COG1253 and <it>hemE </it>as boundary genes, three DnaA boxes, two to seven CtrA boxes and a single IHF binding site. A notable exception is the pseudoscorpion <it>Wolbachia </it>strain which presents zero CtrA binding sites. The possibility that DNA binding specificity of CtrA has changed in these bacteria is an intriguing hypothesis. CtrA contains two domains, a receiver domain at the N-terminus and a transcriptional regulatory (DNA binding) effector domain at the C-terminus. These domains are nearly identical between <it>w</it>Mel and <it>w</it>Bm with only one positive amino acid substitution in each domain. All other changes are located at the far C-terminus where no domains are found. Since a single amino acid change can greatly affect DNA binding, it is conceivable that these two positive substitutions can alter CtrA's binding specificity. A second hypothesis is that CtrA's DNA-binding specificity did not change in <it>w</it>Bm, and only the number of CtrA binding sites reduced. This CtrA binding site reduction may be associated with the mutualistic nature of the <it>Wolbachia </it>&#8211; <it>B. malayi </it>association and may have resulted in a novel control of bacterial replication. These hypotheses need to be tested experimentally.</p>
         </sec>
         <sec>
            <st>
               <p>Origin of replication and genetic transformation system</p>
            </st>
            <p>Evidence presented here for the prediction of <it>ori </it>location, is based on the boundary genes and on the <it>in silico </it>finding of characteristic DnaA, CtrA, and IHF binding sites that have been experimentally confirmed only in <it>C. cresentus </it>and partially in <it>R. prowazekii </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Lack of a robust genetic transformation system for <it>Wolbachia</it>, <it>Ehrlichia</it>, <it>Neorickettsia </it>and most <it>Anaplasma </it>and <it>Rickettsia </it>precludes experimental verification in the bacterial species of the present study. However, significant progress has been made toward the development of a robust genetic transformation for <it>A. phagocytophilum </it>and <it>R. prowazekii </it>using homologous recombination-based and transposon-based approaches <abbrgrp><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr></abbrgrp>. The fact that these bacteria (both <it>Wolbachia </it>and relatives) can be maintained in different cell lines <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr></abbrgrp>, the availability of complete genomic information <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr><abbr bid="B75">75</abbr><abbr bid="B76">76</abbr></abbrgrp> and, the presence in some of them, such as <it>Wolbachia</it>, of endogenous phages and insertion sequences <abbrgrp><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr></abbrgrp> will certainly facilitate current efforts for the genetic transformation of these intracellular bacteria.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We provide <it>in silico </it>evidence for the location of the origin of replication in <it>Wolbachia </it>and its close relatives: <it>Ehrlichia</it>, <it>Anaplasma</it>, <it>Rickettsia </it>and <it>Neorickettsia</it>. Putative origins of replication, which are usually 200&#8211;1000 bp long, have features in common that were used to establish a set of guidelines for properly predicting the origin in genomes. Several of the bacteria had variable sequences/numbers of key binding sites suggesting altered modes of replication. This is supported by the lack of specific replication associated enzymes.</p>
         <p>Intracellular bacteria that are transmitted through the eggs of their hosts are thought to be "prudent replicators" in order to ensure their transmission to future generations by not causing lethality of their host prior to reproduction. This raises the question: is the DNA replication and growth of an obligate intracellular bacterium under bacterial control, host control, phage control or some combination? The results of the present study clearly indicate that no major differences could be detected between the <it>ori </it>regions of <it>Wolbachia </it>strains inducing or non-inducing CI, being virulent or non-virulent, inducing parthenogenesis or feminization. In addition, recombination across the <it>ori </it>region was demonstrated for the main arthropod supergroups, A and B. Lastly and surprisingly, the origin boundary genes have changed twice in the evolution of this order of bacteria.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>DNA extraction</p>
            </st>
            <p>Various insect hosts and <it>Wolbachia </it>strains were used for DNA extraction in the present study [see Additional Table <supplr sid="S1">1</supplr>]. Fly stocks were reared on standard corn flour &#8211; sugar &#8211; yeast medium at 25&#176;C. Bacterial DNA was extracted using the DNeasy Tissue Kit (Qiagen) according to the manufacturer's instructions.</p>
         </sec>
         <sec>
            <st>
               <p>PCR amplification and sequencing of the <it>Wolbachia ori </it>region</p>
            </st>
            <p>A PCR strategy [see Results, Additional Figure 1 and Additional Table <supplr sid="S2">2</supplr>] was developed to amplify the <it>ori </it>region from the following <it>Wolbachia </it>strains belonging to supergroup A and B: <it>w</it>Mel, <it>w</it>MelPop, <it>w</it>Au, <it>w</it>Ri, <it>w</it>Ha, <it>w</it>Yak, <it>w</it>Tei, <it>w</it>San, <it>w</it>No, <it>w</it>Ma and <it>w</it>Mau. Various PCR primers were used for the amplification reactions [see Additional Table <supplr sid="S1">1</supplr>]. PCR reaction mixtures contained 1&#215; Taq buffer (750 mM Tris-Cl pH 8.8, 200 mM (NH<sub>4</sub>)<sub>2</sub>SO<sub>4</sub>, 0.1% Tween 20), 2 mM MgCl<sub>2</sub>, 125 &#956;M dNTPs, 12.5 pmol of each primer, 1 unit Taq polymerase (Promega) and 25 ng/&#956;l template DNA and were cycled with an initial denaturing step at 94&#176;C for 10 minutes, 35 cycles of 94&#176;C for 30 seconds, 54&#176;C for 30 seconds and 72&#176;C for 3 minutes followed by a final extension at 72&#176;C for 10 minutes. PCR reactions were purified with Qiagen nucleotide removal kit or Qiagen gel extraction kit depending on the existence or not, of byproducts. Sequencing was performed by Macrogen (Korea). Sequence trace files from sequencing reactions were processed using the DNAStar 5.0 suite of programs.</p>
            <p>A smaller region of <it>ori </it>was amplified and sequenced from a greater variety of <it>Wolbachia </it>strains according to the methods mentioned in <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>. Reactions to generate these smaller amplicons were attempted for 52 <it>Wolbachia </it>strains from almost all described supergroups using standard PCR conditions with HotStarTaq (Qiagen), according to the manufacturer's suggestions, with 0.5 &#956;M of each WD_1027_R and WD_1028_R. Reactions were initiated with a 15 minute incubation at 95&#176;C followed by 50 cycles of 95&#176;C for 30 seconds, 55&#176;C for 30 seconds, 72&#176;C for 1 min. The primers WD_1027_R and WD_1028_R had 5'-tags of M13 forward and reverse primer sequences, respectively, to serve as anchors for the degenerate primers in later stages of amplification and were later used for sequencing. Amplification reactions (8 &#956;L) were treated with 0.5 U shrimp alkaline phosphatase and 1.0 U exonuclease I (Amersham) in the supplied buffer. Sequencing reactions were performed at the J. Craig Venter Joint Technology Center (Rockville, MD) with M13 forward and reverse primers. Assembly was done with the TIGR assembler and manually curated with Cloe (ClosureEditor, a TIGR program for editing assemblies). The alignment was initially generated using CLUSTALW and curated in Bioedit v. 7.0.4.1.</p>
            <p>The nucleotide sequences reported in this study have been deposited in GenBank under accession numbers <ext-link ext-link-type="gen" ext-link-id="DQ498834">DQ498834</ext-link> &#8211; <ext-link ext-link-type="gen" ext-link-id="DQ498882">DQ498882</ext-link>.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence acquisition and analysis from publicly available genomes</p>
            </st>
            <p>Sequence information was used from publicly available genome data (GenBank) of <it>Wolbachia </it>and other closely related bacterial species and strains (Table <tblr tid="T1">1</tblr>).</p>
         </sec>
         <sec>
            <st>
               <p><it>In silico </it>prediction of the <it>ori </it>region</p>
            </st>
            <p>For <it>ori</it>-boundary identification, genes homologous to COG1253 (see results for the new proposed gene name of the CBS domain protein) and <it>hemE </it>were identified using BLAST. Moreover, the CBS domain protein gene was searched for conserved domains using the HMMer program <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> and the Pfam_fs hidden markov model (HMM) database.</p>
            <p>Binding sites were identified using local perl scripts based on the previously determined consensus sequences <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. For the CtrA binding site (TTAA-N<sub>7</sub>-TTAA), a single mismatch was allowed only in the A's and only if these mismatches followed the looser consensus TTWW-N<sub>7</sub>-TTWW described previously <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. For the DnaA binding site, the consensus TTATNCACA was used <abbrgrp><abbr bid="B80">80</abbr></abbrgrp>. For the IHF-binding site, the consensus WATCAN<sub>5</sub>WTR was used <abbrgrp><abbr bid="B81">81</abbr><abbr bid="B82">82</abbr></abbrgrp>.</p>
            <p>In order to examine the distribution of the CtrA-, IHF-, and DnaA-binding sites in the intergenic regions of each genome were identified using fuzznuc from the EMBOSS package <abbrgrp><abbr bid="B83">83</abbr></abbrgrp> and locally developed scripts.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>The alignment for the figure <figr fid="F2">2</figr> tree was generated using ClustalX 1.83 for Windows <abbrgrp><abbr bid="B84">84</abbr></abbrgrp> with the default parameters and trimmed using BioEdit for Windows <abbrgrp><abbr bid="B85">85</abbr></abbrgrp>. Maximum likelihood (ML) methods were used to infer phylogenetic relationships. Prior to ML analysis, a DNA substitution model was selected using Modeltest v3.06 and the Akaike information criterion (AIC). The selected model of evolution was based on a 382 bp <it>ori </it>region with indels excluded (TVM+G). ML heuristic searches were performed using 100 random taxon addition replicates with tree bisection and reconnection (TBR) branch swapping. ML bootstrap support was determined using 100 bootstrap replicates, each using 10 random taxon addition replicates with TBR branch swapping. Searches were performed in parallel on a Beowulf cluster using a clusterpaup program and PAUP version 4.0b10 <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>.</p>
            <p>Conservation of binding domains in the <it>Wolbachia ori </it>region was investigated within three clades of closely related <it>Wolbachia </it>(B1, B2 and A1). It should be noted here that the A2 clade (see Figure <figr fid="F2">2</figr>) was not used because it is recombinant between the A and B supergroups in the <it>ori </it>region (see text for details). Binding site positions were identified based on a reference strain within each clade; this was straightforward due to relatively low variation within each clade. Positions were then classified as binding or non-binding. Positions within binding domains with free nucleotide designations (N) were classified as non-binding sites. In cases where binding domains were overlapping, if both positions were N sites, then the position was classified as non-binding, otherwise it was classified as a binding position. All positions within the IGR were then scored as polymorphic (containing at least one polymorphism) or not. Fisher Exat Tests were used to compare the proportions of polymorphic and non-polymorphic positions in binding sites versus non-binding sites.</p>
         </sec>
         <sec>
            <st>
               <p>Recombination analyses</p>
            </st>
            <p>The <it>ori </it>region of <it>Wolbachia </it>was searched for recombination signature by using the Maximum Chi Square (MaxChi2) program, implemented in the RDP2 program <abbrgrp><abbr bid="B87">87</abbr></abbrgrp>. MaxChi is a local method that uses a sliding window approach to search for putative recombination breakpoints in a set of aligned DNA sequences. Significant discrepancies between the two partitions of the window are calculated based on the difference in the number of variable sites (VI) on either sides of the central partition. A Chi-square statistics is applied. The step size was set to 20 nucleotides and the window size set to 20 VI, gaps were included and a Bonferroni correction was applied. The highest acceptable P-value cut-off was set to 0.001 and 1000 permutation were generated. We analyzed only strains belonging to supergroup A and B and having sequences of equal length encompassing a portion of both flanking genes. The final alignment was 673 base pairs long and included 38 strains (19 A- and 19 B-strains). The alignment was partitioned in dataset A and B (from the two supergroups) and recombination analyses were run on both single and combined datasets.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p><it>ori</it>: origin of DNA replication</p>
         <p>CBS: Cystathione-beta-synthase domain</p>
         <p>IHF: Integration Host Factor</p>
         <p>IGR: Intergenic Region</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>PI designed primers, amplified and analyzed sequences, carried bioinformatics analysis and drafted the manuscript.</p>
         <p>JCDH amplified and analyzed sequences across diverse <it>Wolbachia </it>strains, carried genomics and bioinformatics analysis, and drafted portions of the manuscript</p>
         <p>PS designed primers, amplified and analyzed sequences across diverse <it>Wolbachia </it>strains.</p>
         <p>SS amplified and analyzed sequences across diverse <it>Wolbachia </it>strains.</p>
         <p>GT amplified and analyzed sequences across diverse <it>Wolbachia </it>strains.</p>
         <p>SRB designed primers, carried out the phylogenetic analysis and drafted the corresponding portion of the manuscript.</p>
         <p>LB carried out the recombination analysis and drafted the corresponding portion of the manuscript.</p>
         <p>JHW participated in the design of the study and coordination of research.</p>
         <p>KB conceived of the study, participated in its design and coordination and drafted the manuscript.</p>
         <p>All authors read and approved the final manuscript.</p>
         <suppl id="S1">
            <title>
               <p>Additional file 1</p>
            </title>
            <text>
               <p><b>Additional Figure 1 </b>&#8211; PCR strategy followed for amplifying and sequencing the <it>ori </it>region along with the two flanking genes of different <it>Wolbachia </it>strains. FRAF1 is either A-group (FRAF1MEL) or B-group (FRAF1PIP) specific and binds in slightly different positions.</p>
            </text>
            <file name="1471-2164-8-182-S1.jpeg">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <text>
               <p><b>Additional Figure 2 </b>&#8211; The Maximum Likelihood derived trees demonstrate a clear concordance of the <it>ori </it>region and 16S <it>rRNA </it>phylogenies for the <it>Rickettsia </it>genus (Figure <figr fid="F2">2C</figr>), <it>Anaplasma/Ehrlichia </it>genera (Figure <figr fid="F2">2B</figr>), and <it>Wolbachia </it>supergroups with one minor exception (Figure <figr fid="F2">2A</figr>). There is a weakly supported trifurcation in the 16S <it>rRNA </it>phylogeny (spanning supergroups A, B, and E, 64% bootstrap support) that is inconsistent with the <it>ori </it>phylogeny (98% bootstrap support). Supergroups A, B, D, E, and F are represented by sequences from <it>D. melanogaster</it>, <it>C. pipiens</it>, <it>B. malayi</it>, <it>F. candida</it>, and <it>C. lectularius</it>, respectively. The likelihood-based Shimodaira-Hasegawa (SH) test for alternative tree topologies <abbrgrp><abbr bid="B90">90</abbr></abbrgrp> however revealed no significant difference between any of the Figure <figr fid="F2">2</figr> topologies, specifying the <it>ori </it>region as a sufficient genetic marker for the broad evolutionary relationships in this intracellular clade. Selected DNA substitution models of evolution were selected using Modeltest v3.06 and the Akaike information criterion (AIC). They were based on a 655 bp <it>ori </it>region (TVM+I) and 1377 bp 16S <it>rRNA </it>sequence (HKY) for <it>Wolbachia</it>, a 395 bp <it>ori </it>region (GTR+I) and 1454 bp 16S <it>rRNA </it>sequence (GTR+I) for the <it>Anaplasma/Ehrlichia </it>genera, and a 382 bp <it>ori </it>region (HKY+I) and 1498 bp 16S <it>rRNA </it>sequence (TrN) for <it>Rickettsia</it>. ML heuristic searches were performed using 500 random taxon addition replicates with tree bisection and reconnection (TBR) branch swapping. ML bootstrap support was determined using 500 bootstrap replicates, each using 10 random taxon addition replicates with TBR branch swapping. Searches were performed in parallel on a Beowulf cluster using a clusterpaup program and PAUP version 4.0b10 <abbrgrp><abbr bid="B86">86</abbr></abbrgrp>.</p>
            </text>
            <file name="1471-2164-8-182-S2.jpeg">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional file 3</p>
            </title>
            <text>
               <p><b>Additional Table 1 </b>&#8211; <it>Wolbachia </it>strains and closely related bacterial species used in this work.</p>
            </text>
            <file name="1471-2164-8-182-S3.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S4">
            <title>
               <p>Additional file 4</p>
            </title>
            <text>
               <p><b>Additional Table 2 </b>&#8211; Primers used in this study.</p>
            </text>
            <file name="1471-2164-8-182-S4.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S5">
            <title>
               <p>Additional file 5</p>
            </title>
            <text>
               <p><b>Additional Table 3 </b>&#8211; Replication enzymes in complete genomes.</p>
            </text>
            <file name="1471-2164-8-182-S5.doc">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S6">
            <title>
               <p>Additional file 6</p>
            </title>
            <text>
               <p><b>Additional Text File 1 </b>&#8211; Consensus sequence of the <it>ori </it>region based on which the PCR strategy presented in Additional Figure 1 and Additional Table <supplr sid="S3">3</supplr> was developed.</p>
            </text>
            <file name="1471-2164-8-182-S6.fas">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the following individuals for providing arthopods for this study: D. Bouchon, K. Christianson, K. Dyer, M. Hoy, S. Dobson, R. Harrison, G. Hurst, J. Jaenike, F. Jiggins, N. Lo, J. Marshall, J. Rasgon, T. Sasaki, D. Shoemaker, R. Stouthamer, M. Wade, D. Windsor, D. Zeh, and J. Zeh. We would like to thank Rahul Nene for assistance in generating sequences at TIGR and Herv&#233; Tettelin, Stefan Oehler and Patrick Mavingui for helpful discussions and in commenting on an earlier version of the manuscript. We also thank three anonymous reviewers for helpful comments on the manuscript. PI, PS, SS, GT and KB acknowledge support of their work from intramural funding from the University of Ioannina. SB, JDH, LB and JW acknowledge support of their work from the U.S. National Science Foundation grant EF-0328363. SB also acknowledges the support from the NASA Astrobiology Institute (NNA04CC04A).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Wolbachia pipientis: microbial manipulator of arthropod reproduction</p>
            </title>
            <aug>
               <au>
                  <snm>Stouthamer</snm>
                  <fnm>R</fnm>
                  <suf>Breeuwer JAJ, Hurst GDD</suf>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>53</volume>
            <fpage>71&#8211;102</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.micro.53.1.71</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Insect Symbiosis</p>
            </title>
            <aug>
               <au>
                  <snm>Bourtzis</snm>
                  <fnm>K</fnm>
                  <suf>Miller TA</suf>
               </au>
            </aug>
            <publisher>Boca Raton, FL , CRC Press</publisher>
            <pubdate>2003</pubdate>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Influential passengers: inherited microorganism and arthropod reproduction</p>
            </title>
            <aug>
               <au>
                  <snm>O&#8217;Neill</snm>
                  <fnm>SL</fnm>
                  <suf>Hoffmann AA, Werren JH</suf>
               </au>
            </aug>
            <publisher>New York, NY , Oxford University Press</publisher>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Biology of Wolbachia</p>
            </title>
            <aug>
               <au>
                  <snm>Werren</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Annu Rev Entomol</source>
            <pubdate>1997</pubdate>
            <volume>42</volume>
            <fpage>587&#8211;609</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.ento.42.1.587</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Wolbachia bacteria of filarial nematodes in the pathogenesis of disease and as a target for control</p>
            </title>
            <aug>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Trans R Soc Trop Med Hyg</source>
            <pubdate>2000</pubdate>
            <volume>94</volume>
            <fpage>596</fpage>
            <lpage>598</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0035-9203(00)90201-3</pubid>
                  <pubid idtype="pmpid">11198638</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Wolbachia-induced incompatibility precedes other hybrid incompatibilities in Nasonia</p>
            </title>
            <aug>
               <au>
                  <snm>Bordenstein</snm>
                  <fnm>SR</fnm>
                  <suf>O&#8217;Hara FP, and Werren JH</suf>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>707&#8211;710</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1038/35055543</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Wolbachia-induced cytoplasmic incompatibility as a means for insect pest population control</p>
            </title>
            <aug>
               <au>
                  <snm>Zabalou</snm>
                  <fnm>S</fnm>
                  <suf>Riegler M, Theodorakopoulou M, Stauffer C, Savakis C, Bourtzis K</suf>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>15042</fpage>
            <lpage>15045</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524042</pubid>
                  <pubid idtype="pmpid" link="fulltext">15469918</pubid>
                  <pubid idtype="doi">10.1073/pnas.0403853101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Wolbachia establishment and invasion in an Aedes aegypti laboratory population</p>
            </title>
            <aug>
               <au>
                  <snm>Xi</snm>
                  <fnm>Z</fnm>
                  <suf>Khoo CC, Dobson SL</suf>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>310</volume>
            <fpage>326</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1117607</pubid>
                  <pubid idtype="pmpid" link="fulltext">16224027</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Influence of antibiotic treatment and Wolbachia curing on sexual isolation among Drosophila melanogaster cage populations</p>
            </title>
            <aug>
               <au>
                  <snm>Koukou</snm>
                  <fnm>K</fnm>
                  <suf>Pavlikaki H, Kilias J, Werren JH, Bourtzis K, Alahiotis SN</suf>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>2006</pubdate>
            <volume>60</volume>
            <fpage>87</fpage>
            <lpage>96</lpage>
            <xrefbib>
               <pubid idtype="pmpid">16568634</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The tripartite associations between bacteriophage, Wolbachia, and arthropods</p>
            </title>
            <aug>
               <au>
                  <snm>Bordenstein</snm>
                  <fnm>SR</fnm>
                  <suf>Marshall ML, Fry AJ, Kim U, Wernegreen JJ</suf>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e43</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1463016</pubid>
                  <pubid idtype="pmpid" link="fulltext">16710453</pubid>
                  <pubid idtype="doi">10.1371/journal.ppat.0020043</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Cytoplasmic incompatibility and bacterial density in Nasonia vitripennis</p>
            </title>
            <aug>
               <au>
                  <snm>Breeuwer</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Werren</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>1993</pubdate>
            <volume>135</volume>
            <fpage>565</fpage>
            <lpage>574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1205656</pubid>
                  <pubid idtype="pmpid" link="fulltext">8244014</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Wolbachia density and virulence attenuation after transfer into a novel host</p>
            </title>
            <aug>
               <au>
                  <snm>McGraw</snm>
                  <fnm>EA</fnm>
                  <suf>Merritt DJ, Droller JN, O&#8217;Neill SL</suf>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>2918&#8211;2923</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.052466499</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Cytoplasmic incompatibility and sperm cyst infection in different Drosophila &#8211; Wolbachia associations</p>
            </title>
            <aug>
               <au>
                  <snm>Veneti</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Zabalou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Karr</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Savakis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bourtzis</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2003</pubdate>
            <volume>164</volume>
            <fpage>545</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1462605</pubid>
                  <pubid idtype="pmpid" link="fulltext">12807775</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The Wolbachia genome of Brugia malayi: endosymbiont evolution within a human pathogenic nematode</p>
            </title>
            <aug>
               <au>
                  <snm>Foster</snm>
                  <fnm>J</fnm>
                  <suf>Ganatra M, Kamal I, Ware J, Makarova K, Ivanova N, Bhattacharyya A, Kapatral V, Kumar S, Posfai J, Vincze T, Ingram J, Moran L, Lapidus A, Omelchenko M, Kyrpides N, Ghedin E, Wang S, Goltsman E, Joukov V, Ostrovskaya O, Tsukerman K, Mazur M, Comb D, Koonin E, Slatko B.</suf>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>e121</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1069646</pubid>
                  <pubid idtype="pmpid" link="fulltext">15780005</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0030121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Phylogenomics of the reproductive parasite Wolbachia pipientis wMel: a streamlined genome overrun by mobile genetic elements</p>
            </title>
            <aug>
               <au>
                  <snm>Wu</snm>
                  <fnm>M</fnm>
                  <suf>Sun LV, Vamathevan J, Riegler M, Deboy R, Brownlie JC, McGraw EA, Martin W, Esser C, Ahmadinejad N, Wiegand C, Madupu R, Beanan MJ, Brinkac LM, Daugherty SC, Durkin AS, Kolonay JF, Nelson WC, Mohamoud Y, Lee P, Berry K, Young MB, Utterback T, Weidman J, Nierman WC, Paulsen IT, Nelson KE, Tettelin H, O'Neill SL, Eisen JA</suf>
               </au>
            </aug>
            <source>PLOS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <fpage>E69</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">368164</pubid>
                  <pubid idtype="pmpid" link="fulltext">15024419</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020069</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Polymerases and the replisome: machines within machines</p>
            </title>
            <aug>
               <au>
                  <snm>Baker</snm>
                  <fnm>TA</fnm>
                  <suf>Bell SP</suf>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1998</pubdate>
            <volume>92</volume>
            <fpage>295&#8211;305</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0092-8674(00)80923-X</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Where does bacterial replication start? Rules for predicting the oriC region</p>
            </title>
            <aug>
               <au>
                  <snm>Mackiewicz</snm>
                  <fnm>P</fnm>
                  <suf>Zakrzewska-Czerwinska J, Zawilak A, Dudek MR, Cebrat S</suf>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>3781&#8211;3791</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/gkh699</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Identification of two origins of replication in the single chromosome of the archaeon Sulfolobus solfataricus</p>
            </title>
            <aug>
               <au>
                  <snm>Robinson</snm>
                  <fnm>NP</fnm>
                  <suf>Dionne I, Lundgren M, Marsh VL, Bernander R, Bell SD</suf>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>116</volume>
            <fpage>25&#8211;38</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0092-8674(03)01034-1</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Multiple origins of replication in archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Kelman</snm>
                  <fnm>LM</fnm>
                  <suf>Kelman Z</suf>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>12</volume>
            <fpage>399&#8211;401</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.tim.2004.07.001</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Initiation of chromosome replication</p>
            </title>
            <aug>
               <au>
                  <snm>Messer</snm>
                  <fnm>W</fnm>
                  <suf>Weigel C</suf>
               </au>
            </aug>
            <source>Escherichia coli and Salmonella Cellular and Molecular Biology, 2nd edition</source>
            <publisher>Washington, DC , ASM Press</publisher>
            <editor>Neidhardt, FC</editor>
            <pubdate>1996</pubdate>
            <fpage>1579</fpage>
            <lpage>1601</lpage>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Genetics and enzymology of DNA replication in Escherichia coli</p>
            </title>
            <aug>
               <au>
                  <snm>Baker</snm>
                  <fnm>TA</fnm>
                  <suf>Wickner SH</suf>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>1992</pubdate>
            <volume>26</volume>
            <fpage>447&#8211;477</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.ge.26.120192.002311</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The initiator protein DnaA: evolution, properties and function</p>
            </title>
            <aug>
               <au>
                  <snm>Skarstad</snm>
                  <fnm>K</fnm>
                  <suf>Boye E</suf>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>1994</pubdate>
            <volume>1217</volume>
            <fpage>111</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8110826</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The Obligate Human Pathogen, Neisseria gonorrhoeae, Is Polyploid</p>
            </title>
            <aug>
               <au>
                  <snm>Tobiason</snm>
                  <fnm>DM</fnm>
                  <suf>Seifert HS</suf>
               </au>
            </aug>
            <source>PLoS Biology</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <fpage>e185</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1470461</pubid>
                  <pubid idtype="pmpid" link="fulltext">16719561</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0040185</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Control of chromosome replication in Caulobacter crescentus</p>
            </title>
            <aug>
               <au>
                  <snm>Marczynski</snm>
                  <fnm>GT</fnm>
                  <suf>Shapiro L</suf>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>56</volume>
            <fpage>625</fpage>
            <lpage>656</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.56.012302.161103</pubid>
                  <pubid idtype="pmpid" link="fulltext">12142494</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Conserved gene cluster at replication origins of the alpha-proteobacteria Caulobacter crescentus and Rickettsia prowazekii</p>
            </title>
            <aug>
               <au>
                  <snm>Brassinga</snm>
                  <fnm>AKC</fnm>
                  <suf>Siam R, Marczynski GT</suf>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <fpage>1824&#8211;1829</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/JB.183.5.1824-1829.2001</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Global analysis of the genetic network controlling a bacterial cell cycle</p>
            </title>
            <aug>
               <au>
                  <snm>Laub</snm>
                  <fnm>MT</fnm>
                  <suf>McAdams HH, Fraser CM, Shapiro L</suf>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <fpage>2144&#8211;2148</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1126/science.290.5499.2144</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Negative control of bacterial DNA replication by a cell cycle regulatory protein that binds at the chromosome origin</p>
            </title>
            <aug>
               <au>
                  <snm>Quon</snm>
                  <fnm>KC</fnm>
                  <suf>Yang B, Domian IJ, Shapiro L, Marczynski GT</suf>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>120&#8211;125</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.95.1.120</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Asymmetric substitution patterns in the two DNA strands of bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Lobry</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1996</pubdate>
            <volume>13</volume>
            <fpage>660</fpage>
            <lpage>665</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">8676740</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Universal replication biases in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Rocha</snm>
                  <fnm>EP</fnm>
                  <suf>Danchin A, Viari A</suf>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>32</volume>
            <fpage>11</fpage>
            <lpage>16</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1999.01334.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10216855</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>A simple vectorial representation of DNA sequences for the detection of replication origins in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Lobry</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Biochimie</source>
            <pubdate>1996</pubdate>
            <volume>78</volume>
            <fpage>323</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0300-9084(96)84764-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">8905151</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The contributions of replication orientation, gene direction, and signal sequences to base-composition asymmetries in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Tillier</snm>
                  <fnm>ER</fnm>
                  <suf>Collins RA</suf>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>2000</pubdate>
            <volume>50</volume>
            <fpage>249</fpage>
            <lpage>257</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10754068</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Analyzing genomes with cumulative skew diagrams</p>
            </title>
            <aug>
               <au>
                  <snm>Grigoriev</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1998</pubdate>
            <volume>26</volume>
            <fpage>2286</fpage>
            <lpage>2290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147580</pubid>
                  <pubid idtype="pmpid" link="fulltext">9580676</pubid>
                  <pubid idtype="doi">10.1093/nar/26.10.2286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The replication-related organisation of bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Rocha</snm>
                  <fnm>EP</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2004</pubdate>
            <volume>150</volume>
            <fpage>1609</fpage>
            <lpage>1627</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/mic.0.26974-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15184548</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Complete genome sequencing of Anaplasma marginale reveals that the surface is skewed to two superfamilies of outer membrane proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Brayton</snm>
                  <fnm>KA</fnm>
                  <suf>Kappmeyer LS, Herndon DR, Dark MJ, Tibbals DL, Palmer GH, McGuire TC, Knowles DP</suf>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>844&#8211;849</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1073/pnas.0406656102</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The genome of the heartwater agent Ehrlichia ruminantium contains multiple tandem repeats of actively variable copy number</p>
            </title>
            <aug>
               <au>
                  <snm>Collins</snm>
                  <fnm>NE</fnm>
                  <suf>Liebenberg J, de Villiers EP, Brayton KA, Louw E, Pretorius A, Faber FE, van Heerden H, Josemans A, van Kleef M, Steyn HC, van Strijp MF, Zweygarth E, Jongejan F, Maillard JC, Berthier D, Botha M, Joubert F, Corton CH, Thomson NR, Allsopp MT, Allsopp BA</suf>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>838</fpage>
            <lpage>843</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545511</pubid>
                  <pubid idtype="pmpid" link="fulltext">15637156</pubid>
                  <pubid idtype="doi">10.1073/pnas.0406633102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Comparative genomics of emerging human ehrlichiosis agents</p>
            </title>
            <aug>
               <au>
                  <snm>Dunning Hotopp</snm>
                  <fnm>JC</fnm>
                  <suf>Lin M, Madupu R, Crabtree J, Angiuoli SV, Eisen J, Seshadri R, Ren Q, Wu M, Utterback TR, Smith S, Lewis M, Khouri H, Zhang C, Niu H, Lin Q, Ohashi N, Zhi N, Nelson W, Brinkac LM, Dodson RJ, Rosovitz MJ, Sundaram J, Daugherty SC, Davidsen T, Durkin AS, Gwinn M, Haft DH, Selengut JD, Sullivan SA, Zafar N, Zhou L, Benahmed F, Forberger H, Halpin R, Mulligan S, Robinson J, White O, Rikihisa Y, Tettelin H.</suf>
               </au>
            </aug>
            <source>PLoS Genet</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>e21</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1366493</pubid>
                  <pubid idtype="pmpid" link="fulltext">16482227</pubid>
                  <pubid idtype="doi">10.1371/journal.pgen.0020021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Conserved response regulator CtrA and IHF binding sites in the a-proteobacteria Caulobacter crescentus and Rickettsia prowazekii chromosomal replication origins</p>
            </title>
            <aug>
               <au>
                  <snm>Brassinga</snm>
                  <fnm>AKC</fnm>
                  <suf>Siam R, McSween W, Winkler H, Wood D, Marczynski GT</suf>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>5789&#8211;5799</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/JB.184.20.5789-5799.2002</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Duplex opening by dnaA protein at novel sequences in initiation of replication at the origin of the E. coli chromosome</p>
            </title>
            <aug>
               <au>
                  <snm>Bramhill</snm>
                  <fnm>D</fnm>
                  <suf>Kornberg A</suf>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1988</pubdate>
            <volume>52</volume>
            <fpage>743</fpage>
            <lpage>755</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(88)90412-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">2830993</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The DNA unwinding element: a novel, cis-acting component that facilitates opening of the Escherichia coli replication origin</p>
            </title>
            <aug>
               <au>
                  <snm>Kowalski</snm>
                  <fnm>D</fnm>
                  <suf>Eddy MJ</suf>
               </au>
            </aug>
            <source>EMBO Journal</source>
            <pubdate>1989</pubdate>
            <volume>8</volume>
            <fpage>4335</fpage>
            <lpage>4344</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">401646</pubid>
                  <pubid idtype="pmpid">2556269</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>CBS domains in ClC chloride channels implicated in myotonia and nephrolithiasis (kidney stones)</p>
            </title>
            <aug>
               <au>
                  <snm>Ponting</snm>
                  <fnm>CP</fnm>
               </au>
            </aug>
            <source>J Mol Med</source>
            <pubdate>1997</pubdate>
            <volume>75</volume>
            <fpage>160</fpage>
            <lpage>3</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s001090050166</pubid>
                  <pubid idtype="pmpid" link="fulltext">9106071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Profile hidden Markov models</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>755&#8211;763</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/bioinformatics/14.9.755</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
                  <suf>Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ</suf>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389&#8211;3402</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Serendipitous discovery of Wolbachia genomes in multiple Drosophila species</p>
            </title>
            <aug>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
                  <suf>Dunning Hotopp JC, Delcher AL, Pop M, Smith DR, Eisen MB Nelson WC</suf>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>R23</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1088942</pubid>
                  <pubid idtype="pmpid"