<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-9-79</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>Ori-Finder: A web-based system for finding <it>oriC</it>s in unannotated bacterial genomes</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Gao</snm>
               <fnm>Feng</fnm>
               <insr iid="I1"/>
               <email>fgao@tju.edu.cn</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Zhang</snm>
               <fnm>Chun-Ting</fnm>
               <insr iid="I1"/>
               <email>ctzhang@tju.edu.cn</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Physics, Tianjin University, Tianjin 300072, China</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>79</fpage>
         <url>http://www.biomedcentral.com/1471-2105/9/79</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18237442</pubid>
               <pubid idtype="doi">10.1186/1471-2105-9-79</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>30</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>01</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>01</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Gao and Zhang; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Chromosomal replication is the central event in the bacterial cell cycle. Identification of replication origins (<it>oriC</it>s) is necessary for almost all newly sequenced bacterial genomes. Given the increasing pace of genome sequencing, the current available software for predicting <it>oriC</it>s, however, still leaves much to be desired. Therefore, the increasing availability of genome sequences calls for improved software to identify <it>oriC</it>s in newly sequenced and unannotated bacterial genomes.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have developed Ori-Finder, an online system for finding <it>oriC</it>s in bacterial genomes based on an integrated method comprising the analysis of base composition asymmetry using the <it>Z</it>-curve method, distribution of DnaA boxes, and the occurrence of genes frequently close to <it>oriC</it>s. The program can also deal with unannotated genome sequences by integrating the gene-finding program ZCURVE 1.02. Output of the predicted results is exported to an HTML report, which offers convenient views on the results in both graphical and tabular formats.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>A web-based system to predict replication origins of bacterial genomes has been presented here. Based on this system, <it>oriC </it>regions have been predicted for the bacterial genomes available in GenBank currently. It is hoped that Ori-Finder will become a useful tool for the identification and analysis of <it>oriC</it>s in both bacterial and archaeal genomes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>With the increasing availability of complete genome sequences, it has become clear that an essential factor influencing bacterial proliferation is the organization of the <it>oriC </it>region. In bacteria, chromosome replication initiates at a single chromosome locus, called the replication origin (<it>oriC</it>), from which replication proceeds bidirectionally to the terminus. At the beginning of replication, ATP binds DnaA, resulting in a large oligomeric complex consisting of DnaA monomers bound to a series of 9-mer consensus elements termed DnaA boxes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. As a general rule, bacterial chromosomes are characterized by one bidirectional origin of replication and one terminus that define the boundaries of replichores <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. The replication process is asymmetric with respect to the two DNA strands from the fact that the leading strand is synthesized continuously, whereas the lagging strand is synthesized as Okazaki fragments. DNA strand asymmetry can also be described in terms of gene content and nucleotide composition. Moreover, all rRNA genes, the majority of ribosomal protein coding genes and essential genes are mainly transcribed in the same direction as DNA replication (leading strand) <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Therefore, identifying the <it>oriC</it>s of bacteria is one of the key steps not only in understanding the mechanisms of bacterial replication but also in gaining insight into gene distribution and function of these bacterial genomes.</p>
         <p>The pioneer work to identify <it>oriC</it>s <it>in silico </it>is the GC-skew analysis, which is deemed as a routine method <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Based on the GC-skew analysis, Frank and Lobry also developed Oriloc, a program for predicting bacterial <it>oriC</it>s <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Meanwhile, an oligomer-skew method was also proposed to predict <it>oriC </it>regions in bacterial genomes <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, and later used to identify <it>oriC</it>s in 200 bacterial or/and archaeal genomes <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Use of GC-skew analysis together with location of the <it>dnaA </it>gene and distribution of DnaA boxes led to more accurate prediction of <it>oriC </it>regions <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>.</p>
         <p>Identification of <it>oriC</it>s is necessary for almost all newly sequenced bacterial genomes. Given the increasing pace of genome sequencing, the currently available software for predicting <it>oriC</it>s, however, still leaves much room to be improved. For instance, 1) Most, if not all, available software only predicts one approximate position instead of the boundaries of <it>oriC</it>s. Since the <it>oriC </it>is an intergenic region rather than one site, it is necessary to present <it>oriC </it>regions with definite boundaries. 2) Most software does not provide the distribution of species-specific DnaA boxes, which are strong indicators of <it>oriC</it>s. 3) No automated web server is available so far, e.g., Oriloc, in which the program can be downloaded and run locally by users. Web-based tools are more convenient, since they are platform independent and there is no need to download and install the software. 4) Only the GC-skew and/or AT-skew are provided. In addition to G/C and A/T bases, R/Y and M/K bases can also show asymmetrical distributions around <it>oriC</it>s. It is more informative to show all the base composition asymmetries than using GC and/or AT-skews alone.</p>
         <p>To identify <it>oriC </it>regions of unannotated bacterial genomes, we have developed an online tool, Ori-Finder, based on an integrated method comprising gene identification, analysis of base composition asymmetry using the <it>Z</it>-curve method, distribution of DnaA boxes, occurrence of genes frequently close to <it>oriC</it>s and phylogenetic relationships. Output results that contain detailed information of predicted <it>oriC</it>s, DnaA box distributions, base composition disparity curves, and coordinates of identified genes by ZCURVE 1.02, are exported to an HTML report, which offers convenient views in both graphical and tabular formats.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>The web server Ori-Finder is implemented on Apache server and the web interface is designed using CGI (Common Gateway Interface) Perl scripts. The algorithms to predict <it>oriC </it>regions of bacterial genomes <it>in silico </it>are complemented with the language of C++. The output graphs are generated by gnuplot graphic routine <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         <p>The software kit consists of the following programs.</p>
         <p>(1) The program to calculate the coordinates of the RY, MK, GC and AT disparity curves <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>(2) The program to calculate the distribution of DnaA boxes for the input genome sequence.</p>
         <p>(3) The program to assign and rank the priority of every intergenic sequence.</p>
         <p>(4) The gene-finding program ZCURVE 1.02 <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>(5) The Perl script integrating the above programs into one system.</p>
         <sec>
            <st>
               <p>Input to sever</p>
            </st>
            <p>Ori-Finder has a user-friendly and intuitive input interface. Users can choose to paste the sequence into the input box or upload the sequence (FASTA format) in a file. Additionally, the server requires the specification of some optional parameters listed as follows.</p>
            <p>(1) Select 'species-specific' DnaA boxes according to the species. It defaults to the <it>Escherichia coli </it>perfect DnaA box (TTATCCACA). In addition, 'species-specific' DnaA boxes have also been provided, and some of them are proposed based on the phylogenetic relationships, which have not been reported previously, such as 'TTTTCCACA' for <it>Chlamydiae</it>, 'TTATCGAAA' for <it>Dehalococcoides </it>and 'TGTTTCACG' for <it>Bradyrhizobiaceae </it>etc. For example, the motif 'TGTTTCACG' is proposed to server as DnaA boxes for the genomes of the family <it>Bradyrhizobiaceae </it>(including <it>Rhodopseudomonas</it>, <it>Nitrobacter </it>and <it>Bradyrhizobium</it>). Currently, ten genomes of <it>Bradyrhizobiaceae </it>have been fully sequenced. Note that the predicted <it>oriC </it>regions of them are all closely next to <it>gidA </it>gene as well as the switch of GC disparity curve, and all contain the palindromic sequence 'TGTTTCACGTGAAACA', which is composed of two proposed DnaA boxes 'TGTTTCACG'. Consequently, the 'species-specific' DnaA boxes proposed by us are consistent with the phylogenetic relationships. Users can also choose to search for the DnaA boxes defined themselves. By default, only nonamers differing in no more than one position from the selected DnaA boxes are scanned, but this can be changed according to the requirements of users.</p>
            <p>(2) Whether to upload a protein table file (".ptt" file) containing the coordinates of genes in the input genome sequence. The ".ptt" file is a table of protein features, which is usually provided by GenBank in NCBI's RefSeq ftp site <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Users can choose to download the ".ptt" file provided by GenBank or produce their own genome annotation that has the same format as the ".ptt" file according to their requirements. For Ori-Finder, if no ".ptt" file is uploaded, the gene-finding program ZCURVE 1.02 will be called to generate the gene list. In general, ZCURVE 1.02 or other gene-finding software alone couldn't annotate gene function. To obtain the ".ptt" file with annotation of genes related to <it>oriC</it>s, a database of indicator genes (such as <it>dnaA</it>, <it>dnaN</it>, <it>hemE</it>, <it>gidA </it>etc) has been collected by us, and BLAST program has also been installed locally for users' convenience (click 'Database' on the home page for more detail). Therefore, users can Blast the query sequences and find the homologues of these genes in the sequences.</p>
            <p>(3) Whether to display RY, MK, GC or AT disparity curves and the DnaA box distribution in the output graphs. It defaults to displaying all of them in the output graphs.</p>
         </sec>
         <sec>
            <st>
               <p>Output from the sever</p>
            </st>
            <p>The output web page shows the process of Ori-Finder, and provides links to the output results by Ori-Finder: (i) the genome size, GC content, location, length, DnaA box number, the motif of DnaA boxes, the conserved genes adjacent to the predicted <it>oriC</it>s, the precise coordinates of extremes of the four disparity curves and the sequence of identified <it>oriC </it>regions as an HTML table; (ii) the DnaA box distribution and the coordinates of the four disparity curves as text files; (iii) the integrated plot for the original sequence and (iv) that for the rotated sequence to display the obtained results, such as general genome information, the four disparity curves, distribution of DnaA boxes, locations of indicator genes and <it>oriC </it>regions in PNG format. A typical example of the output results obtained by Ori-Finder is shown in Figure <figr fid="F1">1</figr>. If no ".ptt" file is uploaded, the gene list file generated by ZCURVE1.02 will also be outputted. Users can do a Blast search using each output result by Ori-Finder against DoriC, a database of <it>oriC </it>regions <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, to confirm the reliability of the prediction. Refer to Figure <figr fid="F2">2</figr> for the complete procedure of Ori-Finder to identify <it>oriC </it>regions. The output results by Ori-Finder for the newly sequenced genomes in GenBank are also available at our website.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>A typical example of the output results obtained by Ori-Finder</p>
               </caption>
               <text>
                  <p><b>A typical example of the output results obtained by Ori-Finder</b>. (A) The information of genome and <it>oriC </it>region for <it>Escherichia coli </it>K12 as an HTML table. (B) The integrated plot as a PNG figure for the original sequence and (C) the integrated plot as a PNG figure for the rotated sequence, displaying the obtained results, such as general genome information, four disparity curves, distribution of DnaA boxes, locations of putative indicator genes and the predicted <it>oriC </it>region. Note that the coordinate origin of the rotated sequence begins and ends in the maximum of the GC disparity curve.</p>
               </text>
               <graphic file="1471-2105-9-79-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>The procedure of Ori-Finder to identify <it>oriC </it>regions</p>
               </caption>
               <text>
                  <p><b>The procedure of Ori-Finder to identify <it>oriC </it>regions</b>. Flow chart schematically showing the procedure to identify <it>oriC </it>regions by Ori-Finder.</p>
               </text>
               <graphic file="1471-2105-9-79-2"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>Based on this online system, we have predicted <it>oriC </it>regions for the bacterial genomes available in GenBank currently. To estimate the accuracy of the predicted <it>oriC</it>s by Ori-Finder, a simple and straight way is to compare the predicted <it>oriC</it>s with those confirmed by experimental data. Consequently, the experimental confirmed <it>oriC</it>s so far are in accordance with those predicted by Ori-Finder <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. In addition, other lines of evidence from <it>oriC </it>types, the coherence of all the three methods (i.e., typical base composition asymmetry, DnaA box distribution and indicator genes positions), and comparative genomic analysis, have shown the high reliability of the predicted <it>oriC</it>s by Ori-Finder.</p>
         <p>In what follows we will discuss the unique merits of the present method, showing the evidence why this method is superior to other methods currently available. First of all, the <it>in silico </it>methods currently available to identify <it>oriC</it>s include the GC-skew analysis, and the oligomer-skew method etc. Jointly using the three methods (GC-skew, location of the <it>dnaA </it>gene and distribution of DnaA boxes) resulted in better prediction of <it>oriC </it>regions by Mackiewicz <it>et al </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Although DnaA and its binding sites are well conserved throughout the bacterial kingdom, replication origins from different species show considerable diversity in terms of the number, arrangement and even the consensus sequence of DnaA boxes. In addition, genes adjacent to <it>oriC</it>s include not only <it>dnaA </it>gene itself but also <it>dnaN</it>, <it>hemE</it>, <it>gidA </it>or <it>hemB </it>etc. The information mentioned above has all been taken into consideration by the present approach, such as 'species-specific' DnaA boxes, possible genes adjacent to <it>oriC</it>s and phylogenetic relationships. However, by analyzing the GC-skew, Blasting for the <it>dnaA </it>gene and searching for DnaA boxes, the <it>oriC</it>s for only ~60% of genomes can be predicted. For example, the <it>oriC</it>s were identified for only 76 chromosomes out of the 120 chromosomes (76/120 = 63.3%) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, which has also been confirmed again by our result for 568 chromosomes available currently in GenBank. Out of the 568 chromosomes, there are only 342 chromosomes (342/568 = 60.2%) whose <it>oriCs </it>are adjacent to <it>dnaA </it>gene and near the switch of GC-skew. <it/><it/>Second, the present work can find <it>oriC </it>regions with definite boundaries with the accuracy of single base pair. Consequently, the integrated method presented in this paper outperforms those jointly applying the three methods mentioned above, as reflected by the fact that using the integrated method, the <it>oriC </it>regions can be identified for over 98% chromosomes out of the 568 bacterial chromosomes analyzed. For example, the genome of <it>Synechococcus </it>sp. JA-3-3Ab (<ext-link ext-link-type="gen" ext-link-id="NC_007775">NC_007775</ext-link>) does not exhibit any typical GC skew pattern. Therefore, based on the location of the <it>dnaA </it>gene, the origin is tentatively assigned to the <it>rpmH</it>-<it>dnaA </it>intergenic region <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. However, in the family of <it>Synechococcus </it>genomes the <it>oriC </it>regions are closely adjacent to <it>dnaN </it>gene rather than <it>dnaA </it>gene and the DnaA box motif is 'TTTTCCACA' rather than 'TTATCCACA'. Here, by taking the above information into account, the <it>oriC </it>of <it>Synechococcus </it>sp. JA-3-3Ab can be easily predicted by Ori-Finder. For another example, applying the methods based on GC-skew, <it>dnaA </it>gene location and <it>E. coli </it>perfect DnaA box distribution to the genome of <it>Synechococcus </it>sp. WH 8102 (<ext-link ext-link-type="gen" ext-link-id="NC_005070">NC_005070</ext-link>), each of the three methods identifies a different region, respectively <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Consequently, none <it>oriC </it>can be identified for this genome <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, whereas a reliable <it>oriC </it>for this genome has been identified by the integrated method Ori-Finder.</p>
         <p>Finally, we should point out the limitation of the present method. Note that the replication origins from different species show considerable diversity and sometimes annotation mistakes affect the performance. Therefore, the present method couldn't solve the problem of identifying the <it>oriC </it>for every bacterial genome currently available, of course, nor could other methods. Accordingly, <it>oriC</it>s for only ~2% of 568 chromosomes remain unresolved currently using the integrated method presented here, whereas ~40% of 568 genomes cannot be solved using only the information of GC-skew, location of the <it>dnaA </it>gene and distribution of DnaA boxes. However, for the chromosomes whose <it>oriC</it>s remain unresolved by Ori-Finder, the distribution of DnaA boxes and the extremes of GC (AT, RY and MK) disparity curves are still provided by Ori-Finder, which are helpful to obtain the approximate position of <it>oriC </it>region.</p>
         <p>For the convenience of users' query, the predicted <it>oriC </it>regions have been organized into a MySQL database, called DoriC, which is freely available online <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>With the availability of an increasing number of bacterial genomes, the prediction will be more accurate and reliable since the DnaA boxes or genes frequently close to <it>oriC</it>s can be analyzed by comparative genomics. Currently, the web server Ori-Finder is designed only for the identification of <it>oriC</it>s in bacterial genomes. However, the architecture of the system is also suitable for archaeal genomes by changing DnaA box into ORB (origin recognition boxes) elements. The current version contains an input option by which users can define their own DnaA boxes or ORB elements. It is hoped that Ori-Finder will become a useful tool for the identification and analysis of <it>oriC</it>s in both bacterial and archaeal genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p><b>Project name: </b>Ori-Finder</p>
         <p>
            <b>Project homepage: <url>http://tubic.tju.edu.cn/Ori-Finder</url></b>
         </p>
         <p><b>Operating systems: </b>platform independent</p>
         <p><b>Programming language: </b>C++</p>
         <p><b>License: </b>Freely available</p>
         <p><b>Any restrictions on use by non-academics: </b>None</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>FG designed the computer program and drafted the manuscript. CTZ supervised the study and revised the manuscript. Both authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Dr. Ren Zhang for invaluable assistance. The present work was supported in part by NNSF of China (Grant No. 90408028).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Origins of DNA replication in the three domains of life</p>
            </title>
            <aug>
               <au>
                  <snm>Robinson</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>SD</fnm>
               </au>
            </aug>
            <source>The FEBS journal</source>
            <pubdate>2005</pubdate>
            <volume>272</volume>
            <issue>15</issue>
            <fpage>3757</fpage>
            <lpage>3766</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1742-4658.2005.04768.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16045748</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>A new method for assessing the effect of replication on DNA base composition asymmetry</p>
            </title>
            <aug>
               <au>
                  <snm>Necsulea</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lobry</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Molecular biology and evolution</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <issue>10</issue>
            <fpage>2169</fpage>
            <lpage>2179</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/molbev/msm148</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>A simple vectorial representation of DNA sequences for the detection of replication origins in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Lobry</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Biochimie</source>
            <pubdate>1996</pubdate>
            <volume>78</volume>
            <issue>5</issue>
            <fpage>323</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0300-9084(96)84764-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">8905151</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Analyzing genomes with cumulative skew diagrams</p>
            </title>
            <aug>
               <au>
                  <snm>Grigoriev</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>1998</pubdate>
            <volume>26</volume>
            <issue>10</issue>
            <fpage>2286</fpage>
            <lpage>2290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147580</pubid>
                  <pubid idtype="pmpid" link="fulltext">9580676</pubid>
                  <pubid idtype="doi">10.1093/nar/26.10.2286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Oriloc: prediction of replication boundaries in unannotated bacterial chromosomes</p>
            </title>
            <aug>
               <au>
                  <snm>Frank</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Lobry</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>6</issue>
            <fpage>560</fpage>
            <lpage>561</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.6.560</pubid>
                  <pubid idtype="pmpid" link="fulltext">10980154</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Skewed oligomers and origins of replication</p>
            </title>
            <aug>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1998</pubdate>
            <volume>217</volume>
            <issue>1&#8211;2</issue>
            <fpage>57</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(98)00374-6</pubid>
                  <pubid idtype="pmpid">9795135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Origin of replication in circular prokaryotic chromosomes</p>
            </title>
            <aug>
               <au>
                  <snm>Worning</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jensen</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Hallin</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Staerfeldt</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Ussery</snm>
                  <fnm>DW</fnm>
               </au>
            </aug>
            <source>Environmental microbiology</source>
            <pubdate>2006</pubdate>
            <volume>8</volume>
            <issue>2</issue>
            <fpage>353</fpage>
            <lpage>361</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1462-2920.2005.00917.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16423021</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Where does bacterial replication start? Rules for predicting the <it>oriC </it>region</p>
            </title>
            <aug>
               <au>
                  <snm>Mackiewicz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zakrzewska-Czerwinska</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zawilak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dudek</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Cebrat</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>13</issue>
            <fpage>3781</fpage>
            <lpage>3791</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">506792</pubid>
                  <pubid idtype="pmpid" link="fulltext">15258248</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Gnuplot homepage</p>
            </title>
            <url>http://www.gnuplot.info</url>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Identification of replication origins in archaeal genomes based on the <it>Z</it>-curve method</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Archaea</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>5</issue>
            <fpage>335</fpage>
            <lpage>346</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15876567</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>ZCURVE: a new system for recognizing protein-coding genes in bacterial and archaeal genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Guo</snm>
                  <fnm>FB</fnm>
               </au>
               <au>
                  <snm>Ou</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>6</issue>
            <fpage>1780</fpage>
            <lpage>1789</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152858</pubid>
                  <pubid idtype="pmpid" link="fulltext">12626720</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg254</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>NCBI's RefSeq ftp site</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/genomes</url>
         </bibl>
         <bibl id="B13">
            <title>
               <p>DoriC: a database of <it>oriC </it>regions in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Gao</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>CT</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>14</issue>
            <fpage>1866</fpage>
            <lpage>1867</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm255</pubid>
                  <pubid idtype="pmpid" link="fulltext">17496319</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Effect of temperature and light on growth of and photosynthesis by Synechococcus isolates typical of those predominating in the octopus spring microbial mat community of Yellowstone National Park</p>
            </title>
            <aug>
               <au>
                  <snm>Allewalt</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Bateson</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Revsbech</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Slack</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Applied and environmental microbiology</source>
            <pubdate>2006</pubdate>
            <volume>72</volume>
            <issue>1</issue>
            <fpage>544</fpage>
            <lpage>550</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/AEM.72.1.544-550.2006</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>DoriC</p>
            </title>
            <url>http://tubic.tju.edu.cn/doric</url>
         </bibl>
      </refgrp>
   </bm>
</art>
