<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-5-4</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Database</dochead>
      <bibl rating="0">
         <title>
            <p>Identification of polymorphic tandem repeats by direct comparison of genome sequence from different bacterial strains : a web-based resource</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="no" pa="no" da="no">
               <snm>Den&#339;ud</snm>
               <fnm>France</fnm>
               <insr iid="I1"/>
               <email>France.Denoeud@igmors.u-psud.fr</email>
            </au>
            <au id="A2" ca="no" ce="no" pa="no" da="no">
               <snm>Vergnaud</snm>
               <fnm>Gilles</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>Gilles.Vergnaud@igmors.u-psud.fr</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Laboratoire GPMS, Institut de G&#233;n&#233;tique et Microbiologie, Bat 400, Universit&#233; Paris-Sud, 91405 Orsay cedex, France</p>
            </ins>
            <ins id="I2">
               <p>Centre d'Etudes du Bouchet, BP3, 91710 Vert le Petit, France</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2004</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>4</fpage>
         <url>http://www.biomedcentral.com/1471-2105/5/4</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid" link="fulltext">14715089</pubid>
               <pubid idtype="doi" link="fulltext">10.1186/1471-2105-5-4</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>24</day>
               <month>9</month>
               <year>2003</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>12</day>
               <month>1</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>12</day>
               <month>1</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Den&#339;ud and Vergnaud; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Polymorphic tandem repeat typing is a new generic technology which has been proved to be very efficient for bacterial pathogens such as <it>B. anthracis</it>, <it>M. tuberculosis</it>, <it>P. aeruginosa</it>, <it>L. pneumophila</it>, <it>Y. pestis</it>. The previously developed tandem repeats database takes advantage of the release of genome sequence data for a growing number of bacteria to facilitate the identification of tandem repeats. The development of an assay then requires the evaluation of tandem repeat polymorphism on well-selected sets of isolates. In the case of major human pathogens, such as <it>S. aureus</it>, more than one strain is being sequenced, so that tandem repeats most likely to be polymorphic can now be selected <it>in silico </it>based on genome sequence comparison.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In addition to the previously described general Tandem Repeats Database, we have developed a tool to automatically identify tandem repeats of a different length in the genome sequence of two (or more) closely related bacterial strains. Genome comparisons are pre-computed. The results of the comparisons are parsed in a database, which can be conveniently queried over the internet according to criteria of practical value, including repeat unit length, predicted size difference, etc. Comparisons are available for 16 bacterial species, and the orthopox viruses, including the variola virus and three of its close neighbors.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>We are presenting an internet-based resource to help develop and perform tandem repeats based bacterial strain typing. The tools accessible at <url>http://minisatellites.u-psud.fr</url> now comprise four parts. The Tandem Repeats Database enables the identification of tandem repeats across entire genomes. The Strain Comparison Page identifies tandem repeats differing between different genome sequences from the same species. The "Blast in the Tandem Repeats Database" facilitates the search for a known tandem repeat and the prediction of amplification product sizes. The "Bacterial Genotyping Page" is a service for strain identification at the subspecies level.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Molecular epidemiology, the integration of molecular typing and conventional epidemiological studies, is likely to add significant value to analyses of infections caused by pathogenic bacteria (see <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> for review). Multilocus Sequence Typing (MLST) for instance is now a major reference method for the molecular epidemiology of <it>Neisseria meningitidis </it>and other human pathogens <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. In this kind of assay, a set of typically 7 genes is partially sequenced, and the resulting data is converted into sequence types, which can be easily stored in databases, and compared to others. However a number of significant pathogens, including <it>M. tuberculosis </it><abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, <it>B. anthracis </it>and <it>Y. pestis </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp> are not amenable to this approach, because of the recent emergence of these pathogens and the resulting rarity of sequence variations. In these pathogens, tandem repeats (TRs) are a source of very informative markers for strain genotyping <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Tandem repeats in pathogenic bacteria were initially identified within genes associated with bacterial virulence <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. In other instances, the contribution of tandem repeats to genome polymorphism was established after extensive searches based for instance on AFLP (amplified fragment length polymorphism) profiling. This is well illustrated by <it>B. anthracis</it>, in which polymorphic bands in AFLP patterns <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> were subsequently demonstrated by sequencing to be due to tandem repeat variations <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Eventually, some of these tandem repeats have been shown to directly contribute to phenotypic variations of the <it>B. anthracis </it>exosporium which makes the outer layer of the spores <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The frequent observation that tandem repeat-containing genes are often associated with outer membrane proteins suggests that such genes help bacteria adapt to their environment, and may be to some extent mutation hotspots as a result of positive selection.</p>
         <p>Polymorphic tandem repeats (VNTRs, for Variable Number of Tandem Repeats), once identified, provide convenient tools requiring ordinary molecular biology equipment and the data can be easily exchanged and compared. The resulting assay, called MLVA (for multiple locus VNTR analysis) can even be automated <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. We have developed tools which facilitate the bioinformatics step of genome analysis required to start a project. A previously described Tandem Repeats Database enables the identification of tandem repeats across entire genomes <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. It has been constantly updated, with now more than a hundred bacterial genomes available, compared to 35 at the onset of the database. We present here a new and major development of this resource which takes advantage of the fact that more than two different strains from the same species have now been sequenced at least for a number of major human pathogens. As a result, the tools accessible over the Internet at <url>http://minisatellites.u-psud.fr</url> now comprise four complementary parts. The newly added resource, the Strain Comparison Page, takes advantage of the availability of genome sequences from more than one strain from a growing number of species to directly identify tandem repeats differing between the sequenced strains. This is of interest because the vast majority of tandem repeats is often not polymorphic <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The "Blast in the Tandem Repeats Database" page facilitates the search for a known tandem repeat, the prediction of PCR amplification products size, and the verification of primer specificity. Once an MLVA assay has been set up, and carefully validated by typing collections of isolates, it is relatively easy to construct databases of genotypes to be used locally or which can be queried across the Internet. The "Bacterial Genotyping Page" illustrates a freely accessible, fast and easy to use internet-based service for strain comparisons, in which a user can compare a genotype produced for one of his isolates to the existing data.</p>
      </sec>
      <sec>
         <st>
            <p>Construction and content</p>
         </st>
         <sec>
            <st>
               <p>The Tandem Repeats Database main page</p>
            </st>
            <p>Tandem repeats were identified from finished microbial genome sequences (as listed by the Genome OnLine Database <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>) using the tandem repeats finder (TRF) software <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> with the following options: alignment parameters, "2,3,5" (these parameters are the less stringent ones), minimum alignment score to report repeat, 50 (this score allows to detect short structures), maximum period size, 500 base-pairs. When the program reported redundant (overlapping) repeats, the redundancy was eliminated as described in <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, before import in the database. The database uses Microsoft Access 2000 and the querying process uses Active Server Pages (ASP, Microsoft) with Perlscripts or VBscripts. Perl was obtained from the ActiveState Programmer Network <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The database is hosted on a server running under Windows 2000 server (Microsoft). The tandem repeats database main page is described in more detail in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>The Strain Comparison page</p>
            </st>
            <p>Sequence comparisons used BLAST <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The BLAST software was obtained from the NCBI FTP site <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The flanking sequences of TRs from one strain were compared to the whole sequence of the other strain (and reciprocally, to avoid missing some tandem repeats that would not appear in the tandem repeats database for one strain because they were not detected by the Tandem Repeats Finder <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> -for instance because there is only one copy of the repeated unit in the considered strain). The resulting list of matching tandem repeats was then imported in the database, where it can be queried. The comparison of more than two strains was made possible through a supplemental step before import in the database: the synthesis of several 2-strains comparisons, of the same "reference" strain against each of the others (matching between TRs of the different strains was deduced from the positions on the reference strain).</p>
         </sec>
         <sec>
            <st>
               <p>The Blast page</p>
            </st>
            <p>The Blast Page allows users to run BLAST <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> in the tandem repeats and flanking sequences from the database via Perlscripts. The Blast outputs are linked to the database, in order to easily obtain the description of identified tandem repeats.</p>
         </sec>
         <sec>
            <st>
               <p>The Bacterial Genotyping page</p>
            </st>
            <p>The web-page site performing identifications was developed using the BNserver application (version 3.0, Applied-Maths, Belgium) and ASP (Microsoft) using Perlscript. The typing results (gel images and resulting data) were managed using the Bionumerics software package as described in <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The output of a query is a list of strains and genotypes from the database together with similarity scores.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Utility</p>
         </st>
         <sec>
            <st>
               <p>The procedure to find polymorphic tandem repeats (TRs) for use in strain typing</p>
            </st>
            <p>Figure <figr fid="F1">1</figr> shows the steps leading from a genome sequence to the exploitation of polymorphic tandem repeats for bacterial strain genotyping. Although Tandem Repeats are easily identified using the Tandem Repeats Database, TR polymorphism must be evaluated by typing across a set of relevant strains. If the sequences of several strains of the species of interest are available, the Strain Comparison Page can be used to directly identify tandem repeats predicted to be polymorphic in size between the two (or more) sequenced strains. However, it is important to keep in mind that the tandem repeats predicted as being polymorphic will depend on the sequenced strains and well-planned surveys of isolates will still be necessary. The available tools do not replace this validation step, as the value of each marker must be carefully established on an appropriate set of isolates. The definition of an appropriate set of isolates depends upon the question which is being addressed, <it>i.e. </it>large scale or local epidemiology. The Blast Page has been implemented in the tandem repeats database in order to easily determine the size of the expected PCR amplification products. The database is also manually updated to contain PCR conditions as well as polymorphism index, and links to the original reports <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> (input from users is welcome). Eventually, when an MLVA assay has been fully developed and validated, typing data can be made accessible so that individual queries can be run. The Bacterial Genotyping Page illustrates how this could work. The genotyping data for a strain can be entered and submitted via this page. The output is the description of the closest strains. The data which has been submitted is not incorporated in the database itself, since this would require stringent data validation steps. In the following sections, we are presenting the web-based resources associated with this procedure.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The procedure to find polymorphic tandem repeats for use in strain typing</p>
               </caption>
               <text>
                  <p>The procedure to find polymorphic tandem repeats for use in strain typing. The steps leading from the release of a complete (or incomplete) genome sequence to the validation of new polymorphic markers are described. The purpose of the web-based tools developed is to facilitate the bioinformatics data-management steps.</p>
               </text>
               <graphic file="1471-2105-5-4-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The "Strain Comparison" pages</p>
            </st>
            <p>The strain comparison pages are available via <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The comparison of two strains is based on a pre-computed BLAST <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> analysis of the flanking sequences of tandem repeats from one strain against the other, and vice-versa. Figure <figr fid="F2">2</figr> summarizes the results of this first step for 23 comparisons. Three indexes are scored (see figure legend): (1) the "mean %identity" between the flanking sequences is a measure of single nucleotide polymorphism (SNPs) frequency (not insertions-deletions), (2) the proportion (%) of flanking sequences that matched the flanking sequence of its homologue in the other strain on more than half of the 500 bp assayed here &#8211; <it>i.e. </it>that were not rearranged, by insertion of mobile elements for instance -, (3) the proportion (%) of tandem repeats that were found to be of a different length between the two strains being compared. In addition, the positions of matching tandem repeats in the two genomes is plotted to reveal large-scale genome rearrangements. A number of situations are observed: for instance <it>Yersinia pestis </it>orientalis strain CO-92 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, and medievalis strain KIM5 P12 <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> show a very high "mean %identity" (99.96 %), in agreement with the recent emergence of <it>Yersinia pestis </it><abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. In spite of this, the two strains differ by a high number of large rearrangements (as seen on the plot), which reflects the high genome plasticity observed in this species <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, together with a relatively high rate of polymorphic tandem repeats (8.47%). In contrast, <it>Listeria monocytogenes </it>strain EGD-e and <it>Listeria innocua </it>strain Clip 11262 have a lower homology (90.19%) and only 3.99% of polymorphic tandem repeats in spite of the evolutionary distance (see Figure <figr fid="F2">2</figr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Comparison of strains using different indexes</p>
               </caption>
               <text>
                  <p><b>Comparison of strains using different indexes. </b>The four columns correspond to (from left to right): (1) mean %identity provided by BLAST when the match occurred on more than half the length of the 500 bp of submitted flanking sequence ; (2) proportion (%) of flanking sequences that matched on more than half their length between the two strains ; (3) proportion (%) of tandem repeats of a different size in the two strains ; and (4) plot of the positions of homologous tandem repeat loci in the two genomes which indirectly reflects large scale genome rearrangements. Species are listed according to the first index (mean %identity)</p>
               </text>
               <graphic file="1471-2105-5-4-2"/>
            </fig>
            <p>The strain comparison page allows queries in the tandem repeats database according to the tandem repeat length difference between the two strains compared, and also to other tandem repeats characteristics (unit length, copy number, etc...). Figure <figr fid="F3">3</figr> illustrates a query done for <it>Mycobacterium tuberculosis </it>strains H37Rv and CDC1551 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>: the query "length difference &#8805; 5 bp" identifies 58 tandem repeats (8 are shown on Figure <figr fid="F3">3</figr>). This prediction has been tested for the 30 loci amenable to PCR analysis and polymorphism has been confirmed in all cases <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Example of a query in the Strain Comparison Page</p>
               </caption>
               <text>
                  <p>Example of a query in the Strain Comparison Page. On the top, the query page shows the 28 comparisons currently available (others will be added as new genome sequences are finished and released). Bottom, the result of a query performed for <it>Mycobacterium tuberculosis </it>strains H37Rv and CDC1551 is summarized.</p>
               </text>
               <graphic file="1471-2105-5-4-3"/>
            </fig>
            <p>When more than two strains have been sequenced, a synthesis of the results of several 2-strains comparisons is also available. Figure <figr fid="F4">4</figr> illustrates a query made for <it>Escherichia coli </it>strains O157:H7 Saka&#239;, O157:H7 EDL933, K12, and UPEC-CFT073 <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>: 87 tandem repeats were found with 2 to 4 alleles among the 4 strains (18 of which are listed in Figure <figr fid="F4">4</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Example of a query in the Strain Comparison Page for more than two strains</p>
               </caption>
               <text>
                  <p>Example of a query in the Strain Comparison Page for more than two strains. Top, the query page shows the 6 comparisons currently available (others will be added as new genome sequences are finished and released). Bottom, the result of a query performed for <it>Escherichia coli </it>strains O157:H7 Saka&#239;, O157:H7 EDL933, K12 and UPEC-CFT073 is summarized. In several loci, the size of the repeat is listed differently for the different strains, which is due to different detections by the Tandem Repeats Finder, usually as a result of internal variations within the tandem array. Total length is calculated from positions of matching flanking sequences in the different strains, and does not necessarily correspond to the length of the tandem repeat detected by TRF in the locus. "Number of alleles" refers to the number of predicted sizes differing by at least 5 bp among the strains compared.</p>
               </text>
               <graphic file="1471-2105-5-4-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The "Blast in the Tandem Repeats Database" page</p>
            </st>
            <p>To facilitate the identification of already studied tandem repeats, we implemented BLAST <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> against the tandem repeats from the database, <it>i.e</it>. the tandem repeats themselves and their flanking sequences. The Blast page is available at <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. All bacteria can be queried at once, which allows the identification of tandem repeats families, conserved in several bacterial species. Another page is dedicated to the Blast of PCR primers and provides the size of the PCR products in all the species/strains where the primers match. Figure <figr fid="F5">5</figr> shows the results of searching the PCR primer pair from tandem repeat H37Rv_0024_18 bp <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> in all bacteria: as expected, the PCR primer pair matches <it>Mycobacterium tuberculosis </it>strains H37Rv and CDC1551, providing different PCR product lengths.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Example of a query in the "Blast of PCR primers" page, providing the length of the PCR products in the strains/species where the primer pair matches, and links to the corresponding tandem repeats descriptions</p>
               </caption>
               <text>
                  <p>Example of a query in the "Blast of PCR primers" page, providing the length of the PCR products in the strains/species where the primer pair matches, and links to the corresponding tandem repeats descriptions.</p>
               </text>
               <graphic file="1471-2105-5-4-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>The Bacterial Genotyping page</p>
            </st>
            <p>The Bacterial Genotyping page <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> provides one illustration on how tandem repeat typing data can be made available via internet to allow external users to query genotyping data (<it>Bacillus anthracis</it>, <it>Yersinia pestis</it>, <it>Mycobacterium tuberculosis</it>, <it>Pseudomonas aeruginosa </it>for the moment) and compare a new strain to existing data as previously described in <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. For each locus, allele sizes can be selected among a list of possibilities (observed sizes). The results of the query indicate a similarity score and include links to the complete data recorded for each strain listed. This page is just meant as an illustration and prototype. MLVA reference data could also be made available for downloading as tabular data files, or can be copied from published datasets, which can then be complemented by in-house data, and analyzed by the appropriate clustering software.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Bacterial genomes evolution</p>
            </st>
            <p>As shown by the indexes from Figure <figr fid="F2">2</figr>, there are different ways to represent the divergence/similarity between two strains. They are not correlated, suggesting independent evolution processes. First, the "mean %identity" between two genomes reflects point mutations, and is an indicator of the time passed since the two strains diverged. For instance, <it>Yersinia pestis </it>is known to be of recent emergence <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> and shows a high "mean %identity" between strains CO-92 (orientalis) and KIM5 P12 (medievalis). In contrast, and as shown by the dot plot, large genome rearrangements occurred in this genome, which is representative of a high genome plasticity <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. The index "% of flanking sequences not rearranged" is an indicator of small-scale genome rearrangements, such as the insertions of mobile elements. This index is low for genomes rich in mobile elements, like <it>Streptococcus agalactiae</it>, in which such elements significantly contribute to strain diversity <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Finally, the index "% of polymorphic tandem repeats" between two strains represents the tandem repeats evolution rate. For the moment, the mechanisms of bacterial VNTRs mutations have not been precisely investigated, but it seems likely to be independent of the other processes mentioned, as there are no correlations between the indexes. Figure <figr fid="F2">2</figr> provides clues to assess which typing method(s) will be efficient in the different species. For instance, the two bacterial species <it>Salmonella typhimurium </it>strain LT2 <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and <it>Shigella flexneri </it>strain 2a301 <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> share only 86.06% of sequence identity, clearly making the identification of matching tandem repeats between the two species difficult and of low significance. MLVA analysis appears to be of highest interest for the subspecies typing of highly monomorphic species including <it>Yersinia pestis</it>, <it>Bacillus anthracis</it>, <it>Mycobacterium tuberculosis </it>and <it>Brucella </it><abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B41">41</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Strain comparison efficiency</p>
            </st>
            <p>The sequencing of more than one strain for some bacterial species allows direct identification of polymorphic tandem repeats, assuming that no sequencing errors occurred. Earlier investigations provide good reasons to believe that tandem repeats in the size range considered here (a few hundred base-pairs) are correctly sequenced, and consequently, that the strain comparison data is reliable. As a negative control, the comparison of two independent sequences from the same strain of <it>Agrobacterium tumefaciens </it>strain (C58), one from Cereon genomics <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and the other from Washington University <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, shows that no length polymorphism is detected among tandem repeats (Figure <figr fid="F2">2</figr>) between the two independent sequences. As a positive control, the tandem repeats predicted to be polymorphic by genome sequence comparison between the two strains of <it>M. tuberculosis </it>have indeed been proved polymorphic by PCR typing of isolates <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <p>Selection based on comparison of sequence data from two strains will miss some polymorphic loci. Indeed, the results provided by the approach rely upon the phylogenetic distance between the two strains being compared. If the strains are very closely related, only a few TRs will be found different between them, but these tandem repeats will probably be the most polymorphic ones. Conversely, if the strains are distant in the phylogenetic tree, a larger number of polymorphic TRs will be found, some of them will be only moderately polymorphic. Obviously, when a few well-selected strains have been sequenced, it is likely that very few polymorphic tandem repeats are undetected in the Strain Comparison pages.</p>
            <p>It is of course still going to be very important to determine the TR allele frequency for isolates carefully selected to be representative of the global diversity of a given pathogen before suggesting the configuration of an MLVA assay to use in subsequent studies. In addition, those TR markers that are highly polymorphic in diverse test panels of isolates may be monomorphic when applied to isolates responsible for local outbreaks. The configuration of TR markers used to make up an assay needs to be determined empirically with representative local isolates and tailored to the study population and study questions.</p>
         </sec>
         <sec>
            <st>
               <p>Polymorphic tandem repeats selection for species with only one sequenced strain</p>
            </st>
            <p>The identification of simple criteria able to predict tandem repeat polymorphism when genome sequence data is available for only one strain would indeed greatly facilitate the development of MLVA assays. It would seem reasonable for instance to expect that the number of copies and the internal homogeneity of tandem arrays are strong predictors <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. We take advantage here of the many strain comparisons which are made available via the strain comparison pages to evaluate such criteria.</p>
            <p>We have analyzed bacteria with at least three sequenced genomes (<it>Staphylococcus aureus</it>: 6 strains, <it>Escherichia coli</it>: 4 strains, <it>Streptococcus pyogenes</it>: 4 strains and <it>Salmonella typhi </it>and <it>typhimurium</it>: 3 strains). We assume that in such cases, only a few polymorphic tandem repeats are missed in the comparisons. We compared the distribution of tandem repeats sequence characteristics among the group of "polymorphic" loci (differing in at least two of the strains compared, excluding length differences between strains that resulted from microdeletions in the flanking sequences) and the others. Comparisons were performed for the following sequence characteristics: unit length, copy number, total length, %GC, GC bias (=|%G-%C|/(%G+%C)), %matches, and HistoryR (a score derived from tandem repeat history reconstruction algorithm <abbrgrp><abbr bid="B44">44</abbr></abbrgrp> as described in <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>). None of the variables were normally distributed, as tested with Kolmogorov-Smirnov test, so a non-parametric Wilcoxon test was used to compare the distributions, which were judged significantly different at the .05 level of the statistic (2 tailed). Distributions were significantly different for all 4 species studied for %matches, total length and copy number. As shown on Figure <figr fid="F6">6</figr>, polymorphic TRs have a higher internal conservation and total length than monomorphic ones. Copy number, which is correlated with total length, is also higher among polymorphic TRs.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Proportion of predicted polymorphic (pink) and monomorphic (grey) tandem repeats according to different parameters (internal homogeneity of the repeat array (%matches) or total length)</p>
               </caption>
               <text>
                  <p>Proportion of predicted polymorphic (pink) and monomorphic (grey) tandem repeats according to different parameters (internal homogeneity of the repeat array (%matches) or total length). P-values obtained for the non-parametric Wilcoxon tests appear below each histogram.</p>
               </text>
               <graphic file="1471-2105-5-4-6"/>
            </fig>
            <p>Selecting the longest and most conserved tandem repeats should thus improve polymorphic TRs identification. Table <tblr tid="T1">1</tblr> illustrates the query "total length &#8805;  80 bp and %matches &#8805; 80%" applied to the four species used to find predictive criteria. For all four species, the group fulfilling the criterion is, as expected, enriched in polymorphic (at least two alleles) tandem repeats: in <it>Staphyloccocus aureus, </it>polymorphic tandem repeats represent only 8.5% of the whole population of tandem repeat loci but are predominant (87%) in the criterion positive group. The enrichment is even greater for highly polymorphic TRs, <it>i.e. </it>with 3 alleles or more: for example from 4.5% in the whole set to 66% in the positive group for <it>Staphylococcus aureus</it>. However this simple criterion misses more than half of the polymorphic loci. In addition, the efficiency of the criterion is highly variable in the different species: it is relatively satisfying in <it>Staphylococcus aureus </it>(54% of polymorphic tandem repeats would be missed) but very inefficient in <it>Streptococcus pyogenes </it>(almost 90% are missed). The results for highly polymorphic loci (3 alleles or more) are more consistent (the proportion of TRs with 3 alleles or more detected by the criterion ranges from 58% for <it>Escherichia coli </it>to 100% for <it>Salmonella</it>).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Use of the criterion "total length &#8805; 80 bp and %matches &#8805; 80%" on 4 species for which 3 strains or more were compared. The number of monomorphic, polymorphic (2 alleles or more) and highly polymorphic (3 alleles or more) TRs in whole set, and positive and negative groups are listed. (a) "criterion" refers to the selection of TRs with L &#8805; 80 bp and %M &#8805; 80%</p>
               </caption>
               <tblbdy cols="13">
                  <r>
                     <c ca="center">
                        <p>Comparison (total number of TRs)</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Whole set (proportion of total number)</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Tandem repeats with L&#8805;80 bp AND %M&#8805;80% (proportion among the set)</p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>Tandem repeats with L&lt;80 bp OR %M&lt;80% (proportion among the set)</p>
                     </c>
                     <c ca="center">
                        <p>% of the polymorphic TRs (2 alleles or more) that were detected by criterion<sup>a</sup></p>
                     </c>
                     <c ca="center">
                        <p>% of the TRs with 3 alleles or more that were detected by criterion<sup>a</sup></p>
                     </c>
                     <c ca="center">
                        <p>% of all TRs that fulfil the criterion<sup>a</sup></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>1 allele</p>
                     </c>
                     <c ca="center">
                        <p>2 alleles or more</p>
                     </c>
                     <c ca="center">
                        <p>3 alleles or more</p>
                     </c>
                     <c ca="center">
                        <p>1 allele</p>
                     </c>
                     <c ca="center">
                        <p>2 alleles</p>
                     </c>
                     <c ca="center">
                        <p>3 alleles or more</p>
                     </c>
                     <c ca="center">
                        <p>1 allele</p>
                     </c>
                     <c ca="center">
                        <p>2 alleles or more</p>
                     </c>
                     <c ca="center">
                        <p>3 alleles or more</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="3">
                        <hr/>
                     </c>
                     <c cspan="1">
                        <hr/>
                     </c>
                     <c cspan="1">
                        <hr/>
                     </c>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>S aureus </it>(833 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>762 (91.5%)</p>
                     </c>
                     <c ca="center">
                        <p>71 (8.5%)</p>
                     </c>
                     <c ca="center">
                        <p>38 (4.5%)</p>
                     </c>
                     <c ca="center">
                        <p>5 (13%)</p>
                     </c>
                     <c ca="center">
                        <p>8 (20%)</p>
                     </c>
                     <c ca="center">
                        <p>25 (66%)</p>
                     </c>
                     <c ca="center">
                        <p>757 (95%)</p>
                     </c>
                     <c ca="center">
                        <p>25 (3.5%)</p>
                     </c>
                     <c ca="center">
                        <p>13 (1.5%)</p>
                     </c>
                     <c ca="center">
                        <p>46%</p>
                     </c>
                     <c ca="center">
                        <p>66%</p>
                     </c>
                     <c ca="center">
                        <p>7.23%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>E coli </it>(790 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>739 (93.5%)</p>
                     </c>
                     <c ca="center">
                        <p>51 (6.5%)</p>
                     </c>
                     <c ca="center">
                        <p>12 (1.5 %)</p>
                     </c>
                     <c ca="center">
                        <p>12 (38%)</p>
                     </c>
                     <c ca="center">
                        <p>13 (40%)</p>
                     </c>
                     <c ca="center">
                        <p>7 (22%)</p>
                     </c>
                     <c ca="center">
                        <p>727 (96%)</p>
                     </c>
                     <c ca="center">
                        <p>26 (3.5%)</p>
                     </c>
                     <c ca="center">
                        <p>5 (0.5%)</p>
                     </c>
                     <c ca="center">
                        <p>39%</p>
                     </c>
                     <c ca="center">
                        <p>58%</p>
                     </c>
                     <c ca="center">
                        <p>4.86%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>S typhi / typhimurium </it>(641 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>625 (97.5%)</p>
                     </c>
                     <c ca="center">
                        <p>16 (2.5%)</p>
                     </c>
                     <c ca="center">
                        <p>2 (0.3%)</p>
                     </c>
                     <c ca="center">
                        <p>13 (68%)</p>
                     </c>
                     <c ca="center">
                        <p>4 (22%)</p>
                     </c>
                     <c ca="center">
                        <p>2 (10%)</p>
                     </c>
                     <c ca="center">
                        <p>612 (98%)</p>
                     </c>
                     <c ca="center">
                        <p>10 (2%)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0%)</p>
                     </c>
                     <c ca="center">
                        <p>37.5%</p>
                     </c>
                     <c ca="center">
                        <p>100%</p>
                     </c>
                     <c ca="center">
                        <p>3.27%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>S pyogenes </it>(292 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>276 (94.5%)</p>
                     </c>
                     <c ca="center">
                        <p>16 (5.5%)</p>
                     </c>
                     <c ca="center">
                        <p>3 (1%)</p>
                     </c>
                     <c ca="center">
                        <p>4 (67%)</p>
                     </c>
                     <c ca="center">
                        <p>0 (0%)</p>
                     </c>
                     <c ca="center">
                        <p>2 (33%)</p>
                     </c>
                     <c ca="center">
                        <p>272 (95%)</p>
                     </c>
                     <c ca="center">
                        <p>14 (4.7%)</p>
                     </c>
                     <c ca="center">
                        <p>1 (0.3%)</p>
                     </c>
                     <c ca="center">
                        <p>12.5%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>2.71%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>It is tempting to speculate that these observations are applicable to other species. Subsequently, we applied the criterion to ten of the 2-strains comparisons available on the Strain Comparison Page (Table <tblr tid="T2">2</tblr>). In all ten instances, the criterion positive group is enriched in TRs with different lengths between the two strains, compared to the whole set. This proportion varies from less than 3% in <it>Streptococcus agalactiae </it>to more than 20% in <it>Xylella fastidiosa </it>in the whole set. It is increased to 33% and 93% respectively among the set of loci which satisfy the criterion (these percentages correspond to the predictor's specificity), but the vast majority of polymorphic loci will be missed (90% and 80% respectively). Sensitivity, that is % of the TRs with different lengths that were detected by criterion varies from 6.90% for <it>Brucella </it>to 44.26% for <it>Mycobacterium tuberculosis</it>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Use of the criterion "total length &#8805; 80 bp and %matches &#8805; 80%" on 10 species for which 2 strains were compared. The numbers of tandem repeats with equal lengths and different lengths between the two strains in the whole set, and positive and negative groups are listed.</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c ca="center">
                        <p>Comparison (total number of TRs loci)</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Whole set (proportion)</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Criterion + (L&#8805;80 bp, %M&#8805;80%)</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>Criterion -</p>
                     </c>
                     <c ca="center">
                        <p>Sensitivity (% of the TRs with different lengths that were detected by criterion)</p>
                     </c>
                     <c ca="center">
                        <p>Specificity (% of the TRs predicted by the criterion that have different length)</p>
                     </c>
                     <c ca="center">
                        <p>% of all TRs that fulfil the criterion</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>equal length</p>
                     </c>
                     <c ca="center">
                        <p>different length</p>
                     </c>
                     <c ca="center">
                        <p>equal length</p>
                     </c>
                     <c ca="center">
                        <p>different length</p>
                     </c>
                     <c ca="center">
                        <p>equal length</p>
                     </c>
                     <c ca="center">
                        <p>different length</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="1">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="2">
                        <hr/>
                     </c>
                     <c cspan="1">
                        <hr/>
                     </c>
                     <c cspan="1">
                        <hr/>
                     </c>
                     <c cspan="1">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>H pylori </it>26695/J99 (624 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>506 (81%)</p>
                     </c>
                     <c ca="center">
                        <p>118 (19%)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>506</p>
                     </c>
                     <c ca="center">
                        <p>107</p>
                     </c>
                     <c ca="center">
                        <p>9%</p>
                     </c>
                     <c ca="center">
                        <p>100%</p>
                     </c>
                     <c ca="center">
                        <p>2%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>N meningitidis </it>MC58/Z2491 (642 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>528 (82%)</p>
                     </c>
                     <c ca="center">
                        <p>114 (18%)</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>23</p>
                     </c>
                     <c ca="center">
                        <p>518</p>
                     </c>
                     <c ca="center">
                        <p>91</p>
                     </c>
                     <c ca="center">
                        <p>20%</p>
                     </c>
                     <c ca="center">
                        <p>70%</p>
                     </c>
                     <c ca="center">
                        <p>5%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>M tuberculosis </it>H37Rv/CDC1551(1502 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>1441 (96%)</p>
                     </c>
                     <c ca="center">
                        <p>61 (4%)</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>1406</p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="center">
                        <p>44%</p>
                     </c>
                     <c ca="center">
                        <p>44%</p>
                     </c>
                     <c ca="center">
                        <p>4%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>L monocytogenes </it>EGD-e/<it>L innocua </it>Clip11262 (576 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>553 (96%)</p>
                     </c>
                     <c ca="center">
                        <p>23 (4%)</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>551</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>13%</p>
                     </c>
                     <c ca="center">
                        <p>60%</p>
                     </c>
                     <c ca="center">
                        <p>1%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>S agalactiae NEM316/2603 </it>(398 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>387 (97%)</p>
                     </c>
                     <c ca="center">
                        <p>11 (3%)</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>385</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>9%</p>
                     </c>
                     <c ca="center">
                        <p>33%</p>
                     </c>
                     <c ca="center">
                        <p>1%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>S pneumoniae </it>TIGR4/R6 (406 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>339 (83%)</p>
                     </c>
                     <c ca="center">
                        <p>67 (17%)</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>29</p>
                     </c>
                     <c ca="center">
                        <p>325</p>
                     </c>
                     <c ca="center">
                        <p>38</p>
                     </c>
                     <c ca="center">
                        <p>43%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>10%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>Y pestis CO-92/KIM5 P12 </it>(1499 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>1372 (92%)</p>
                     </c>
                     <c ca="center">
                        <p>127 (8%)</p>
                     </c>
                     <c ca="center">
                        <p>44</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>1328</p>
                     </c>
                     <c ca="center">
                        <p>108</p>
                     </c>
                     <c ca="center">
                        <p>15%</p>
                     </c>
                     <c ca="center">
                        <p>30%</p>
                     </c>
                     <c ca="center">
                        <p>4%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>R prowazekii </it>Madrid E/<it>R conorii </it>malish 7 (316 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>290 (92%)</p>
                     </c>
                     <c ca="center">
                        <p>26 (8%)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>290</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>8%</p>
                     </c>
                     <c ca="center">
                        <p>100%</p>
                     </c>
                     <c ca="center">
                        <p>1%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>Brucella suis </it>1330/ <it>Brucella melitensis </it>16 M (739 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>681 (92%)</p>
                     </c>
                     <c ca="center">
                        <p>58 (8%)</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>679</p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                     <c ca="center">
                        <p>7%</p>
                     </c>
                     <c ca="center">
                        <p>67%</p>
                     </c>
                     <c ca="center">
                        <p>1%</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p><it>X fastidiosa </it>9a5c/grape Temecula1 (573 TRs)</p>
                     </c>
                     <c ca="center">
                        <p>440 (77%)</p>
                     </c>
                     <c ca="center">
                        <p>133 (23%)</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>438</p>
                     </c>
                     <c ca="center">
                        <p>105</p>
                     </c>
                     <c ca="center">
                        <p>21%</p>
                     </c>
                     <c ca="center">
                        <p>93%</p>
                     </c>
                     <c ca="center">
                        <p>5%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The finding that polymorphic tandem repeats have, on average, a higher internal conservation, total length, and copy number than monomorphic ones is in agreement with previous observations that TR polymorphism is correlated with conservation in <it>Yersinia pestis </it>and with total length in <it>Bacillus anthracis </it><abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. It is also reminiscent of the behavior of microsatellites (also called short sequence repeats: SSR, see <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> for review), which are stabilized by internal variations <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and by reduction of the number of repeats <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. Unfortunately, we show here that such simple prediction criteria may miss a very large proportion of polymorphic tandem repeats, and provide highly variable results in different species. This indicates that, in the absence of sequence data from two strains or more, the systematic testing of tandem repeats polymorphism across a set of relevant strains remains the most appropriate way to develop an MLVA assay. Consequently, the Strain Comparison page is of great use when two strains or more have been sequenced.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Bacterial strain typing at the subspecies level is essential for epidemiological issues in the context of disease control. This can be used to determine if an <it>S. aureus </it>or <it>P. aeruginosa </it>infection for instance has been acquired in an hospital environment or not. On a larger scale, it can be used to trace the emergence of new, more virulent or drug resistant <it>M. tuberculosis </it>strains. It is also of interest in the field of bioterrorism and bioweapons control, as was shown by the investigations following the 2001 <it>B. anthracis </it>attacks. Tandem repeats typing has recently emerged as one way to address this issue. Indeed, in the case of a number of highly monomorphic bacterial species, including <it>B. anthracis </it>and <it>Y. pestis</it>, tandem repeats typing is the method of choice for subspecies typing. In addition to the fact that these loci represent an important fraction of the existing polymorphism, it offers a number of practical advantages, including the ease of typing, and of data exchanges among different countries. It is hoped that the tools which are described here will help evaluate the potential of tandem repeats typing assays for a larger range of pathogens.</p>
      </sec>
      <sec>
         <st>
            <p>Availability</p>
         </st>
         <p>All the tools presented are freely available from <url>http://minisatellites.u-psud.fr</url>.</p>
      </sec>
      <sec>
         <st>
            <p>List of abbreviations used</p>
         </st>
         <p>ASP: active server pages</p>
         <p>MLVA: multiple locus VNTR analysis</p>
         <p>PCR: polymerase chain reaction</p>
         <p>TR: tandem repeat</p>
         <p>TRF: tandem repeats finder</p>
      </sec>
      <sec>
         <st>
            <p>Authors contributions</p>
         </st>
         <p>FD is the developer of the database and web site, and the curator of the database. GV participated in the development of the initial procedure for the tandem repeat size comparisons between two genomes. The two authors contributed equally to the writing.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgments</p>
            </st>
            <p>This work was funded by grants from D&#233;l&#233;gation G&#233;n&#233;rale de l'Armement (DGA, France) aimed at facilitating the typing of dangerous pathogens.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1" rating="0">
            <title>
               <p>High-throughput epidemiologic typing in clinical microbiology</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Belkum</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Clin Microbiol Infect</source>
            <pubdate>2003</pubdate>
            <volume>9</volume>
            <fpage>86</fpage>
            <lpage>100</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1046/j.1469-0691.2003.00549.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12588328</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2" rating="0">
            <title>
               <p>Multilocus sequence typing</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Enright</snm>
                  <fnm>MC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Spratt</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>7</volume>
            <fpage>482</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1016/S0966-842X(99)01609-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">10603483</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3" rating="0">
            <title>
               <p>Genome-wide analysis of synonymous single nucleotide polymorphisms in <it>Mycobacterium tuberculosis </it>complex organisms: resolution of genetic relationships among closely related microbial strains</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gutacker</snm>
                  <fnm>MM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Smoot</snm>
                  <fnm>JC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Migliaccio</snm>
                  <fnm>CA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ricklefs</snm>
                  <fnm>SM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hua</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cousins</snm>
                  <fnm>DV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Graviss</snm>
                  <fnm>EA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shashkina</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kreiswirth</snm>
                  <fnm>BN</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Musser</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2002</pubdate>
            <volume>162</volume>
            <fpage>1533</fpage>
            <lpage>43</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12524330</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4" rating="0">
            <title>
               <p><it>Yersinia pestis</it>, the cause of plague, is a recently emerged clone of <it>Yersinia pseudotuberculosis</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Achtman</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zurth</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Morelli</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Torrea</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Guiyoule</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Carniel</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1999</pubdate>
            <volume>96</volume>
            <fpage>14043</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">24187</pubid>
                  <pubid idtype="pmpid" link="fulltext">10570195</pubid>
                  <pubid idtype="doi" link="fulltext">10.1073/pnas.96.24.14043</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5" rating="0">
            <title>
               <p>Variable number of tandem repeats in clinical strains of <it>Haemophilus influenzae</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Belkum</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scherer</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Leeuwen</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Willemse</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Alphen</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Verbrugh</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>1997</pubdate>
            <volume>65</volume>
            <fpage>5017</fpage>
            <lpage>27</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">175724</pubid>
                  <pubid idtype="pmpid" link="fulltext">9393791</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6" rating="0">
            <title>
               <p>Genetic diversity in the <it>Mycobacterium tuberculosis </it>complex based on variable numbers of tandem DNA repeats</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Frothingham</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Meeker-O'Connell</snm>
                  <fnm>WA</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>1998</pubdate>
            <volume>144</volume>
            <fpage>1189</fpage>
            <lpage>1196</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9611793</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7" rating="0">
            <title>
               <p>Variable human minisatellite-like regions in the <it>Mycobacterium tuberculosis </it>genome</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Supply</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mazars</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lesjean</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vincent</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gicquel</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Locht</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>36</volume>
            <fpage>762</fpage>
            <lpage>771</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1046/j.1365-2958.2000.01905.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10844663</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8" rating="0">
            <title>
               <p>Diversity in a variable-number tandem repeat from <it>Yersinia pestis</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Adair</snm>
                  <fnm>DM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Worsham</snm>
                  <fnm>PL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hill</snm>
                  <fnm>KK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Klevytska</snm>
                  <fnm>AM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jackson</snm>
                  <fnm>PJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Friedlander</snm>
                  <fnm>AM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Keim</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Clin Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>38</volume>
            <fpage>1516</fpage>
            <lpage>9</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">86479</pubid>
                  <pubid idtype="pmpid" link="fulltext">10747136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9" rating="0">
            <title>
               <p>A tandem repeats database for bacterial genomes: application to the genotyping of <it>Yersinia pestis </it>and <it>Bacillus anthracis</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Le Fl&#232;che</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hauck</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Onteniente</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Prieur</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Denoeud</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ramisse</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sylvestre</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Benson</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ramisse</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vergnaud</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>1</volume>
            <fpage>2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">31411</pubid>
                  <pubid idtype="pmpid" link="fulltext">11299044</pubid>
                  <pubid idtype="doi" link="fulltext">10.1186/1471-2180-1-2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10" rating="0">
            <title>
               <p>High resolution, on-line identification of strains from the <it>Mycobacterium tuberculosis </it>complex based on tandem repeat typing</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Le Fl&#232;che</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fabre</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Denoeud</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Koeck</snm>
                  <fnm>JL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vergnaud</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <fpage>37</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">140014</pubid>
                  <pubid idtype="pmpid" link="fulltext">12456266</pubid>
                  <pubid idtype="doi" link="fulltext">10.1186/1471-2180-2-37</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11" rating="0">
            <title>
               <p>Small DNA deletions creating avirulence in <it>Streptococcus pyogenes</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Spanier</snm>
                  <fnm>JG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jones</snm>
                  <fnm>SJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cleary</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1984</pubdate>
            <volume>225</volume>
            <fpage>935</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">6089334</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12" rating="0">
            <title>
               <p>Size variation in group A streptococcal M protein is generated by homologous recombination between intragenic repeats</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hollingshead</snm>
                  <fnm>SK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fischetti</snm>
                  <fnm>VA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scott</snm>
                  <fnm>JR</fnm>
               </au>
            </aug>
            <source>Mol Gen Genet</source>
            <pubdate>1987</pubdate>
            <volume>207</volume>
            <fpage>196</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">3039291</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13" rating="0">
            <title>
               <p>Molecular evolution and diversity in <it>Bacillus anthracis </it>as detected by amplified fragment length polymorphism markers</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Keim</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kalif</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schupp</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hill</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Travis</snm>
                  <fnm>SE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Richmond</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Adair</snm>
                  <fnm>DM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hugh-Jones</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kuske</snm>
                  <fnm>CR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jackson</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <fpage>818</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">178765</pubid>
                  <pubid idtype="pmpid" link="fulltext">9006038</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14" rating="0">
            <title>
               <p>Multiple-Locus Variable-Number Tandem Repeat Analysis Reveals Genetic Relationships within <it>Bacillus anthracis</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Keim</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Price</snm>
                  <fnm>LB</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Klevytska</snm>
                  <fnm>AM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Smith</snm>
                  <fnm>KL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schupp</snm>
                  <fnm>JM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Okinaka</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jackson</snm>
                  <fnm>PJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hugh-Jones</snm>
                  <fnm>ME</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2000</pubdate>
            <volume>182</volume>
            <fpage>2928</fpage>
            <lpage>2936</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">102004</pubid>
                  <pubid idtype="pmpid" link="fulltext">10781564</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JB.182.10.2928-2936.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15" rating="0">
            <title>
               <p>Polymorphism in the collagen-like region of the <it>Bacillus anthracis </it>BclA protein leads to variation in exosporium filament length</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sylvestre</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Couture-Tosi</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mock</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2003</pubdate>
            <volume>185</volume>
            <fpage>1555</fpage>
            <lpage>63</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">148075</pubid>
                  <pubid idtype="pmpid" link="fulltext">12591872</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JB.185.5.1555-1563.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16" rating="0">
            <title>
               <p>Automated high-throughput genotyping for study of global epidemiology of <it>Mycobacterium tuberculosis </it>based on mycobacterial interspersed repetitive units</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Supply</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lesjean</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Savine</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kremer</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Soolingen</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Locht</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Clin Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>39</volume>
            <fpage>3563</fpage>
            <lpage>3571</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">88389</pubid>
                  <pubid idtype="pmpid" link="fulltext">11574573</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JCM.39.10.3563-3571.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17" rating="0">
            <title>
               <p>Minisatellites: Mutability and Genome Architecture</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vergnaud</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Denoeud</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>899</fpage>
            <lpage>907</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1101/gr.10.7.899</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899139</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18" rating="0">
            <title>
               <p>Characterization of a Tandem Repeat Polymorphism in <it>Legionella pneumophila </it>and Its Use for Genotyping</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Pourcel</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vidgop</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ramisse</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vergnaud</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tram</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Clin Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>41</volume>
            <fpage>1819</fpage>
            <lpage>1826</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">154723</pubid>
                  <pubid idtype="pmpid" link="fulltext">12734211</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JCM.41.5.1819-1826.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19" rating="0">
            <title>
               <p>Evaluation of the polymorphisms associated with tandem repeats for <it>Pseudomonas aeruginosa </it>strain typing</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Onteniente</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brisse</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tassios</snm>
                  <fnm>PT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vergnaud</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Clin Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>41</volume>
            <fpage>4991</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">262500</pubid>
                  <pubid idtype="pmpid" link="fulltext">14605129</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JCM.41.11.4991-4997.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20" rating="0">
            <title>
               <p>GOLD Genomes OnLine Database</p>
            </title>
            <url>http://ergo.integratedgenomics.com/GOLD/</url>
         </bibl>
         <bibl id="B21" rating="0">
            <title>
               <p>Tandem repeats finder: a program to analyze DNA sequences</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Benson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>573</fpage>
            <lpage>580</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">148217</pubid>
                  <pubid idtype="pmpid" link="fulltext">9862982</pubid>
                  <pubid idtype="doi" link="fulltext">10.1093/nar/27.2.573</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22" rating="0">
            <title>
               <p>The Tandem Repeats Finder</p>
            </title>
            <url>http://tandem.bu.edu/trf/trf.html</url>
         </bibl>
         <bibl id="B23" rating="0">
            <title>
               <p>Predicting Human Minisatellite Polymorphism</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Denoeud</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vergnaud</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Benson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>856</fpage>
            <lpage>867</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1101/gr.574403</pubid>
                  <pubid idtype="pmpid" link="fulltext">12695323</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24" rating="0">
            <title>
               <p>The ActiveState Programmer Network (ASPN) ActivePerl download page</p>
            </title>
            <url>http://www.activestate.com/products/ActivePerl</url>
         </bibl>
         <bibl id="B25" rating="0">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi" link="fulltext">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26" rating="0">
            <title>
               <p>The NCBI BLAST ftp site</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/blast/</url>
         </bibl>
         <bibl id="B27" rating="0">
            <title>
               <p>The tandem repeats database</p>
            </title>
            <url>http://minisatellites.u-psud.fr</url>
         </bibl>
         <bibl id="B28" rating="0">
            <title>
               <p>The Strain Comparison Page</p>
            </title>
            <url>http://minisatellites.u-psud.fr/comparison</url>
         </bibl>
         <bibl id="B29" rating="0">
            <title>
               <p>Genome sequence of <it>Yersinia pestis</it>, the causative agent of plague</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wren</snm>
                  <fnm>BW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Thomson</snm>
                  <fnm>NR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Titball</snm>
                  <fnm>RW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Holden</snm>
                  <fnm>MT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Prentice</snm>
                  <fnm>MB</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sebaihia</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>James</snm>
                  <fnm>KD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mungall</snm>
                  <fnm>KL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Baker</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bentley</snm>
                  <fnm>SD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brooks</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cerdeno-Tarraga</snm>
                  <fnm>AM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cronin</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Davies</snm>
                  <fnm>RM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Davis</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dougan</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hamlin</snm>
                  <fnm>N</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Karlyshev</snm>
                  <fnm>AV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Leather</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Oyston</snm>
                  <fnm>PC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Quail</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Simmonds</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Skelton</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>413</volume>
            <fpage>523</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/35097083</pubid>
                  <pubid idtype="pmpid" link="fulltext">11586360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30" rating="0">
            <title>
               <p>Genome sequence of <it>Yersinia pestis </it>KIM</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Deng</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
                  <suf>3rd</suf>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Boutin</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Liss</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mau</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhou</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schwartz</snm>
                  <fnm>DC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fetherston</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lindler</snm>
                  <fnm>LE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brubaker</snm>
                  <fnm>RR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Plano</snm>
                  <fnm>GV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Straley</snm>
                  <fnm>SC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>McDonough</snm>
                  <fnm>KA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nilles</snm>
                  <fnm>ML</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Matson</snm>
                  <fnm>JS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Perry</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>4601</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">135232</pubid>
                  <pubid idtype="pmpid" link="fulltext">12142430</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JB.184.16.4601-4611.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31" rating="0">
            <title>
               <p>Genome plasticity in <it>Yersinia pestis</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Radnedge</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Agron</snm>
                  <fnm>PG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Worsham</snm>
                  <fnm>PL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Andersen</snm>
                  <fnm>GL</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2002</pubdate>
            <volume>148</volume>
            <fpage>1687</fpage>
            <lpage>98</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12055289</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32" rating="0">
            <title>
               <p>Whole-Genome Comparison of <it>Mycobacterium tuberculosis </it>Clinical and Laboratory Strains</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Alland</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Carpenter</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>DeBoy</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dodson</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gwinn</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Haft</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hickey</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Umayam</snm>
                  <fnm>LA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ermolaeva</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Delcher</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Utterback</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Weidman</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gill</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mikula</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bishai</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jacobs</snm>
                  <fnm>WR</fnm>
                  <suf>Jr</suf>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <fpage>5479</fpage>
            <lpage>5490</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">135346</pubid>
                  <pubid idtype="pmpid" link="fulltext">12218036</pubid>
                  <pubid idtype="doi" link="fulltext">10.1128/JB.184.19.5479-5490.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33" rating="0">
            <title>
               <p>Complete genome sequence of enterohemorrhagic <it>Escherichia coli </it>O157:H7 and genomic comparison with a laboratory strain K-12</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Makino</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ohnishi</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kurokawa</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ishii</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yokoyama</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Han</snm>
                  <fnm>CG</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ohtsubo</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nakayama</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Murata</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tanaka</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tobe</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Iida</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Takami</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Honda</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sasakawa</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yasunaga</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shinagawa</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>11</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11258796</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34" rating="0">
            <title>
               <p>Genome sequence of enterohaemorrhagic <it>Escherichia coli </it>O157:H7</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
                  <suf>3rd</suf>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mau</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Evans</snm>
                  <fnm>PS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gregor</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kirkpatrick</snm>
                  <fnm>HA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Posfai</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hackett</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Klink</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Boutin</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shao</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Miller</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Grotbeck</snm>
                  <fnm>EJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Davis</snm>
                  <fnm>NW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lim</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dimalanta</snm>
                  <fnm>ET</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Potamousis</snm>
                  <fnm>KD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Apodaca</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Anantharaman</snm>
                  <fnm>TS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lin</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yen</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schwartz</snm>
                  <fnm>DC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Welch</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>529</fpage>
            <lpage>33</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/35054089</pubid>
                  <pubid idtype="pmpid" link="fulltext">11206551</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35" rating="0">
            <title>
               <p>Extensive mosaic structure revealed by the complete genome sequence of uropathogenic <it>Escherichia coli</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Welch</snm>
                  <fnm>RA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
                  <suf>3rd</suf>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Redford</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Roesch</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rasko</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Buckles</snm>
                  <fnm>EL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Liou</snm>
                  <fnm>SR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Boutin</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hackett</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Stroud</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhou</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schwartz</snm>
                  <fnm>DC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mobley</snm>
                  <fnm>HL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Donnenberg</snm>
                  <fnm>MS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>17020</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">139262</pubid>
                  <pubid idtype="pmpid" link="fulltext">12471157</pubid>
                  <pubid idtype="doi" link="fulltext">10.1073/pnas.252529799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36" rating="0">
            <title>
               <p>The Blast in the tandem repeats database page</p>
            </title>
            <url>http://minisatellites.u-psud.fr/Blast</url>
         </bibl>
         <bibl id="B37" rating="0">
            <title>
               <p>The Bacterial Genotyping Page</p>
            </title>
            <url>http://bacterial-genotyping.igmors.u-psud.fr</url>
         </bibl>
         <bibl id="B38" rating="0">
            <title>
               <p>Complete genome sequence and comparative genomic analysis of an emerging human pathogen, serotype V <it>Streptococcus agalactiae</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Masignani</snm>
                  <fnm>V</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cieslewicz</snm>
                  <fnm>MJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Peterson</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wessels</snm>
                  <fnm>MR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Margarit</snm>
                  <fnm>I</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Read</snm>
                  <fnm>TD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Madoff</snm>
                  <fnm>LC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wolf</snm>
                  <fnm>AM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Beanan</snm>
                  <fnm>MJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brinkac</snm>
                  <fnm>LM</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Daugherty</snm>
                  <fnm>SC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>DeBoy</snm>
                  <fnm>RT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Durkin</snm>
                  <fnm>AS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kolonay</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Madupu</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lewis</snm>
                  <fnm>MR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Radune</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fedorova</snm>
                  <fnm>NB</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scanlan</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Khouri</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mulligan</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Carty</snm>
                  <fnm>HA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cline</snm>
                  <fnm>RT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Van Aken</snm>
                  <fnm>SE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gill</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scarselli</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mora</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Iacobini</snm>
                  <fnm>ET</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Brettoni</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Galli</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mariani</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vegni</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Maione</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rinaudo</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rappuoli</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Telford</snm>
                  <fnm>JL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kasper</snm>
                  <fnm>DL</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Grandi</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <fpage>12391</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">129455</pubid>
                  <pubid idtype="pmpid" link="fulltext">12200547</pubid>
                  <pubid idtype="doi" link="fulltext">10.1073/pnas.182380799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39" rating="0">
            <title>
               <p>Complete genome sequence of <it>Salmonella enterica </it>serovar Typhimurium LT2</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>McClelland</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sanderson</snm>
                  <fnm>KE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clifton</snm>
                  <fnm>SW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Latreille</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Courtney</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Porwollik</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ali</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dante</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Du</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hou</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Layman</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Leonard</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nguyen</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scott</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Holmes</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Grewal</snm>
                  <fnm>N</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mulvaney</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ryan</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Florea</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Stoneking</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nhan</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Waterston</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>413</volume>
            <fpage>852</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1038/35101614</pubid>
                  <pubid idtype="pmpid" link="fulltext">11677609</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40" rating="0">
            <title>
               <p>Genome sequence of <it>Shigella flexneri </it>2a: insights into pathogenicity through comparison with genomes of <it>Escherichia coli </it>K12 and O157</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jin</snm>
                  <fnm>Q</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yuan</snm>
                  <fnm>Z</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Shen</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lu</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Liu</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yang</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wu</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Qu</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dong</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sun</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhao</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gao</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kan</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ding</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chen</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cheng</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>He</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ma</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Qiang</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wen</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hou</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>4432</fpage>
            <lpage>41</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">137130</pubid>
                  <pubid idtype="pmpid" link="fulltext">12384590</pubid>
                  <pubid idtype="doi" link="fulltext">10.1093/nar/gkf566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41" rating="0">
            <title>
               <p>Brucella 'Hoof-Prints': strain typing by multi-locus analysis of variable number tandem repeats (VNTRs)</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bricker</snm>
                  <fnm>BJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Ewalt</snm>
                  <fnm>DR</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Halling</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>BMC Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">183870</pubid>
                  <pubid idtype="pmpid" link="fulltext">12857351</pubid>
                  <pubid idtype="doi" link="fulltext">10.1186/1471-2180-3-15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42" rating="0">
            <title>
               <p>Genome sequence of the plant pathogen and biotechnology agent <it>Agrobacterium tumefaciens </it>C58</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Goodner</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hinkle</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gattung</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Miller</snm>
                  <fnm>N</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Blanchard</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Qurollo</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Goldman</snm>
                  <fnm>BS</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cao</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Askenazi</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Halling</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Mullin</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Houmiel</snm>
                  <fnm>K</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gordon</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Vaudin</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Iartchouk</snm>
                  <fnm>O</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Epp</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Liu</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wollam</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Allinger</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Doughty</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scott</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lappas</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Markelz</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Flanagan</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Crowell</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gurson</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Lomo</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Sear</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Strub</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Cielo</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Slater</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2323</fpage>
            <lpage>8</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1126/science.1066803</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743194</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43" rating="0">
            <title>
               <p>The genome of the natural genetic engineer <it>Agrobacterium tumefaciens </it>C58</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wood</snm>
                  <fnm>DW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Setubal</snm>
                  <fnm>JC</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kaul</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Monks</snm>
                  <fnm>DE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kitajima</snm>
                  <fnm>JP</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Okura</snm>
                  <fnm>VK</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wood</snm>
                  <fnm>GE</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Almeida</snm>
                  <fnm>NF</fnm>
                  <suf>Jr</suf>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Woo</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Paulsen</snm>
                  <fnm>IT</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bovee</snm>
                  <fnm>D</fnm>
                  <suf>Sr</suf>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chapman</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Clendenning</snm>
                  <fnm>J</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Deatherage</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gillet</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Grant</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kutyavin</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Levy</snm>
                  <fnm>R</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Li</snm>
                  <fnm>MJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>McClelland</snm>
                  <fnm>E</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Palmieri</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Raymond</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Rouse</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Saenphimmachak</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Wu</snm>
                  <fnm>Z</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Romero</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gordon</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Yoo</snm>
                  <fnm>H</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tao</snm>
                  <fnm>Y</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Biddle</snm>
                  <fnm>P</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Jung</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Krespan</snm>
                  <fnm>W</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Perry</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gordon-Kamm</snm>
                  <fnm>B</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Liao</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hendrick</snm>
                  <fnm>C</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Zhao</snm>
                  <fnm>ZY</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dolan</snm>
                  <fnm>M</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Chumley</snm>
                  <fnm>F</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tingey</snm>
                  <fnm>SV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Gordon</snm>
                  <fnm>MP</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Olson</snm>
                  <fnm>MV</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Nester</snm>
                  <fnm>EW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2317</fpage>
            <lpage>23</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1126/science.1066804</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44" rating="0">
            <title>
               <p>Reconstructing the duplication history of a tandem repeat</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Benson</snm>
                  <fnm>G</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Dong</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Proc Int Conf Intell Syst Mol Biol</source>
            <pubdate>1999</pubdate>
            <fpage>44</fpage>
            <lpage>53</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10786285</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45" rating="0">
            <title>
               <p>Short-sequence DNA repeats in prokaryotic genomes</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Belkum</snm>
                  <fnm>A</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Scherer</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van Alphen</snm>
                  <fnm>L</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Verbrugh</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <fpage>275</fpage>
            <lpage>93</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">98915</pubid>
                  <pubid idtype="pmpid" link="fulltext">9618442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46" rating="0">
            <title>
               <p>Two distinct models account for short and long deletions within sequence repeats in <it>Escherichia coli</it></p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Schumacher</snm>
                  <fnm>S</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Fuchs</snm>
                  <fnm>RP</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bichara</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1997</pubdate>
            <volume>179</volume>
            <fpage>6512</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid" link="fulltext">179572</pubid>
                  <pubid idtype="pmpid" link="fulltext">9335305</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47" rating="0">
            <title>
               <p>The length of a tetranucleotide repeat tract in <it>Haemophilus influenzae </it>determines the phase variation rate of a gene with homology to type III DNA methyltransferases</p>
            </title>
            <aug>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>De Bolle</snm>
                  <fnm>X</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Bayliss</snm>
                  <fnm>CD</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Field</snm>
                  <fnm>D</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>van de Ven</snm>
                  <fnm>T</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Saunders</snm>
                  <fnm>NJ</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Hood</snm>
                  <fnm>DW</fnm>
               </au>
               <au ca="no" ce="no" pa="no" da="no">
                  <snm>Moxon</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>35</volume>
            <fpage>211</fpage>
            <lpage>22</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi" link="fulltext">10.1046/j.1365-2958.2000.01701.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10632891</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
