<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-8-18</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>IS<it>4 </it>family goes genomic</p>
         </title>
         <aug>
            <au id="A1">
               <snm>De Palmenaer</snm>
               <fnm>Daniel</fnm>
               <insr iid="I1"/>
               <email>daniel.depalmenaer@uclouvain.be</email>
            </au>
            <au id="A2">
               <snm>Siguier</snm>
               <fnm>Patricia</fnm>
               <insr iid="I2"/>
               <email>siguier@ibcg.biotoul.fr</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Mahillon</snm>
               <fnm>Jacques</fnm>
               <insr iid="I1"/>
               <email>jacques.mahillon@uclouvain.be</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Laboratoire de microbiologie alimentaire et environnementale, Universit&#233; catholique de Louvain, Croix du Sud 2/12, B-1348 Louvain-la-Neuve, Belgium</p>
            </ins>
            <ins id="I2">
               <p>Laboratoire de microbiologie et g&#233;n&#233;tique mol&#233;culaires, Universit&#233; Paul Sabatier, 118 route de Narbonne, 31062 Toulouse cedex 9, France</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>18</fpage>
         <url>http://www.biomedcentral.com/1471-2148/8/18</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18215304</pubid>
               <pubid idtype="doi">10.1186/1471-2148-8-18</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>13</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>De Palmenaer et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Insertion sequences (ISs) are small, mobile DNA entities able to expand in prokaryotic genomes and trigger important rearrangements. To understand their role in evolution, accurate IS taxonomy is essential. The IS<it>4 </it>family is composed of ~70 elements and, like some other families, displays extremely elevated levels of internal divergence impeding its classification. The increasing availability of complete genome sequences provides a valuable source for the discovery of additional IS<it>4 </it>elements. In this study, this genomic database was used to update the structural and functional definition of the IS<it>4 </it>family.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A total of 227 IS<it>4</it>-related sequences were collected among more than 500 sequenced bacterial and archaeal genomes, representing more than a three fold increase of the initial inventory. A clear division into seven coherent subgroups was discovered as well as three emerging families, which displayed distinct structural and functional properties. The IS<it>4 </it>family was sporadically present in 17 % of analyzed genomes, with most of them displaying single or a small number of IS<it>4 </it>elements. Significant expansions were detected only in some pathogens as well as among certain extremophiles, suggesting the probable involvement of some elements in bacterial and archaeal adaptation and/or evolution. Finally, it should be noted that some IS<it>4 </it>subgroups and two emerging families occurred preferentially in specific phyla or exclusively inside a specific genus.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The present taxonomic update of IS<it>4 </it>and emerging families will facilitate the classification of future elements as they arise from ongoing genome sequencing. Their narrow genomic impact and the existence of both IS-poor and IS-rich thriving prokaryotes suggested that these families, and probably ISs in general, are occasionally used as a tool for genome flexibility and evolution, rather than just representing self sustaining DNA entities.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Insertion sequences (ISs) are small (&lt; 2.5 kb), generally phenotypically cryptic segments of DNA able to jump, or copy themselves, into various genomic sites with no need for DNA homology <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. They generally encode no functions other than those involved in their mobility, although elements including additional genes are now being identified <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. While almost exclusively restricted to bacterial and archaeal genomes, they are, like eukaryotic transposable elements, involved in a wide variety of biological transactions leading to genome reshuffling and evolution. Indeed, their ability to proliferate within a genome provides the potential for homologous recombination-mediated deletions or inversions, and their capacity to transport accessory genes represents an additional contribution to genome flexibility. By affecting gene expression and facilitating the emergence of new gene clusters they play an important role in adaptability of their host. Eventually, horizontal transfer mechanisms such as conjugation allow these IS-mediated sets of genes to cross barriers between strains, species and beyond <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>The DNA breaks and joins necessary for transposition are catalysed by an element encoded protein referred to as transposase. These proteins determine transposition mechanisms and are now used to lead classification of prokaryotic transposable elements in general. Accordingly, transposases that form a covalent intermediate with DNA are distinguished from those that do not. Additional distinction is provided by protein active-site residues crucial for transposition. These define the five major transposase classes currently established : tyrosine (Y), serine (S), relaxase (Y1) and rolling-circle (Y2) transposases involve covalent intermediates with DNA during transposition, while the fifth class, namely DDE transposases, prompts transposition via direct transesterification reactions <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>.</p>
         <p>DDE transposases display three acidic residues in three distinct regions of their primary sequence, namely regions N2, N3 and C1, which harbor the aspartate (D), aspartate (D) and glutamate (E) residues, respectively. Spacers of various lengths separate these regions, but their acidic residues are brought together upon protein folding to form a catalytic triad essential for transposition <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>Prokaryotic DDE transposons (mainly ISs) can move in two different ways, depending on the fate of the donor site. Replicative transposons mobilize a copy of their DNA, leaving the parent site intact, while conservative transposons cut themselves out of the donor molecule in order to paste their DNA into the target <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>Beyond mechanistic behaviors, each individual IS is characterized by structural features used to fine-tune their classification. A wealth of these data is currently generated with the rising availability of whole genome sequencing projects. As of July 2007, more than 19 different IS families are established based on over 1800 bacterial and archaeal IS sequences <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. An IS family can be defined as a collection of elements sharing the same catalytic site structure (with conserved spacers between key residues), an identical genetic organization (e.g. frameshifting in transposase gene), similar arrangements of their ends and uniform target site fates upon insertion. However, not all families are so coherent. This is why some of them (like families IS<it>4 </it>and IS<it>5</it>) are divided into subgroups being composed of a core of closely related elements that can be linked to other members of the family by weaker but still significant similarities.</p>
         <p>The IS<it>4 </it>family, like most IS families described so far, contains elements mobilized by DDE transposases performing a 'cut-and-paste' mechanism. The main and almost only hallmarks of this family's transposases are (i) absence of frameshift in the transposase gene and (ii) an Y-(2)-R-(3)-E-(6)-(K) signature (YREK) in region C1 where the glutamate residue is that of the DDE motif <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B17">17</abbr></abbrgrp>. Only very few IS<it>4 </it>elements have been studied in detail. IS<it>231A </it>has been shown to transpose <it>in vivo </it>by a 'cut-and-paste' mechanism <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, both in its natural host, <it>Bacillus thuringiensis </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, and in <it>Escherichia coli </it><abbrgrp><abbr bid="B20">20</abbr></abbrgrp>; and it displays a certain degree of insertion specificity <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Both IS<it>10R </it>and IS<it>50R </it>are part of composite transposons (Tn<it>10 </it><abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp> and Tn<it>5 </it><abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>, respectively) and are the only members for which <it>in vitro </it>systems have been set up. The latter has allowed extensive acquisition of genetic, biochemical, mechanistic and regulation data for these elements. Also, the Tn<it>5 </it>transposase is the only IS<it>4 </it>element for which X-ray crystallographic structure data are available <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B26">26</abbr></abbrgrp>.</p>
         <p>Since the initial definition of family IS<it>4 </it>(based on about 45 elements), other elements were progressively added via ISfinder, the prokaryotic IS database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. However, some of them displayed distant resemblance both to existing IS<it>4 </it>members as well as members of other groups such as the IS<it>5 </it>family or ISNCY (IS Not Classified Yet). Some elements even lacked the above mentioned distinguishing protein motifs. It was therefore necessary to perform a systematic screen for related elements in order to gain a more rational view of the organization of the IS<it>4 </it>family.</p>
         <p>Here we report an extensive <it>in silico </it>search for IS elements related to family IS<it>4 </it>among more than 500 complete bacterial and archaeal genomes. A total of 227 putative intact IS<it>4</it>-related elements were collected and permitted a detailed update of the IS<it>4 </it>catalogue, together with the description of novel emerging IS families. This allowed the evaluation of their distribution and impact among major prokaryotic phyla. Finally, known transposition mechanisms could be discussed in light of novel primary sequence data.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Classification process</p>
            </st>
            <p>The present assignment of families and subgroups is primarily based on transposase and DNA end sequence data. Analysis of transposases was performed by multiple sequence alignments and clustering methods followed by dendrogram construction to set up clusters of related proteins (see Methods). Left and right DNA extremities flanking transposase genes were aligned (i) to each other to facilitate observation of terminal inverted repeats (TIRs) and (ii) with TIRs from other ISs to detect DNA extremity conservations. Together, these approaches split the initial IS set into ten groups. In each of them, the percentage of residue identity among transposase pairs often varied between 20 % and 50 %, highlighting the magnitude of divergence occurring among elements of a same IS group. The length of TIRs ranged from 10 to 40 bp and many of them were imperfect. Comparison of TIRs from different elements of a given IS group uncovered unique and conserved signatures in each of them, illustrating the relationship between the transposase sequence and the IS terminal repeats (Figure <figr fid="F1">1</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Transposases <it>vs</it>. IRs and DRs</p>
               </caption>
               <text>
                  <p><b>Transposases <it>vs</it>. IRs and DRs</b>. Comparative overview of IS sequence features. <b>A. </b>Dendrogram representing an alignment of 227 transposases. The first common node of each family was pointed out on the left tree side. Subgroups of family IS<it>4 </it>are mentioned on the right tree side. Individual IS names were removed for clarity. <b>B. </b>Consensus sequences (5' to 3') of left (upper line) and right (lower line) DNA extremities of distinct subgroups/families. '+' symbols are used when the highest conservation level is shared by more than one residue. Percentages of nucleotide conservation at all positions are indicated by black bars. Decimal nucleotide numbering is marked by dotted vertical lines. Full alignments that generated each consensus can be found in Additional files <supplr sid="S1">1</supplr> to <supplr sid="S10">10</supplr>. Note that in the case of family IS<it>701</it>, the exposed consensus does not represent the entire family. For further details, see Additional file <supplr sid="S8">8</supplr>. IRL, left TIR; IRR, right TIR. <b>C. </b>Target duplication length range in bps.</p>
               </text>
               <graphic file="1471-2148-8-18-1"/>
            </fig>
            <p>To validate this clustering, established groups were analysed independently for conservation of specific transposase domains as well as length and specificity of target site duplications. Analysis of transposase DDE catalytic regions (N2, N3 and C1) revealed a perfect conservation of the DDE motif (Figure <figr fid="F2">2</figr>), while the size and sequence of the spacers that separate these catalytic residues differed among the ten groups. The YREK motif was partially lacking in three groups, where either its tyrosine, arginine or lysine residue was not conserved. According to the definition of IS families, IS groups sharing the same catalytic site structure were assigned together. Seven clusters displayed the complete YREK motif; and were named subgroups IS<it>231</it>, IS<it>H8</it>, IS<it>4Sa</it>, IS<it>4</it>, IS<it>Pepr1</it>, IS<it>10 </it>and IS<it>50</it>. The three remaining groups all displayed a distinct variation of the YREK motif and were therefore assigned to different and new families, namely IS<it>701</it>, IS<it>H3 </it>and IS<it>1634</it>, referred as emerging families. Finally, while IS<it>4 </it>family members generated about 10 bp target duplications, i.e. the approximate length of a complete DNA helix turn, those from emerging families displayed direct repeats (DRs) of around five bps, which corresponds approximately to half a DNA helix turn (Figures <figr fid="F1">1</figr> &amp;<figr fid="F3">3</figr>).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>DDE and YREK motifs of IS<it>4 </it>subgroups and emerging families</p>
               </caption>
               <text>
                  <p><b>DDE and YREK motifs of IS<it>4 </it>subgroups and emerging families</b>. Comparative overview of conserved transposase regions among IS<it>4 </it>subgroups and emerging families. Each line represents a part of the amino acid consensus obtained from multiple alignments of members belonging to the displayed IS groups. Numbers in brackets correspond to the mean amino acid spacer (accompanied by standard deviation) between the two aspartate residues or the aspartate and glutamate residues of the DDE motifs among transposases form a given group. The conserved transposase regions N2, N3 and C1 are mentioned on top of alignment. Symbols and colors are used as depicted in keys.</p>
               </text>
               <graphic file="1471-2148-8-18-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Target sites : duplications and conservations</p>
               </caption>
               <text>
                  <p><b>Target sites : duplications and conservations</b>. Consensus sequences of <it>in silico </it>reconstructed and aligned target sites (5' to 3') typical for each subgroup or family. Only ISs found at least at five different genomic locations and flanked by DRs were considered, as well as elements described previously in literature. Each box represents one base position. Target sites are divided in three parts. The central sequence, which is duplicated upon insertion, is flanked by one upstream (left) and one downstream (right) target arm. Ten bps of each arm are shown. The number of target sites considered for each element is indicated. If more than one element displayed similar insertion specificity, their insertion sites were combined into a single line and their names listed above it. Gray boxes inside a duplication consensus indicate that DRs of variable length can be found for the given element. W, A or T; S, G or C; R, A or G; Y, T or C. * 'IS<it>231</it>' stands for following elements: IS<it>231A</it>, <it>C</it>, <it>F</it>, <it>M</it>, <it>S</it>, <it>T</it>, <it>U</it>, <it>Y</it>, IS<it>Bce4</it>, <it>5</it>, <it>6</it>, <it>11</it>, <it>12 </it>and IS<it>Bth5</it>. ** For the 6000 sites, see references [75&#8211;77].</p>
               </text>
               <graphic file="1471-2148-8-18-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Review of family IS<it>4</it></p>
            </st>
            <p>Family IS<it>4 </it>included 153 distinct intact elements. The main hallmarks were the presence of D(60~110)D(100~150)E and Y(2)R(3)E(6)K motifs, single <it>orf </it>encoded transposases and target site duplication lengths corresponding approximately to one DNA helix turn. Considerable diversity was observed at the level of transposase regions (sequence and length) outside catalytic residues, DNA end signatures and target site specificity. The following description of established subgroups will summarize this diversity.</p>
            <p><b>Subgroup IS<it>231 </it></b>was previously established (reviewed in reference <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>) and was the most coherent one since many transposase pairs displayed sequence identity percentages above 50 (see also Figure <figr fid="F1">1A</figr>). Likewise, their DNA ends are among the most conserved. A clear relationship was observed between transposase and extremity conservation (Additional file <supplr sid="S1">1</supplr>). This is particularly true for left extremities (conventionally upstream of the transposase gene) where transposase-related conservation extended beyond the TIRs. The IS<it>231 </it>subgroup further distinguished itself by the fact that its members occurred almost exclusively in genomes of phylogenetically close bacteria composing the <it>B. cereus sensu lato </it>group (i.e. <it>B. cereus sensu stricto</it>, <it>B. thuringiensis </it>and <it>B. anthracis</it>, see Additional file <supplr sid="S1">1</supplr> and Figure <figr fid="F4">4</figr>). As previously shown, an important hallmark is the large size range of its elements (Table <tblr tid="T1">1</tblr>). A significant fraction of them displayed additional DNA between the left TIR and the transposase gene <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>, which is still unusual among ISs. In this study, novel putative passenger genes were found in these extra sequences coding for resistance, virulence or metabolism determinants as well as unknown <it>orf</it>s (results not shown). This modular aspect has not yet been observed in other IS families or IS<it>4 </it>subgroups, which may be due to the fact that, so far, no systematic and thorough searches for supplementary IS DNA were carried out in other families.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Subgroup IS<it>231</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>231</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
               </text>
               <file name="1471-2148-8-18-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Prokaryote distribution of IS<it>4 </it>subgroups and emerging families</p>
               </caption>
               <text>
                  <p><b>Prokaryote distribution of IS<it>4 </it>subgroups and emerging families</b>. <b>A. </b>Three dimensional histogram of the number of distinct elements belonging to IS<it>4 </it>subgroups and emerging families, as they can be found among major prokaryotic clades. Each IS subgroup/family is represented by a different color. Iso-forms (which we defined as elements that show a divergence of less than 2% in the amino acid sequence of their potential proteins) were not included. The line 'Others' stands for Aquificae, Chlamydiae/Verrucomicrobia, Chloroflexi, Chrysiogenetes, Dictyoglomi, Fusobacteria, Nitrospirae, Spirochetes, Thermodesulfobacteria and Thermotogae. For interpretation, see main text. <b>B. </b>To avoid misinterpretation triggered by unequal sequencing efforts between different microbial groups, the number of genome projects, as of 1<sup>st </sup>April 2007, is depicted by histogram.</p>
               </text>
               <graphic file="1471-2148-8-18-4"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Major features of IS<it>4 </it>subgroups and newly defined families</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Families</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Sub-groups</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Typical size range (bp)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Ends<sup><it>a</it></sup></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>YREK<sup><it>b</it></sup></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Direct repeat length (bp)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IS<it>4</it></p>
                     </c>
                     <c ca="left">
                        <p>IS<it>231</it></p>
                     </c>
                     <c ca="center">
                        <p>1450 &#8211; 5400</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>CAT-------AA--TAA---</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>10 &#8211; 12</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IS<it>H8</it></p>
                     </c>
                     <c ca="center">
                        <p>1400 &#8211; 1800</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>CAT-----------T-----</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IS<it>4Sa</it></p>
                     </c>
                     <c ca="center">
                        <p>1150 &#8211; 1750</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>CA------T-----------</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>8 &#8211; 10</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IS<it>4</it></p>
                     </c>
                     <c ca="center">
                        <p>1400 &#8211; 1650</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>-AATR--------WTW----</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>10 &#8211; 13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IS<it>10</it></p>
                     </c>
                     <c ca="center">
                        <p>1200 &#8211; 1350</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>CT------------------</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IS<it>50</it></p>
                     </c>
                     <c ca="center">
                        <p>1350 &#8211; 1550</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>CW------Y---CA-A--W-</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>8 &#8211; 9</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>IS<it>Pepr1</it></p>
                     </c>
                     <c ca="center">
                        <p>1500 &#8211; 1600</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>YT-AA-YTT---A-------</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>Y</p>
                     </c>
                     <c ca="center">
                        <p>7 &#8211; 8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IS<it>701</it></p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>1400 &#8211; 1550</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>---YACT-Y--YR-------</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IS<it>H3</it></p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>1225 &#8211; 1500</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>CRGT----------------</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>4 &#8211; 5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>IS<it>1634</it></p>
                     </c>
                     <c ca="left">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>1500 &#8211; 2000</p>
                     </c>
                     <c ca="left">
                        <p>
                           <monospace>C-----YT------------</monospace>
                        </p>
                     </c>
                     <c ca="center">
                        <p>N</p>
                     </c>
                     <c ca="center">
                        <p>5 &#8211; 6</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a</sup>Conserved terminal based pairs. Capital letters refer to mostly conserved bases. W, A or T; R, G or A; Y, T or C.</p>
                  <p><sup>b</sup>Presence (Y) or absence (N) of YREK motif.</p>
                  <p>The new organization of family IS<it>4 </it>is shown together with emerging families. The table summarizes DNA characteristics with transposase motifs and target site duplication lengths. The two major differences between IS<it>4 </it>and emerging families are presence or absence of the YREK motif and dissimilar lengths of DRs.</p>
               </tblfn>
            </tbl>
            <p><b>Subgroup IS<it>H8 </it></b>was initially included into subgroup IS<it>4Sa </it><abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Yet, our results showed a closer proximity to subgroup IS<it>231</it>; not only at the transposase level, but also due to DNA extremity conservations (both displayed conserved 5'-CAT ends, Figure <figr fid="F1">1B</figr> and Additional file <supplr sid="S2">2</supplr>). IS<it>H8 </it>elements were initially discovered in archaea but recent genome projects uncovered elements of this subgroup in proteobacteria. Examination of IS<it>H8 </it>insertions did not reveal apparent target site specificity, contrary to the one observed for IS<it>231 </it>elements (Figure <figr fid="F3">3</figr>).</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Subgroup IS<it>H8</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>H8</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
               </text>
               <file name="1471-2148-8-18-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <sec>
               <st>
                  <p>
                     <b>Subgroups IS<it>4Sa </it>and IS<it>4</it></b>
                  </p>
               </st>
               <p>This study provided the first thorough description of subgroup IS<it>4Sa </it>that was originally mentioned in reference <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Together with IS<it>4Sa</it>-like elements appeared a closely related subgroup including element IS<it>4</it>, which provided the name for this second ensemble. Subgroups IS<it>4Sa </it>and IS<it>4 </it>were relatively close at the protein level and the major argument for splitting them in two was a different organization of TIRs (Figure <figr fid="F1">1B</figr> and Additional files <supplr sid="S3">3</supplr> &amp;<supplr sid=" S4">4</supplr>). Extremities in subgroups IS<it>4Sa </it>and IS<it>4 </it>displayed relaxed conservation of the third and first nucleotide, respectively, which is rather new but not unique in family IS<it>4 </it>(see below). Some elements of subgroup IS<it>4 </it>displayed spectacular apparent insertion specificity in left and right target arms, a conservation that was also observed for IS<it>Rm22 </it>from subgroup IS<it>4Sa </it>(Figure <figr fid="F3">3</figr>). Both subgroups were quite diverse and broadly distributed in bacteria. Rare occurrences were recorded in archaea (IS<it>Fac10</it>) and algae viruses (ISv<it>EsV1_1</it>, see Figure <figr fid="F4">4</figr> and Additional files <supplr sid="S3">3</supplr> and <supplr sid="S4">4</supplr>).</p>
               <suppl id="S3">
                  <title>
                     <p>Additional file 3</p>
                  </title>
                  <text>
                     <p>Subgroup IS<it>4Sa</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>4Sa</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
                  </text>
                  <file name="1471-2148-8-18-S3.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <suppl id="S4">
                  <title>
                     <p>Additional file 4</p>
                  </title>
                  <text>
                     <p>Subgroup IS<it>4</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>4</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
                  </text>
                  <file name="1471-2148-8-18-S4.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <p><b>Subgroups IS<it>10 </it>and IS<it>50 </it></b>were the only ones that did not cluster together with subgroups IS<it>231</it>, IS<it>H8</it>, IS<it>4Sa</it>, IS<it>4 </it>and IS<it>Pepr1 </it>during Tribe-MCL analysis (results not shown), indicating more distant relationships with these subgroups. This was also observed at the level of DNA ends since they systematically displayed 5'-NT extremities instead of 5'-NA. Yet, they displayed key residue conservation (Figure <figr fid="F2">2</figr>) and target site duplication lengths comparable to typical IS<it>4 </it>elements (Figures <figr fid="F1">1</figr> &amp;<figr fid="F3">3</figr>). So far, the majority of both subgroups occurred among proteobacteria.</p>
               <p><b>Subgroup IS<it>Pepr1 </it></b>was also newly established here and is almost exclusively composed of novel elements. Although it is composed of a limited repertoire, it is already forming a consistent subgroup with apparent insertion preference for AT-rich sequences (Figure <figr fid="F3">3</figr>) and, like elements of subgroups IS<it>10 </it>and IS<it>50</it>, it preferentially displays 5'-CT ends (Additional files <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr> and <supplr sid="S7">7</supplr>), in contrast to 5'-CA extremities encountered in the other elements from family IS<it>4</it>.</p>
               <suppl id="S5">
                  <title>
                     <p>Additional file 5</p>
                  </title>
                  <text>
                     <p>Subgroup IS<it>10</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>10</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
                  </text>
                  <file name="1471-2148-8-18-S5.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <suppl id="S6">
                  <title>
                     <p>Additional file 6</p>
                  </title>
                  <text>
                     <p>Subgroup IS<it>50</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>50</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
                  </text>
                  <file name="1471-2148-8-18-S6.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
               <suppl id="S7">
                  <title>
                     <p>Additional file 7</p>
                  </title>
                  <text>
                     <p>Subgroup IS<it>Pepr1</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from subgroup IS<it>Pepr1</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
                  </text>
                  <file name="1471-2148-8-18-S7.pdf">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Emerging families</p>
            </st>
            <p>Members of <b>Family IS<it>701 </it></b>were already considered distantly related to IS<it>4 </it>in reference <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> and this was confirmed here. Only four of the 27 considered IS<it>701 </it>elements displayed the tyrosine of the YREK motif (results not shown, see Figure <figr fid="F2">2</figr>); and almost all elements displayed a highly conserved target site duplication of exactly four bps (results not shown, see Figure <figr fid="F3">3</figr>). The diversity emanating from IS<it>701 </it>transposases and TIRs allowed identification of three distinct clusters (Additional file <supplr sid="S8">8</supplr>), which announced a possible division into subgroups. As indicated by their prokaryotic distribution, family IS<it>701 </it>seemed to be rather widespread (Figure <figr fid="F4">4</figr>).</p>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p>Family IS<it>701</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from family IS<it>701</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Three distinct alignments, corresponding to three different DNA end signatures, are shown in this case. Names of corresponding elements are listed in the same order as in A. Blue color scheme represents percentage of nucleotide identity per column in each alignment, and is represented by black bars for the upper alignment only. Minimal (black line) and maximal (dashed line) extent of TIRs is given in each case. IRL, left TIR; IRR, right TIR.</p>
               </text>
               <file name="1471-2148-8-18-S8.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p><b>Family IS<it>H3 </it></b>was a small group so far restricted to archaea (Additional file <supplr sid="S9">9</supplr>). Half of their transposases lacked the lysine residue of the YREK motif while all (except IS<it>Fac10</it>) displayed a Y-(2)-R-(3)-E-(3)-(R) motif. DRs flanking IS<it>H3</it>-like insertions are typically five bps long and generally flanked by A at one end and T at the other end.</p>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p>Family IS<it>H3</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from family IS<it>H3</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
               </text>
               <file name="1471-2148-8-18-S9.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p><b>Family IS<it>1634 </it></b>was initially named IS<it>1549 </it><abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. This new designation should prevent confusion with an emerging group called IS<it>1595 </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Transposases were among the largest due to relatively long N3-C1 spacers, which were sometimes twice as long as those from family IS<it>4</it>. Only five out of 32 displayed the arginine residue of the YREK motif (results not shown, see Figure <figr fid="F2">2</figr>). This large group could be divided in (three) distinct clusters according to transposases (Additional file <supplr sid="S10">10</supplr>). Target site duplications were five to six bp AT-rich tracts while DNA ends were only poorly conserved. They were remarkably diverse in archaea and relatively widespread in bacteria (Figure <figr fid="F4">4</figr>).</p>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p>Family IS<it>1634</it>. <b>A. </b>Dendrogram displaying relative distances of transposases from family IS<it>1634</it>. Each tree leave indicates the name of the associated element, followed by the host organism and prokaryotic phylum in which the IS was found originally. For a complete description of individual elements please refer to the ISfinder database <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. <b>B. </b>and <b>C. </b>Alignment of left and right DNA extremities, respectively. Names of corresponding elements are listed in the same order as in A. The blue color scheme represents the percentage of nucleotide identity per column as displayed by black bars. The DNA extremity consensus used in Figure <figr fid="F1">1</figr> is shown in bottom, together with minimal (black line) and maximal (dashed line) extent of TIRs. IRL, left TIR; IRR, right TIR.</p>
               </text>
               <file name="1471-2148-8-18-S10.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Distribution of copy numbers</p>
            </st>
            <p>Members of the IS<it>4 </it>family were found in 92 out of 540 (~17 %) complete genomes representing 65 prokaryotic species. Families IS<it>701</it>, IS<it>H3 </it>and IS<it>1634 </it>displayed lower incidences, being present in ~3 %, ~0.7 % and ~2 % of available genomes, respectively. 172 of these genomes included plasmids of which 22 harbored elements of these families. Figure <figr fid="F5">5</figr> displays the distribution of IS genomic copy numbers as it was found on chromosomes and plasmids. The distribution of family IS<it>4 </it>uncovered a clear preference for single genomic copies, followed by a preference for two, three and six to eight copies per genome. The genomes of <it>Photobacterium profundum </it>SS9, <it>Mycoplasma mycoides </it>subsp. <it>mycoides </it>SC strain PG1, <it>Sulfolobus solfataricus </it>P2 and the two sequenced strains of <it>Xanthomonas oryzae </it>pv. <it>oryzae </it>displayed the highest amounts of elements from families IS<it>4</it>, IS<it>1634</it>, IS<it>H3 </it>and IS<it>701</it>, respectively (Figure <figr fid="F5">5</figr>). The most IS-rich extrachromosomal replicons were megaplasmids pNRC100 (191 kb) and pNRC200 (365 kb) from <it>Halobacterium sp</it>. NRC-1, which harbored together 20 copies of IS<it>4 </it>family elements and 18 copies of family IS<it>H3 </it>elements.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Genomic IS copy number</p>
               </caption>
               <text>
                  <p><b>Genomic IS copy number</b>. Genomic copy numbers of four families as they were found on chromosomes and plasmids. Families IS<it>4</it>, IS<it>1634</it>, IS<it>H3 </it>and IS<it>701 </it>are represented in red, blue, green and purple, respectively. The number of copies located on chromosomes is represented by cylinders while those located on plasmids is shown as cones. The height of each cylinder/cone indicates the number of chromosome/plasmid displaying the corresponding number of IS family members. No distinction was made when different elements of the same IS family occurred in the same genome. The histograms include intact elements, disrupted full length copies and large partial fragments displaying more than 95 % DNA sequence identity. Genomes without any copy of the aforementioned families were not included.</p>
               </text>
               <graphic file="1471-2148-8-18-5"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Distribution of IS subgroups and families among prokaryotic phyla and viruses</p>
            </st>
            <p>The distribution of individual elements among the different bacterial and archaeal phyla is shown in Figure <figr fid="F4">4</figr>. As indicated by the histogram in Figure <figr fid="F4">4B</figr>, current sequencing projects focus unequally on different prokaryotic phyla <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The distribution presented in Figure <figr fid="F4">4A</figr> partially reflected this sequencing bias, which is why most gathered elements were found in proteobacteria. Globally, families IS<it>4</it>, IS<it>701</it>, IS<it>H3 </it>and IS<it>1634 </it>were detected in eight, seven, two and seven phyla, respectively.</p>
            <p>Figure <figr fid="F4">4A</figr> shows that the distribution of individual IS families or subgroups as defined in this study could either be broad (as for subgroups IS<it>4</it>, IS<it>4Sa </it>and family IS<it>701</it>) or concentrated in a particular phylum (as for subgroup IS<it>231 </it>and family IS<it>H3</it>). Interestingly, two copies of an element belonging to subgroup IS<it>4 </it>were found in the genome of the brown algae virus, <it>Ectocarpus siliculosus virus 1 </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. So far, no IS<it>4</it>-related element was found in eukaryotes.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <sec>
            <st>
               <p>Impact of IS<it>4 </it>and related families on prokaryotic genomes</p>
            </st>
            <p>The availability of a large number of completely sequenced genomes on one hand, and the extensive manual characterization of an IS family on the other hand, allowed a direct evaluation of global presence of its putatively functional members. Family IS<it>4 </it>displayed a sporadic distribution among 17 % of explored genomes. Moreover, the behavior of these ISs was assessed by determining their genomic occurrence. This showed that numerous elements exhibited single or low copy numbers, which was in line with the assumption that transposable elements need to compromise between transposition for self maintenance and limited insertions to keep host genome integrity <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. This distribution also displayed a certain, less pronounced, prevalence for copy numbers between 6 and 8, although the biological relevance of this observation remains unclear.</p>
            <p>While the global distribution of IS elements was shown to be rather sporadic, important expansion events were observed in a few pathogens as well as in some free-living extremophiles. One of these pathogens is a member of the facultative intracellular <it>Mycoplasma</it>, of which eleven genomes are available. 65 intact copies of family IS<it>1634 </it>elements were found together with 17 intact copies of the IS<it>3 </it>family in <it>M. mycoides </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The presence of family IS<it>1634 </it>was also detected in <it>M. synoviae </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, <it>M. agalactiae </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="CU179680">CU179680</ext-link>] and the three strains of <it>M. hyopneumoniae </it><abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp>. Only <it>M. genitalium </it><abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, <it>M. pneumoniae </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and <it>M. mobile </it><abbrgrp><abbr bid="B35">35</abbr></abbrgrp> were devoid of ISs. Interestingly, the <it>Mycoplasma </it>species lacking ISs were also those displaying the smallest genomes (between 0,58 and 0,82 Mb), while the IS-rich <it>M. mycoides </it>had a relatively large genome (1,2 Mbs). A positive correlation between the total number of genes and the amount of mobile DNA in a genome has been reported for other facultative intracellular bacteria <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, and a recent study has extended this observation to other prokaryotes <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. However, this relationship must be seen as a trend with notable exceptions (e.g. the large genome of <it>M. penetrans </it>&#8211; 1,36 Mbs &#8211; has only 21 transposase genes <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>).</p>
            <p>Family IS<it>701 </it>was represented by 70 and 72 elements in the genomes of two sequenced pathovars of the rice pathogen <it>X. oryzae </it><abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. This family was completely absent in phylogenetically related species with available genome sequences, including <it>X. campestris </it>(black rot in crucifers) and <it>X. axonopodis </it>(citrus canker) <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Additionally, a moderate presence of family IS<it>4 </it>was observed in all but one of the six <it>Xanthomonas </it>genomes (<it>X. campestris </it>pv. <it>vesicatoria</it>), together with other, more expanded families. So far, all sequenced xanthomonads are IS-rich, which is supposed to provide the necessary flexibility for adaptation to different plant hosts <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Interestingly, each strain is characterized by the expansion of one specific IS family <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. The IS<it>4 </it>family was also observed to be relatively expanded in sequenced <it>Shigella </it>spp. where other families, both with higher and lower expansion levels, were also detected <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>. These mobile elements are supposed to contribute to the emergence of variable epidemiological and pathological features among these phylogenetically close pathogens <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
            <p>These observations are reminiscent of IS expansions observed together with emergence of pathogenicity in other host associated pathogens, such as <it>Bordetella pertussis </it>(whooping cough <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>), <it>Yersinia pestis </it>(plague <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>) and <it>Francisella tularensis </it>(tularemia <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>).</p>
            <p>A different relationship was observed in the genus <it>Vibrio</it>. The expansion of family IS<it>4 </it>in the non-pathogenic <it>Photobacterium </it><abbrgrp><abbr bid="B51">51</abbr></abbrgrp> was striking (Figure <figr fid="F5">5</figr>) as compared to those observed in the pathogenic <it>Vibrionaceae</it>, such as <it>V. cholerae </it><abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, <it>V. parahaemolyticus </it><abbrgrp><abbr bid="B53">53</abbr></abbrgrp> and <it>V. vulnificus </it><abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, that displayed relatively modest IS contents. Therefore, ISs apparently have a quite different role in <it>Vibrio </it>evolution, where some IS-rich species are non-pathogenic piezophiles. Other important IS expansions among free-living extremophiles were observed in one out of three <it>Sulfolobus </it>spp. <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>, where family IS<it>H3 </it>represents a significant fraction of global IS content, as well as in <it>Halobacterium</it>, which harbors numerous copies of IS<it>4 </it>and IS<it>H3 </it>elements together with other families <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Lateral transfer and clade specificity</p>
            </st>
            <p>Analysis of the distribution of individual elements among prokaryotes and among established families and subgroups was also performed regardless of copy numbers (Figure <figr fid="F4">4</figr>). Prokaryotic IS hosts were from world-wide origins and covered a large range of lifestyles and habitats, from free-living environmental germs and extremophiles to host associated symbionts and pathogens. Evaluation of emergence on evolutionary timescales, as well as estimation of horizontal gene transfer (HGT) extent prompted by this distribution remain both problematic because of low identity levels among transposases. However, many IS<it>4</it>-related elements were found on putative mobile DNA, including various plasmids (Figure <figr fid="F5">5</figr>), bacteriophages (results not shown) and one eukaryotic virus (Figure <figr fid="F4">4</figr>). Additionally, most IS<it>4</it>-related subgroups and emerging families were scattered over several phyla (Additional files <supplr sid="S1">1</supplr>, <supplr sid="S2">2</supplr>, <supplr sid="S3">3</supplr>, <supplr sid="S4">4</supplr>, <supplr sid="S5">5</supplr>, <supplr sid="S6">6</supplr>, <supplr sid="S7">7</supplr>, <supplr sid="S8">8</supplr>, <supplr sid="S9">9</supplr>, <supplr sid="S10">10</supplr>), suggesting an involvement of HGT in the dissemination of these elements.</p>
            <p>Reports on prokaryotic IS distribution have shown that IS families are not clade-specific and our data regarding family IS<it>4 </it>were in agreement with this assumption. However, the distribution of IS<it>4 </it>subgroups and emerging families showed that the number of elements of some IS groups was not systematically proportional to the extent of genome sequencing (Figure <figr fid="F4">4</figr>). Subgroup IS<it>231 </it>(family IS<it>4</it>) was almost exclusively restricted to Firmicutes and 26 of its 32 elements were found among bacteria of the <it>B. cereus s.l</it>. group. Likewise, family IS<it>1634</it>, and to a lesser extent family IS<it>H3 </it>and subgroup IS<it>H8</it>, were preferentially found among <it>Euryarchaeota</it>. These observations may partially be due to insufficient genome data, but they do not rule out the possibility of existing clade specificity at the level of IS subgroups. Preferential IS occurrences among certain prokaryotic phyla or genera can be the consequence of isolated niches reducing or limiting horizontal transfer of hosted ISs. In this case, a series of extremophilic archaea were almost exclusive hosts of subgroup IS<it>H8 </it>and family IS<it>H3</it>, their preference for harsh ecological niches may represent a frontier for HGT with other phyla or with bacteria. Another possible explanation can be based on IS-associated features. It is possible that the presence of some ISs cannot be tolerated by certain hosts. Uncontrollable transposition behaviors, lack of target site specificity, preferred insertions into vital genes or regulatory regions can limit the host compatibility of some ISs for which a viable equilibrium can be found only in rare 'IS &#8211; host' combinations.</p>
         </sec>
         <sec>
            <st>
               <p>Reaction mechanisms</p>
            </st>
            <p>The co-crystal structure of the IS<it>50R </it>transposase binding the ends of the Tn<it>5 </it>transposon generated function assignments for numerous residues of this protein <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. A comparative sequence analysis based on 19 transposases was performed and compared with a previous report on seven transposases from subgroup IS<it>50 </it>by Reznikoff <it>et al</it>. in 2004 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. This confirmed the previous function assignment of conserved residues since most identified domains were still conserved among this subgroup, as were a series of residues with unknown function (further results available in Additional file <supplr sid="S11">11</supplr>).</p>
            <suppl id="S11">
               <title>
                  <p>Additional file 11</p>
               </title>
               <text>
                  <p>Combining alignment data from subgroup IS<it>50 </it>with functional data from transposons Tn<it>5</it>. The black sequence is the consensus obtained after multiple alignments of 19 members of the IS<it>50 </it>subgroup with MCOFFEE. Interrupting dashes indicate alignment gaps. The conservation percentage is represented by black bars over each position. The dotted line stands for 70% residue conservation. Residues of the IS<it>50R </it>transposase with known functional data are shown together with their position coordinates. They are grouped and color-coded following the function they carry out in regulation or during the transposition mechanism. Aspartate and glutamate residues of the DDE motif are pointed out by white triangles among the conservation bars. The YREK motif is highlighted by white 'V'. The transpososome formation step (i) is divided into successive stages including initial <it>cis</it>-binding, dimerization, 3'-end &amp; 5'-end <it>trans </it>binding, <it>trans</it>-binding via the <it>&#946; </it>hairpin clamp and additional <it>trans </it>contacts. Target capture residues are displayed by (ii), while integration and transposase release are not shown. Residues annotated as intramolecular inhibition are supposed to inhibit dimerization of full length transposases. For further details about these mechanistic concepts, see references <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B25">25</abbr><abbr bid="B59">59</abbr></abbrgrp>. Each residue is linked by colored lines to the corresponding position in the alignment consensus. Their designation is colored and bold if the residue is conserved at least at 70%, gray if it is less or not conserved. Conserved positions with unknown function are pointed out by asterisks (*).</p>
               </text>
               <file name="1471-2148-8-18-S11.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The primary sequence data concerning the transposase motifs (Figure <figr fid="F2">2</figr>) and target site duplications (Figure <figr fid="F3">3</figr>) suggested divergent biochemical behaviors between IS<it>4 </it>transposases and those from the emerging families. The two 3'-OH transposon ends are known to attack the target helix at opposite sites in the case of IS<it>4</it>. For the emerging families, the length of target site duplications (5 bp) suggested a different target strand cleavage path: owing to the structure of B-DNA, this attack needs to come from the same helix side. It is worth noting that in the case of family IS<it>1634</it>, two elements were reported to display atypical target site duplication lengths. IS<it>1549 </it>and IS<it>1634</it>, from <it>M. smegmatis </it>and <it>M. mycoides</it>, respectively, have both been shown to produce long, variable-length DRs upon insertion <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. They had lengths between 8 and 514 bps, depending on the copy. So far, no mechanistic model has been proposed for the generation of this variable target duplication size.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>This study consisted in a thorough and systematic screen for IS<it>4</it>-related elements among available genome sequence data. It allowed a considerable improvement regarding the description and definition of family IS<it>4</it>, as well as the establishment of new IS families which were, until now, assimilated to the IS<it>4 </it>ensemble (Table <tblr tid="T1">1</tblr>). Thus, 22 distinct IS families are now officially established. Novel primary sequence descriptions were unraveled for reported families, which will allow easier identification of other related elements as they will be uncovered with ongoing genome sequencing projects. This work also established a link between genomic and functional data regarding reaction mechanisms, which underlined the importance of both approaches for a more complete understanding of transposition biology.</p>
         <p>The extent of genomic impact of single IS families showed that IS<it>4 </it>and its related families are far from being ubiquitous among prokaryotes. This limited IS distribution applies to most known IS families as reported by a recent automated survey of bacterial ISs <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Significant genome wide expansions were observed only in a few host-associated pathogens and certain free-living extremophiles, suggesting that particular ISs could have been, at least partially, implicated in the emergence or evolution of these particular lifestyles. Yet, the reasons explaining these sporadic IS demographic explosions remain to be uncovered.</p>
         <p>Our interpretation of these results is that the IS families described here, and probably ISs in general, represent an evolutionary tool available among several. In order to provide the necessary genome flexibility for adaptation to new or variable environments, evolution seems to select this tool in some cases. In contrast to a purely 'selfish' or 'parasitic' perception of mobile DNA, this view supports the idea that prokaryotes, and maybe life in general, may also make use of mobile DNA for its own benefits, rather than being constantly invaded by it in an uncontrollable manner.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Genomic <it>in silico </it>screen for IS<it>4</it>-like elements</p>
            </st>
            <p>This search covered more than 500 complete and partial bacterial genomes. Only apparently full-length transposases were retained to avoid inclusion of partially deleted and thus inactivated transposases which can accumulate mutations of functionally important residues.</p>
            <p>When we began our search for novel IS<it>4</it>-like elements, the ISfinder database contained about 70 ISs designated as 'member of family IS<it>4</it>'. First, representative elements (about twenty) covering most of the sequence diversity of this family were selected. Primary transposase sequence of each of these representatives was then used in a BLASTP search, either among microbial genomes only, or against all organisms. Since IS<it>4 </it>elements exhibit short TIRs and generate target site duplications upon insertion (DRs), the flanking DNA of resulting hits was checked for the presence of these repeats. This, together with DNA extremity comparison of various elements, allowed the determination of both ends of the collected elements. New ISs were checked for the existence of formerly registered iso-forms which we defined as elements with less than 2% divergence in the amino acid sequence of their putative transposases and/or less than 5% difference in their DNA sequences. ISs were submitted to the ISfinder database, which provided new names according to the current IS nomenclature <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Bioinformatic procedures</p>
            </st>
            <p>BLASTP searches were performed on the NCBI BLAST online interface <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> without low complexity filter and with otherwise default parameters. Each transposase sequence retrieved a series of protein hits which were possible transposase candidates. The DNA encoding these candidates was downloaded together with 1000 bps up- and downstream regions. These DNA sequences were then verified for the presence of TIRs flanking the transposase candidate genes using BLASTN and the PALINDROME algorithm of the wEMBOSS package at the Belgian EMBnet Node <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. If TIRs were present, flanking DRs were looked for by eye.</p>
            <p>The following multiple alignment algorithms were then evaluated with the resulting ISs for their performance to accurately align the catalytic D, D and E residues of transposases : Clustal W <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>, Dialign <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>, Parallel PRRN <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, Muscle <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, T-Coffee <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> and M-Coffee <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. Global alignment of the 227 transposases was made by merging existing M-Coffee alignments with Clustal W. The following order was applied where pre-existing alignments are represented by subgroup or family designations flanked by brackets and merging procedures by + symbols: (((IS<it>231</it>)+(IS<it>H8</it>)+(IS<it>4Sa</it>)+(IS<it>4</it>))+(IS<it>10</it>, IS<it>50</it>, IS<it>Pepr1</it>))+((IS<it>701</it>, IS<it>H3</it>)+(IS<it>1634</it>)). Examination of resulting alignments and subgroup specific dendrogram construction (UPGMA, BLOSUM62) were performed using the Jalview alignment editor <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. Dendrograms were drawn with TreeView <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>.</p>
            <p>In order to facilitate visualization of transposase clusters, the TRIBE-MCL clustering algorithm <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> was applied to the complete set of transposases with inflation option (-I) set to 1,2 and default values for other parameters. These settings corresponded to those used by the ISfinder <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and ACLAME <abbrgrp><abbr bid="B73">73</abbr></abbrgrp> databases.</p>
         </sec>
         <sec>
            <st>
               <p>Evaluation of IS impact on sequenced genomes</p>
            </st>
            <p>587 chromosomes and 363 plasmids, representing 540 individual completely sequenced bacterial and archaeal genomes, were screened for the presence of IS<it>4</it>, IS<it>701</it>, IS<it>H3 </it>and IS<it>1634 </it>family DNA. Therefore, the 950 molecules were used as input in BLASTN <abbrgrp><abbr bid="B74">74</abbr></abbrgrp> searches against a homemade database containing the 227 DNA sequences encompassing these IS families.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DDP carried out the genomic <it>in silico </it>screen for IS<it>4</it>-related elements, aligned DNA and protein sequences of gathered elements, collected insertion sites, proposed the present classification, submitted new elements to the ISfinder database and drafted the manuscript. PS conceived of the IS retrieval strategy, carried out the Tribe-MCL clustering, participated in analysis and interpretation of data and helped to draft the manuscript. JM participated in design of study and its coordination, participated in interpretation of data and critically revised the manuscript several times. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We like to thank Mick Chandler for critically revising this manuscript and for his assistance in finding cozy accommodations in Toulouse. We are also grateful to B&#233;n&#233;dicte Laverne for her contribution to the retrieval of useful bioinformatic software as well as towards Bernard Hallet for some fruitful and inspiring discussions. This work was supported by grants from the National Fund for Scientific Research (FNRS, Belgium) as well as the <it>Universit&#233; catholique de Louvain </it>(UCL).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Insertion Sequences Revisited</p>
            </title>
            <aug>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mobile DNA II</source>
            <publisher>Washington, D.C., ASMPress</publisher>
            <editor>Craig NL, Craigie R, Gellert M and Lambowitz AM</editor>
            <pubdate>2002</pubdate>
            <fpage>305</fpage>
            <lpage>366</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>IS231-MIC231 elements from <it>Bacillus cereus</it> sensu lato are modular</p>
            </title>
            <aug>
               <au>
                  <snm>De Palmenaer</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Vermeiren</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>53</volume>
            <fpage>457</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2004.04146.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15228527</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>MIC231, a naturally occurring mobile insertion cassette from <it>Bacillus cereus</it></p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Braathen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Leonard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>32</volume>
            <fpage>657</fpage>
            <lpage>668</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1999.01388.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10320586</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Lateral gene transfer and the nature of bacterial innovation</p>
            </title>
            <aug>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Groisman</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <fpage>299</fpage>
            <lpage>304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35012500</pubid>
                  <pubid idtype="pmpid" link="fulltext">10830951</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Active site sharing and subterminal hairpin recognition in a new class of DNA transposases</p>
            </title>
            <aug>
               <au>
                  <snm>Ronning</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Guynet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ton-Hoang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Perez</snm>
                  <fnm>ZN</fnm>
               </au>
               <au>
                  <snm>Ghirlando</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dyda</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2005</pubdate>
            <volume>20</volume>
            <fpage>143</fpage>
            <lpage>154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2005.07.026</pubid>
                  <pubid idtype="pmpid" link="fulltext">16209952</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Transposition of ISHp608, member of an unusual family of bacterial insertion sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Ton-Hoang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Guynet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ronning</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Cointin-Marty</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dyda</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2005</pubdate>
            <volume>24</volume>
            <fpage>3325</fpage>
            <lpage>3338</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1224677</pubid>
                  <pubid idtype="pmpid" link="fulltext">16163392</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600787</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The outs and ins of transposition: from mu to kangaroo</p>
            </title>
            <aug>
               <au>
                  <snm>Curcio</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Derbyshire</snm>
                  <fnm>KM</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>865</fpage>
            <lpage>877</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1241</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Structure of the bacteriophage Mu transposase core: a common structural motif for DNA transposition and retroviral integration</p>
            </title>
            <aug>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mizuuchi</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1995</pubdate>
            <volume>82</volume>
            <fpage>209</fpage>
            <lpage>220</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(95)90308-9</pubid>
                  <pubid idtype="pmpid">7628012</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Three-dimensional structure of the Tn5 synaptic complex transposition intermediate</p>
            </title>
            <aug>
               <au>
                  <snm>Davies</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Goryshin</snm>
                  <fnm>IY</fnm>
               </au>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Rayment</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>289</volume>
            <fpage>77</fpage>
            <lpage>85</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.289.5476.77</pubid>
                  <pubid idtype="pmpid" link="fulltext">10884228</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Integrating DNA: transposases and retroviral integrases</p>
            </title>
            <aug>
               <au>
                  <snm>Haren</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ton-Hoang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>53</volume>
            <fpage>245</fpage>
            <lpage>281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.53.1.245</pubid>
                  <pubid idtype="pmpid" link="fulltext">10547692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Residues critical for retroviral integrative recombination in a region that is highly conserved among retroviral/retrotransposon integrases and bacterial insertion sequence transposases</p>
            </title>
            <aug>
               <au>
                  <snm>Kulkosky</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Katz</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Mack</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Skalka</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1992</pubdate>
            <volume>12</volume>
            <fpage>2331</fpage>
            <lpage>2338</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">364405</pubid>
                  <pubid idtype="pmpid" link="fulltext">1314954</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Unity in transposition reactions</p>
            </title>
            <aug>
               <au>
                  <snm>Craig</snm>
                  <fnm>NL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>253</fpage>
            <lpage>254</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.270.5234.253</pubid>
                  <pubid idtype="pmpid" link="fulltext">7569973</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Insertion sequence diversity in archaea</p>
            </title>
            <aug>
               <au>
                  <snm>Filee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Siguier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2007</pubdate>
            <volume>71</volume>
            <fpage>121</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/MMBR.00031-06</pubid>
                  <pubid idtype="pmpid" link="fulltext">17347521</pubid>
                  <pubid idtype="pmcid">1847376</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Insertion sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>1998</pubdate>
            <volume>62</volume>
            <fpage>725</fpage>
            <lpage>774</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">98933</pubid>
                  <pubid idtype="pmpid" link="fulltext">9729608</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The ISfinder Database</p>
            </title>
            <url>http://www-is.biotoul.fr</url>
         </bibl>
         <bibl id="B16">
            <title>
               <p>ISfinder: the reference centre for bacterial insertion sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Siguier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Perochon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lestrade</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>D32</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347377</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381877</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj014</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>The IS4 family of insertion sequences: evidence for a conserved transposase motif</p>
            </title>
            <aug>
               <au>
                  <snm>Rezsohazy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hallet</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Delcour</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1993</pubdate>
            <volume>9</volume>
            <fpage>1283</fpage>
            <lpage>1295</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1993.tb01258.x</pubid>
                  <pubid idtype="pmpid">7934941</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>IS231A transposition: conservative versus replicative pathway</p>
            </title>
            <aug>
               <au>
                  <snm>Leonard</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Res Microbiol</source>
            <pubdate>1998</pubdate>
            <volume>149</volume>
            <fpage>549</fpage>
            <lpage>555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0923-2508(99)80002-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">9795992</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>IS231 and other <it>Bacillus thuringiensis</it> transposable elements: a review</p>
            </title>
            <aug>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rezs&#246;hazy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hallet</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Delcour</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>1994</pubdate>
            <volume>93</volume>
            <fpage>13</fpage>
            <lpage>26</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF01435236</pubid>
                  <pubid idtype="pmpid">7813910</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>IS231A from <it>Bacillus thuringiensis</it> is functional in <it>Escherichia coli</it>: transposition and insertion specificity</p>
            </title>
            <aug>
               <au>
                  <snm>Hallet</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rezs&#246;hazy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Delcour</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1991</pubdate>
            <volume>173</volume>
            <fpage>4526</fpage>
            <lpage>4529</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">208117</pubid>
                  <pubid idtype="pmpid" link="fulltext">1648561</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>IS231A insertion specificity: consensus sequence and DNA bending at the target site</p>
            </title>
            <aug>
               <au>
                  <snm>Hallet</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rezs&#246;hazy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mahillon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Delcour</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1994</pubdate>
            <volume>14</volume>
            <fpage>131</fpage>
            <lpage>139</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1994.tb01273.x</pubid>
                  <pubid idtype="pmpid">7830551</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Transposon Tn10</p>
            </title>
            <aug>
               <au>
                  <snm>Haniford</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Mobile DNA II</source>
            <publisher>Washington, D.C., ASM Press</publisher>
            <editor>Craig NL, Craigie R, Gellert M and Lambowitz AM</editor>
            <pubdate>2002</pubdate>
            <fpage>457</fpage>
            <lpage>483</lpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Transposon Tn10</p>
            </title>
            <aug>
               <au>
                  <snm>Kleckner</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mobile DNA</source>
            <publisher>Washington, D.C., American Society for Microbiology</publisher>
            <editor>Berg DE and Howe MM</editor>
            <pubdate>1989</pubdate>
            <fpage>227</fpage>
            <lpage>268</lpage>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Transposon Tn5</p>
            </title>
            <aug>
               <au>
                  <snm>Berg</snm>
                  <fnm>DE</fnm>
               </au>
            </aug>
            <source>Mobile DNA</source>
            <publisher>Washington, D.C., Amercian Society for Microbiology</publisher>
            <editor>Berg DE and Howe MM</editor>
            <pubdate>1989</pubdate>
            <fpage>185</fpage>
            <lpage>210</lpage>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Tn5 Transposition</p>
            </title>
            <aug>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>Mobile DNA II</source>
            <publisher>Washington, D.C., ASM Press</publisher>
            <editor>Craig NL, Craigie R, Gellert M and Lambowitz AM</editor>
            <pubdate>2002</pubdate>
            <fpage>403</fpage>
            <lpage>421</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Two-metal active site binding of a Tn5 transposase synaptic complex</p>
            </title>
            <aug>
               <au>
                  <snm>Lovell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goryshin</snm>
                  <fnm>IY</fnm>
               </au>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Rayment</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <fpage>278</fpage>
            <lpage>281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsb778</pubid>
                  <pubid idtype="pmpid" link="fulltext">11896402</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>National Center for Biotechnology Information (NCBI)</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/</url>
         </bibl>
         <bibl id="B28">
            <title>
               <p>I am what I eat and I eat what I am: acquisition of bacterial genes by giant viruses</p>
            </title>
            <aug>
               <au>
                  <snm>Filee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Siguier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>10</fpage>
            <lpage>15</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.11.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">17109990</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Regulation of transposition in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Nagy</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Chandler</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Res Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>155</volume>
            <fpage>387</fpage>
            <lpage>398</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.resmic.2004.01.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">15207871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The genome sequence of <it>Mycoplasma mycoides</it> subsp. <it>mycoides</it> SC type strain PG1T, the causative agent of contagious bovine pleuropneumonia (CBPP)</p>
            </title>
            <aug>
               <au>
                  <snm>Westberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Persson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Holmberg</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Goesmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lundeberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Johansson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Pettersson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Uhlen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>221</fpage>
            <lpage>227</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">327097</pubid>
                  <pubid idtype="pmpid" link="fulltext">14762060</pubid>
                  <pubid idtype="doi">10.1101/gr.1673304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Swine and poultry pathogens: the complete genome sequences of two strains of <it>Mycoplasma hyopneumoniae</it> and a strain of <it>Mycoplasma synoviae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Vasconcelos</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Ferreira</snm>
                  <fnm>HB</fnm>
               </au>
               <au>
                  <snm>Bizarro</snm>
                  <fnm>CV</fnm>
               </au>
               <au>
                  <snm>Bonatto</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Carvalho</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>LG</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alves-Filho</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Assuncao</snm>
                  <fnm>EN</fnm>
               </au>
               <au>
                  <snm>Azevedo</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Bogo</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Brigido</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Brocchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burity</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Camargo</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Camargo</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Carepo</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Carraro</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>de Mattos Cascardo</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Castro</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Cavalcanti</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chemale</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Collevatti</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Cunha</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Dallagiovanna</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dambros</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Dellagostin</snm>
                  <fnm>OA</fnm>
               </au>
               <au>
                  <snm>Falcao</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fantinatti-Garboggini</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Felipe</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Fiorentin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Franco</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Freitas</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Frias</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Grangeiro</snm>
                  <fnm>TB</fnm>
               </au>
               <au>
                  <snm>Grisard</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Guimaraes</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Hungria</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jardim</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Krieger</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Laurino</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Lima</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Lopes</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Loreto</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Madeira</snm>
                  <fnm>HM</fnm>
               </au>
               <au>
                  <snm>Manfio</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Maranhao</snm>
                  <fnm>AQ</fnm>
               </au>
               <au>
                  <snm>Martinkovics</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Medeiros</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Neiva</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ramalho-Neto</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Nicolas</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Oliveira</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Paixao</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Pedrosa</snm>
                  <fnm>FO</fnm>
               </au>
               <au>
                  <snm>Pena</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Pereira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pereira-Ferrari</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Piffer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Potrich</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Salim</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Santos</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Schmitt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schneider</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Schrank</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schrank</snm>
                  <fnm>IS</fnm>
               </au>
               <au>
                  <snm>Schuck</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Seuanez</snm>
                  <fnm>HN</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Souza</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Souza</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Staats</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Steffens</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Teixeira</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Urmenyi</snm>
                  <fnm>TP</fnm>
               </au>
               <au>
                  <snm>Vainstein</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Zuccherato</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Zaha</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2005</pubdate>
            <volume>187</volume>
            <fpage>5568</fpage>
            <lpage>5577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1196056</pubid>
                  <pubid idtype="pmpid" link="fulltext">16077101</pubid>
                  <pubid idtype="doi">10.1128/JB.187.16.5568-5577.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The genome sequence of <it>Mycoplasma hyopneumoniae</it> strain 232, the agent of swine mycoplasmosis</p>
            </title>
            <aug>
               <au>
                  <snm>Minion</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Lefkowitz</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Madsen</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Cleary</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Swartzell</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Mahairas</snm>
                  <fnm>GG</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <fpage>7123</fpage>
            <lpage>7133</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523201</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489423</pubid>
                  <pubid idtype="doi">10.1128/JB.186.21.7123-7133.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The minimal gene complement of <it>Mycoplasma genitalium</it></p>
            </title>
            <aug>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Gocayne</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Bult</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Kerlavage</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kelley</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Fritchman</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Weidman</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Small</snm>
                  <fnm>KV</fnm>
               </au>
               <au>
                  <snm>Sandusky</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fuhrmann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nguyen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Saudek</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Merrick</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Tomb</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Dougherty</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Bott</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>PC</fnm>
               </au>
               <au>
                  <snm>Lucier</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Hutchison</snm>
                  <fnm>CA</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1995</pubdate>
            <volume>270</volume>
            <fpage>397</fpage>
            <lpage>403</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.270.5235.397</pubid>
                  <pubid idtype="pmpid" link="fulltext">7569993</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Comparative analysis of the genomes of the bacteria <it>Mycoplasma pneumoniae</it> and <it>Mycoplasma genitalium</it></p>
            </title>
            <aug>
               <au>
                  <snm>Himmelreich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Plagens</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hilbert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reiner</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Herrmann</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>701</fpage>
            <lpage>712</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146517</pubid>
                  <pubid idtype="pmpid" link="fulltext">9016618</pubid>
                  <pubid idtype="doi">10.1093/nar/25.4.701</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The complete genome and proteome of <it>Mycoplasma mobile</it></p>
            </title>
            <aug>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>DeCaprio</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fisher</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Calvo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Elkins</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>FitzGerald</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Hafez</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kodira</snm>
                  <fnm>CD</fnm>
               </au>
               <au>
                  <snm>Major</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wilkinson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nicol</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Berg</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1447</fpage>
            <lpage>1461</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">509254</pubid>
                  <pubid idtype="pmpid" link="fulltext">15289470</pubid>
                  <pubid idtype="doi">10.1101/gr.2674004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Mobile DNA in obligate intracellular bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Bordenstein</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>Nat Rev Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>688</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1233</pubid>
                  <pubid idtype="pmpid" link="fulltext">16138097</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Causes of insertion sequences abundance in prokaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Touchon</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rocha</snm>
                  <fnm>EP</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <fpage>969</fpage>
            <lpage>981</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msm014</pubid>
                  <pubid idtype="pmpid" link="fulltext">17251179</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The complete genomic sequence of <it>Mycoplasma penetrans</it>, an intracellular bacterial pathogen in humans</p>
            </title>
            <aug>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ishikawa</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oshima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kenri</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Furuya</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yoshino</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Horino</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shiba</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>5293</fpage>
            <lpage>5300</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137978</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466555</pubid>
                  <pubid idtype="doi">10.1093/nar/gkf667</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The genome sequence of <it>Xanthomonas oryzae</it> pathovar <it>oryzae</it> KACC10331, the bacterial blight pathogen of rice</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>YJ</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Song</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>IC</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>UH</fnm>
               </au>
               <au>
                  <snm>Hahn</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Koo</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>KO</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Koh</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Seo</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Go</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>577</fpage>
            <lpage>586</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548351</pubid>
                  <pubid idtype="pmpid" link="fulltext">15673718</pubid>
                  <pubid idtype="doi">10.1093/nar/gki206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Genome sequence of <it>Xanthomonas oryzae</it> pv. <it>oryzae</it> suggests contribution of large numbers of effector genes and insertion sequences to its race diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Ochiai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Takeya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kaku</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Japan Agricultural Research Quarterly</source>
            <pubdate>2005</pubdate>
            <volume>39</volume>
            <fpage>275</fpage>
            <lpage>287</lpage>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Comparative and functional genomic analyses of the pathogenicity of phytopathogen <it>Xanthomonas campestris</it> pv. <it>campestris</it></p>
            </title>
            <aug>
               <au>
                  <snm>Qian</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Jia</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ren</snm>
                  <fnm>SX</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>YQ</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>JX</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Ying</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Hao</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Zeng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>WY</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rong</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Qiang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>757</fpage>
            <lpage>767</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142466</pubid>
                  <pubid idtype="pmpid" link="fulltext">15899963</pubid>
                  <pubid idtype="doi">10.1101/gr.3378705</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Comparison of the genomes of two <it>Xanthomonas </it>pathogens with differing host specificities</p>
            </title>
            <aug>
               <au>
                  <snm>da Silva</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Reinach</snm>
                  <fnm>FC</fnm>
               </au>
               <au>
                  <snm>Farah</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Furlan</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Quaggio</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Monteiro-Vitorello</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Van Sluys</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Almeida</snm>
                  <fnm>NF</fnm>
               </au>
               <au>
                  <snm>Alves</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>do Amaral</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Bertolini</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Camargo</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Camarotte</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cannavan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cardozo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chambergo</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ciapina</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Cicarelli</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Coutinho</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Cursino-Santos</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>El-Dorry</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Faria</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Ferreira</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Ferreira</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Formighieri</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Franco</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Greggio</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Gruber</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Katsuyama</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Kishi</snm>
                  <fnm>LT</fnm>
               </au>
               <au>
                  <snm>Leite</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Lemos</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Lemos</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Locali</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Machado</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Madeira</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Martinez-Rossi</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Martins</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Meidanis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Menck</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Miyaki</snm>
                  <fnm>CY</fnm>
               </au>
               <au>
                  <snm>Moon</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Moreira</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Novo</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Okura</snm>
                  <fnm>VK</fnm>
               </au>
               <au>
                  <snm>Oliveira</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Oliveira</snm>
                  <fnm>VR</fnm>
               </au>
               <au>
                  <snm>Pereira</snm>
                  <fnm>HA</fnm>
               </au>
               <au>
                  <snm>Rossi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sena</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>de Souza</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Spinola</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Takita</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Teixeira</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Tezza</snm>
                  <fnm>RI</fnm>
               </au>
               <au>
                  <snm>Trindade dos Santos</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Truffi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>FF</fnm>
               </au>
               <au>
                  <snm>Setubal</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Kitajima</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>417</volume>
            <fpage>459</fpage>
            <lpage>463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/417459a</pubid>
                  <pubid idtype="pmpid" link="fulltext">12024217</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Insights into genome plasticity and pathogenicity of the plant pathogenic bacterium <it>Xanthomonas campestris</it> pv. <it>vesicatoria </it>revealed by the complete genome sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Thieme</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Koebnik</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bekel</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Berger</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boch</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Buttner</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Caldana</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gaigalat</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Goesmann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kay</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kirchner</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Lanz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Linke</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>McHardy</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Mittenhuber</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Nies</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Niesbach-Klosgen</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Patschkowski</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ruckert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rupp</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Schneiker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Vorholter</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Puhler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bonas</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Bartels</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kaiser</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2005</pubdate>
            <volume>187</volume>
            <fpage>7254</fpage>
            <lpage>7266</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1272972</pubid>
                  <pubid idtype="pmpid" link="fulltext">16237009</pubid>
                  <pubid idtype="doi">10.1128/JB.187.21.7254-7266.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Complete genome sequence and comparative genomics of <it>Shigella flexneri</it> serotype 2a strain 2457T</p>
            </title>
            <aug>
               <au>
                  <snm>Wei</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Goldberg</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Burland</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Venkatesan</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Fournier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mayhew</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Plunkett</snm>
                  <fnm>G</fnm>
                  <suf>3rd</suf>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Darling</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mau</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Perna</snm>
                  <fnm>NT</fnm>
               </au>
               <au>
                  <snm>Payne</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Runyen-Janecky</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>Infect Immun</source>
            <pubdate>2003</pubdate>
            <volume>71</volume>
            <fpage>2775</fpage>
            <lpage>2786</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">153260</pubid>
                  <pubid idtype="pmpid" link="fulltext">12704152</pubid>
                  <pubid idtype="doi">10.1128/IAI.71.5.2775-2786.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Genome dynamics and diversity of <it>Shigella </it>species, the etiologic agents of bacillary dysentery</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Tang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xiong</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nie</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Qiang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>Q</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>6445</fpage>
            <lpage>6458</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1278947</pubid>
                  <pubid idtype="pmpid" link="fulltext">16275786</pubid>
                  <pubid idtype="doi">10.1093/nar/gki954</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Complete genome sequence of <it>Shigella flexneri</it> 5b and comparison with <it>Shigella flexneri</it> 2a</p>
            </title>
            <aug>
               <au>
                  <snm>Nie</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xiong</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jin</snm>
                  <fnm>Q</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>173</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1550401</pubid>
                  <pubid idtype="pmpid" link="fulltext">16822325</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-7-173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Genome sequence of <it>Shigella flexneri</it> 2a: insights into pathogenicity through comparison with genomes of <it>Escherichia coli</it> K12 and O157</p>
            </title>
            <aug>
               <au>
                  <snm>Jin</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Yuan</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Qu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Xue</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kan</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ding</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Qiang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>4432</fpage>
            <lpage>4441</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137130</pubid>
                  <pubid idtype="pmpid" link="fulltext">12384590</pubid>
                  <pubid idtype="doi">10.1093/nar/gkf566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Comparative analysis of the genome sequences of <it>Bordetella pertussis</it>, <it>Bordetella parapertussis</it> and <it>Bordetella bronchiseptica</it></p>
            </title>
            <aug>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sebaihia</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Preston</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Thomson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Holden</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Bentley</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>KL</fnm>
               </au>
               <au>
                  <snm>Cerdeno-Tarraga</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Temple</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Achtman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Atkin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bason</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Cherevach</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Doggett</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Goble</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hamlin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hauser</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Leather</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Norberczak</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>O'Neil</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ormond</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rabbinowitsch</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rutter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sanders</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Seeger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Simmonds</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Skelton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Unwin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Maskell</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>35</volume>
            <fpage>32</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1227</pubid>
                  <pubid idtype="pmpid" link="fulltext">12910271</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Insights into the evolution of <it>Yersinia pestis</it> through whole-genome comparison with <it>Yersinia pseudotuberculosis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Chain</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Carniel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Larimer</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Lamerdin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stoutland</snm>
                  <fnm>PO</fnm>
               </au>
               <au>
                  <snm>Regala</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Georgescu</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Vergez</snm>
                  <fnm>LM</fnm>
               </au>
               <au>
                  <snm>Land</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Motin</snm>
                  <fnm>VL</fnm>
               </au>
               <au>
                  <snm>Brubaker</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Fowler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hinnebusch</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marceau</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Medigue</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Simonet</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chenal-Francisque</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Souza</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dacheux</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Elliott</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Derbise</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hauser</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Garcia</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>13826</fpage>
            <lpage>13831</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">518763</pubid>
                  <pubid idtype="pmpid" link="fulltext">15358858</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404012101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Comparison of <it>Francisella tularensis</it> genomes reveals evolutionary events associated with the emergence of human pathogenic strains</p>
            </title>
            <aug>
               <au>
                  <snm>Rohmer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fong</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Abmayr</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wasnick</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Larson Freeman</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Radey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Guina</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Svensson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hayden</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gallagher</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Manoil</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ernst</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Drees</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Buckley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Haugen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bovee</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gillett</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Guenthener</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shaffer</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gallis</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>D'Argenio</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Forsman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Olson</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Goodlett</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Kaul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Brittnacher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R102</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2007-8-6-r102</pubid>
                  <pubid idtype="pmpid" link="fulltext">17550600</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Life at depth: <it>Photobacterium profundum</it> genome sequence and expression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Vezzi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Campanaro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>D'Angelo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Simonato</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Vitulo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lauro</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Cestaro</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Malacrida</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Simionati</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Cannata</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Romualdi</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bartlett</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Valle</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <fpage>1459</fpage>
            <lpage>1461</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1103341</pubid>
                  <pubid idtype="pmpid" link="fulltext">15746425</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>DNA sequence of both chromosomes of the cholera pathogen <it>Vibrio cholerae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Read</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ermolaeva</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Vamathevan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bass</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dragoi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sellers</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fleishmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Colwell</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Mekalanos</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>406</volume>
            <fpage>477</fpage>
            <lpage>483</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35020000</pubid>
                  <pubid idtype="pmpid" link="fulltext">10952301</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Genome sequence of <it>Vibrio parahaemolyticus</it>: a pathogenic mechanism distinct from that of <it>V cholerae</it></p>
            </title>
            <aug>
               <au>
                  <snm>Makino</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Oshima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kurokawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yokoyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Uda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tagomori</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Iijima</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Najima</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nakano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamashita</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kubota</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kimura</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yasunaga</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Honda</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shinagawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Iida</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Lancet</source>
            <pubdate>2003</pubdate>
            <volume>361</volume>
            <fpage>743</fpage>
            <lpage>749</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0140-6736(03)12659-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12620739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Comparative genome analysis of <it>Vibrio vulnificus</it>, a marine pathogen</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>CY</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>YM</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>AB</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Hor</snm>
                  <fnm>LI</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2577</fpage>
            <lpage>2587</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403799</pubid>
                  <pubid idtype="pmpid" link="fulltext">14656965</pubid>
                  <pubid idtype="doi">10.1101/gr.1295503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>The genome of <it>Sulfolobus acidocaldariu</it>s, a model organism of the <it>Crenarchaeota</it></p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Brugger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Skovgaard</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Redder</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>She</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Torarinsson</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Greve</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Awayez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zibat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Klenk</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2005</pubdate>
            <volume>187</volume>
            <fpage>4992</fpage>
            <lpage>4999</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1169522</pubid>
                  <pubid idtype="pmpid" link="fulltext">15995215</pubid>
                  <pubid idtype="doi">10.1128/JB.187.14.4992-4999.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Complete genome sequence of an aerobic thermoacidophilic crenarchaeon, <it>Sulfolobus tokodaii</it> strain7</p>
            </title>
            <aug>
               <au>
                  <snm>Kawarabayasi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hino</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Jin-no</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sekine</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Baba</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ankai</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kosugi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hosoyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fukui</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nishijima</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Otsuka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Nakazawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Takamiya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yoshizawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kudoh</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yamazaki</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kushida</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oguchi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aoki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Masuda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yanagii</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishimura</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yamagishi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oshima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>DNA Res</source>
            <pubdate>2001</pubdate>
            <volume>8</volume>
            <fpage>123</fpage>
            <lpage>140</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/dnares/8.4.123</pubid>
                  <pubid idtype="pmpid" link="fulltext">11572479</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>The complete genome of the crenarchaeon <it>Sulfolobus solfataricus</it> P2</p>
            </title>
            <aug>
               <au>
                  <snm>She</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Confalonieri</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zivanovic</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Allard</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Awayez</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Chan-Weiher</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Clausen</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Curtis</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>De Moors</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Erauso</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fletcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Heikamp-de Jong</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jeffries</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Kozera</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Medina</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Thi-Ngoc</snm>
                  <fnm>HP</fnm>
               </au>
               <au>
                  <snm>Redder</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schenk</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Theriault</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tolstrup</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Charlebois</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Duguet</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Sensen</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Van der Oost</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>7835</fpage>
            <lpage>7840</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">35428</pubid>
                  <pubid idtype="pmpid" link="fulltext">11427726</pubid>
                  <pubid idtype="doi">10.1073/pnas.141222098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Genome sequence of Halobacterium species NRC-1</p>
            </title>
            <aug>
               <au>
                  <snm>Ng</snm>
                  <fnm>WV</fnm>
               </au>
               <au>
                  <snm>Kennedy</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Mahairas</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Berquist</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shukla</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Lasky</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Baliga</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Thorsson</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Sbrogna</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Swartzell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weir</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dahl</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Welti</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goo</snm>
                  <fnm>YA</fnm>
               </au>
               <au>
                  <snm>Leithauser</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Keller</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cruz</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Danson</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hough</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Maddocks</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Jablonski</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Krebs</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Angevine</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Dale</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Isenbarger</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Peck</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Pohlschroder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spudich</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Jung</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Alam</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Freitas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Daniels</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Dennis</snm>
                  <fnm>PP</fnm>
               </au>
               <au>
                  <snm>Omer</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Ebhardt</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>DasSarma</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>12176</fpage>
            <lpage>12181</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">17314</pubid>
                  <pubid idtype="pmpid" link="fulltext">11016950</pubid>
                  <pubid idtype="doi">10.1073/pnas.190337797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Comparative sequence analysis of IS50/Tn5 transposase</p>
            </title>
            <aug>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WS</fnm>
               </au>
               <au>
                  <snm>Bordenstein</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Apodaca</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <fpage>8240</fpage>
            <lpage>8247</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">532410</pubid>
                  <pubid idtype="pmpid" link="fulltext">15576772</pubid>
                  <pubid idtype="doi">10.1128/JB.186.24.8240-8247.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>IS1634, a novel insertion element creating long, variable-length direct repeats which is specific for <it>Mycoplasma mycoides</it> subsp. mycoides small-colony type</p>
            </title>
            <aug>
               <au>
                  <snm>Vilei</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Nicolet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Frey</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1999</pubdate>
            <volume>181</volume>
            <fpage>1319</fpage>
            <lpage>1323</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">93511</pubid>
                  <pubid idtype="pmpid" link="fulltext">9973360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>IS1549 from <it>Mycobacterium smegmatis</it> forms long direct repeats upon insertion</p>
            </title>
            <aug>
               <au>
                  <snm>Plikaytis</snm>
                  <fnm>BB</fnm>
               </au>
               <au>
                  <snm>Crawford</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Shinnick</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1998</pubdate>
            <volume>180</volume>
            <fpage>1037</fpage>
            <lpage>1043</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">106989</pubid>
                  <pubid idtype="pmpid" link="fulltext">9495740</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>A survey of bacterial insertion sequences using IScan</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bichsel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>5284</fpage>
            <lpage>5293</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2018620</pubid>
                  <pubid idtype="pmpid" link="fulltext">17686783</pubid>
                  <pubid idtype="doi">10.1093/nar/gkm597</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>The Belgian EMBnet node (BEN)</p>
            </title>
            <url>http://www.be.embnet.org/</url>
         </bibl>
         <bibl id="B64">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Morgenstern</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>211</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.3.211</pubid>
                  <pubid idtype="pmpid" link="fulltext">10222408</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>A weighting system and algorithm for aligning many phylogenetically related sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Gotoh</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1995</pubdate>
            <volume>11</volume>
            <fpage>543</fpage>
            <lpage>551</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid">8590178</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>MUSCLE: multiple sequence alignment with high accuracy and high throughput</p>
            </title>
            <aug>
               <au>
                  <snm>Edgar</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>1792</fpage>
            <lpage>1797</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">390337</pubid>
                  <pubid idtype="pmpid" link="fulltext">15034147</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>T-Coffee: A novel method for fast and accurate multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>M-Coffee: combining multiple sequence alignment methods with T-Coffee</p>
            </title>
            <aug>
               <au>
                  <snm>Wallace</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>O'Sullivan</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>1692</fpage>
            <lpage>1699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1410914</pubid>
                  <pubid idtype="pmpid" link="fulltext">16556910</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl091</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>The Jalview Java alignment editor</p>
            </title>
            <aug>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Barton</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>426</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg430</pubid>
                  <pubid idtype="pmpid" link="fulltext">14960472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>TreeView: an application to display phylogenetic trees on personal computers</p>
            </title>
            <aug>
               <au>
                  <snm>Page</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <fpage>357</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8902363</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>An efficient algorithm for large-scale detection of protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>1575</fpage>
            <lpage>1584</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">101833</pubid>
                  <pubid idtype="pmpid" link="fulltext">11917018</pubid>
                  <pubid idtype="doi">10.1093/nar/30.7.1575</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B73">
            <title>
               <p>ACLAME</p>
            </title>
            <url>http://aclame.ulb.ac.be/</url>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>Systematic mutagenesis of the <it>Escherichia coli</it> genome</p>
            </title>
            <aug>
               <au>
                  <snm>Kang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Durfee</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Qiu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Frisch</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Winterberg</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2004</pubdate>
            <volume>186</volume>
            <fpage>4921</fpage>
            <lpage>4930</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">451658</pubid>
                  <pubid idtype="pmpid" link="fulltext">15262929</pubid>
                  <pubid idtype="doi">10.1128/JB.186.15.4921-4930.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>Systematic sequencing of cDNA clones using the transposon Tn5</p>
            </title>
            <aug>
               <au>
                  <snm>Shevchenko</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bouffard</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Butterfield</snm>
                  <fnm>YS</fnm>
               </au>
               <au>
                  <snm>Blakesley</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Hartley</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Young</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Touchman</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>ED</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>2469</fpage>
            <lpage>2477</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">117195</pubid>
                  <pubid idtype="pmpid" link="fulltext">12034835</pubid>
                  <pubid idtype="doi">10.1093/nar/30.11.2469</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Tn5/IS50 target recognition</p>
            </title>
            <aug>
               <au>
                  <snm>Goryshin</snm>
                  <fnm>IY</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Kil</snm>
                  <fnm>YV</fnm>
               </au>
               <au>
                  <snm>Lanzov</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Reznikoff</snm>
                  <fnm>WS</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>10716</fpage>
            <lpage>10721</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">27961</pubid>
                  <pubid idtype="pmpid" link="fulltext">9724770</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.18.10716</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
