<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-261</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Insights and inferences about integron evolution from genomic data</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Nemergut</snm>
               <mi>R</mi>
               <fnm>Diana</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>nemergut@colorado.edu</email>
            </au>
            <au id="A2">
               <snm>Robeson</snm>
               <mi>S</mi>
               <fnm>Michael</fnm>
               <insr iid="I3"/>
               <email>Michael.Robeson@Colorado.edu</email>
            </au>
            <au id="A3">
               <snm>Kysela</snm>
               <mi>F</mi>
               <fnm>Robert</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>Robert.Kysela@Colorado.edu</email>
            </au>
            <au id="A4">
               <snm>Martin</snm>
               <mi>P</mi>
               <fnm>Andrew</fnm>
               <insr iid="I3"/>
               <email>Andrew.Martin-1@Colorado.edu</email>
            </au>
            <au id="A5">
               <snm>Schmidt</snm>
               <mi>K</mi>
               <fnm>Steven</fnm>
               <insr iid="I3"/>
               <email>Steve.Schmidt@Colorado.edu</email>
            </au>
            <au id="A6">
               <snm>Knight</snm>
               <fnm>Rob</fnm>
               <insr iid="I4"/>
               <email>rob@spot.colorado.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Arctic and Alpine Research, University of Colorado, Boulder, Colorado, USA</p>
            </ins>
            <ins id="I2">
               <p>Environmental Studies Program, University of Colorado, Boulder, Colorado, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Ecology and Evolutionary Biology, University of Colorado, Boulder, Colorado, USA</p>
            </ins>
            <ins id="I4">
               <p>Department of Chemistry and Biochemistry, University of Colorado, Boulder, Colorado, USA</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>261</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/261</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18513439</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-261</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>01</day>
               <month>2</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>31</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>31</day>
               <month>5</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Nemergut et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Integrons are mechanisms that facilitate horizontal gene transfer, allowing bacteria to integrate and express foreign DNA. These are important in the exchange of antibiotic resistance determinants, but can also transfer a diverse suite of genes unrelated to pathogenicity. Here, we provide a systematic analysis of the distribution and diversity of integron <it>intI </it>genes and integron-containing bacteria.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We found integrons in 103 different pathogenic and non-pathogenic bacteria, in six major phyla. Integrons were widely scattered, and their presence was not confined to specific clades within bacterial orders. Nearly 1/3 of the <it>intI </it>genes that we identified were pseudogenes, containing either an internal stop codon or a frameshift mutation that would render the protein product non-functional. Additionally, 20% of bacteria contained more than one integrase gene. dN/dS ratios revealed mutational hotspots in clades of <it>Vibrio </it>and <it>Shewanella intI </it>genes. Finally, we characterized the gene cassettes associated with integrons in <it>Methylobacillus flagellatus </it>KT and <it>Dechloromonas aromatica </it>RCB, and found a heavy metal efflux gene as well as genes involved in protein folding and stability.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our analysis suggests that the present distribution of integrons is due to multiple losses and gene transfer events. While, in some cases, the ability to integrate and excise foreign DNA may be selectively advantageous, the gain, loss, or rearrangment of gene cassettes could also be deleterious, selecting against functional integrases. Thus, such a high fraction of pseudogenes may suggest that the selective impact of integrons on genomes is variable, oscillating between beneficial and deleterious, possibly depending on environmental conditions.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Horizontal gene transfer (HGT) is effected through some combination of the activity of mobile gene elements and/or recipient cellular enzyme systems. In the most general terms, the process of horizontal gene transfer requires several, distinct steps <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. The mechanisms and controls over processes for physically transferring DNA into recipient cells (i.e., transformation, conjugation and transduction) have received significant attention in the literature. However, the ability of cells to replicate and express foreign DNA is also essential for the transferred genes to become prevalent in the population through natural selection <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Mobile gene elements can thus be important in promoting HGT because they can contain origins of replication (ORIs) and/or promoters to facilitate the replication and transcription of foreign DNA. ORIs and promoters vary widely both within and between different species of bacteria, and may be more likely to be recognized by the cellular machinery of closely related organisms than more distantly related species, limiting the phylogenetic distance over which certain mobile gene elements can be transferred. However, many cross-phyla [e.g., <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>] and even cross-domain [e.g., <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>] gene transfer events have been documented. Although broad host range plasmids <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and phages <abbrgrp><abbr bid="B6">6</abbr></abbrgrp> play important roles in these long distance transfers, questions remain about the potential role of other mobile elements.</p>
         <p>Integrons <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, are gene elements which may also play an important role in the transfer of genes between distantly related lineages. Integrons facilitate the integration <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, excision <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, and rearrangement <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> of mobile genes which contain <it>attC </it>(also referred to as 59-be) recombination sites <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B11">11</abbr></abbrgrp>, called "gene cassettes" <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Once integrated, the gene cassette is then expressed from the P<sub>c </sub>promoter Integrons catalyze the integration of foreign genes into a DNA molecule that is already recognized by the native replication machinery (chromosome or plasmid), and under the control of a promoter that allows gene expression in the host. Thus, this mechanism permits organisms to sample from the diversity of gene cassettes in their local environment without the need for host recognition of foreign promoters or ORIs, and may therefore facilitate gene transfer among highly divergent groups of bacteria.</p>
         <p>Integrons are classified according to the sequence of the IntI protein. The first integron discovered, the class 1 integron, is now known to be important in the dissemination of antibiotic resistance genes in both Gram-negative and Gram-positive bacteria <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. These integrons are typically found on plasmids or transposons, which catalyze their own mobility. Class 1 integrons have been found in association with dozens of different resistance genes [reviewed in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>], and have been found in environments ranging from hospitals to poultry litter <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Other types of integrons, including classes 2, 3, 9, and an unnamed class found on a <it>Vibrio salmonicida </it>plasmid, (GenBank accession number <ext-link ext-link-type="gen" ext-link-id="AJ277063">AJ277063</ext-link>) have been similarly found on mobile elements and in association with antibiotic resistance genes <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Collectively, these are sometimes called "mobile integrons" or "multi-resistance" integrons; however, because the genomic location and associated gene cassettes are not stable features of integrons, this nomenclature has been discouraged <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>. Indeed, integrons associated with antibiotic genes have been mobilized several times in the history of their evolution <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, leading to a major public health concern.</p>
         <p>Genome sequencing projects <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> have revealed a wide variety of <it>intI </it>genes from phylogenetically diverse bacteria (no reports of integrons in the Eucarya or Archaea have been made to date), and a recent study demonstrates that they are present in ~10% of sequenced genomes <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Boucher et al. <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> used BLAST searches and identified integrons in the genomes of organisms from the Spirochaetes, Cyanobacteria, Chlorobi, Planctomycetes and the &#947;, &#946;, &#948;, and &#949; Proteobacteria. Mazel <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> identified two major clades of <it>intI </it>genes, one arising from organisms living in soil/freshwater environments and the other found in marine bacteria. Cultivation-independent studies have revealed an enormous diversity of <it>intI </it>genes in a variety of environmental samples <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Likewise, a plethora of different types of gene cassettes, many having no obvious role in pathogenicity or resistance phenotypes, have been found associated with these integrase genes [e.g., <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>], and in environmental metagenomic samples <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B27">27</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. In some cases, such as within species of the genus <it>Vibrio</it>, integrase genes are found in association with over 100 gene cassettes <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, suggesting that these elements play a major role in the evolution of these organisms. Thus, it has been hypothesized that their wide distribution and in some cases major genome load may make these elements important in the evolution of a diverse, environmentally-relevant suite of bacteria.</p>
         <p>However, many fascinating questions remain about the nature of integrons. For example, why are these potentially adaptive elements found in some organisms but not in others? One major force in the evolution of integrons is thought to be mobility &#8211; through association with plasmids or transposons &#8211; and much evidence suggests that integrons themselves are transferred between bacterial lineages <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B25">25</abbr></abbrgrp>. However, there is support for a good deal of vertical inheritance for integrons as well <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B25">25</abbr></abbrgrp>. Here, we use genomic analyses to examine integron diversity and distribution. We also describe the types of gene cassettes found in association with two organisms, further emphasizing the diversity of genes that integrons can mobilize.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <sec>
            <st>
               <p>Which lineages contain integrons?</p>
            </st>
            <p>We used BLAST to search for <it>intI </it>genes in organisms whose whole genomes have been partially or completely sequenced, and to query sequences in the nr GenBank database. Although the promoter and <it>attI </it>recombination site are integral parts of both integron structure and function, we chose to broaden our definition and search for just the presence of integron integrase genes because IntI can also catalyze recombination events between the <it>attC </it>site and secondary sites <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp>. These events occur at low frequencies, but may be important for bacterial evolution because they result in the insertion of a gene cassette that is flanked by a single recombination site, significantly lowering the possibility of IntI-mediated excision <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Thus, we use the terms "integron integrase gene" "<it>intI</it>", and "integron" interchangeably.</p>
            <p>We identified a total of 103 different bacteria that contain integron integrase genes (the typically mobile integrons were excluded from this analysis but have been reviewed elsewhere <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>). We found <it>intI </it>genes in eighteen different bacterial orders within six divisions, including the Bacteroidetes/Chlorobi group, Chloroflexi, Cyanobacteria, Planctomycetes, Proteobacteria and Spirochaetes (Additional File <supplr sid="S1">1</supplr>), expanding on the diversity recovered from the most recent survey <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. These phylogenetically diverse organisms are found in a variety of both oxic and anoxic environments, and their metabolisms range from heterotrophy to photoautotrophy. The diversity of integron-containing organisms suggests that either these elements are ancient, playing significant roles in shaping microbial genomes over long timescales, or that the evolutionary advantage of being able to catalyze integron-mediated gene transfer has led to the more recent, rapid dispersal of integrons among disparate bacterial lineages.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Integron integrase genes used in this study. The accession numbers for the genes used for the phylogenetic analyses in this study.</p>
               </text>
               <file name="1471-2164-9-261-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>There are many biases associated with the selection of organisms for both laboratory work and for whole genome sequencing, so the lack of a particular type of organism or group of organisms in Additional File <supplr sid="S1">1</supplr> should not be interpreted as evidence for the lack of an integron (with the exception of within specific, entirely sequenced genomes, see below). However, some divisions and subdivisions have been the targets of major sequencing efforts yet are notably absent from this table, including organisms within the Actinobacteria (37 complete genomes sequenced representing 3 orders), and the Firmicutes (103 complete genomes representing 7 orders). It is unknown whether integrons are missing from these lineages all together, or if they are merely absent from the specific organisms selected for sequencing projects. This uncertainty is especially interesting in light of the importance of Gram positive organisms as environmental reservoirs of class 1 integrons <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. In addition, none of the 54 &#945;-proteobacterial genomes, representing 6 orders, contain integrons, yet these elements are found in the &#947;, &#946;, &#948;, and &#949; proteobacterial subdivisions (Additional File <supplr sid="S1">1</supplr>). To our knowledge, no type 1 integrons have been found in the &#945;-proteobacteria.</p>
            <p>Nearly one third of the <it>intI </it>genes that we uncovered are predicted to be non-functional, containing either a stop codon, a frameshift mutation, or a major insertion or deletion that would likely render them inactive (*, Additional File <supplr sid="S1">1</supplr>). The <it>Vibrios </it>appear to be an exception to this rule, as only 4% of <it>intI </it>genes from this group are pseudogenes. Another interesting feature of integron distribution is that more than one-fifth of all integron-containing lineages harbor more than one <it>intI </it>gene (Additional File <supplr sid="S1">1</supplr>). This is likely an underestimate of the fraction of organisms with two or more genes: fewer than half of the isolates in Additional File <supplr sid="S1">1</supplr> are completely sequenced, and the remainder may contain additional integrase genes in the unsequenced regions. Again, the <it>Vibrios </it>differ from the rest, as only 4% of these species with integrons contain multiple integrase genes.</p>
            <p>We mapped the integron phenotype (putative functional genes in green, putative non-functional in purple) onto a 16S rRNA gene tree of all orders of bacteria that contain integrons (again, for simplicity, the typically mobile integrons were excluded). All species reported in Additional File <supplr sid="S1">1</supplr> were included in this phylogeny as integron-positive, while all organisms whose genomes are completely sequenced and do not contain <it>intI </it>genes were designated integron-negative. The distribution of integron-positive lineages suggests that integron-containing organisms comprise a significant fraction of the phylogenetic diversity within these bacterial orders (Figure <figr fid="F1">1</figr>). Specifically, 67.1% of the total branch length within the 16S rRNA gene tree leads to modern genomes that contain integrons, indicating that less than a third of the phylogenetic diversity falls only within integron-negative lineages. UniFrac analysis <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr></abbrgrp> showed significant phylogenetic clustering (P &lt; 0.001 using unweighted UniFrac significance test and P test through the UniFrac web site), indicating that closely related lineages tend to share integron-positive or integron-negative status more than would be expected by chance.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Phylogenetic distribution of integron- and pseudointegron-containing bacteria</p>
               </caption>
               <text>
                  <p><b>Phylogenetic distribution of integron- and pseudointegron-containing bacteria. </b>Lineages colored in green represent integron positive lineages; purple represents lineages with pseudointegrons, and black represents lineages without integrons (organisms which have been entirely sequenced and contain no <it>intI </it>gene). Species that contain pseudo and functional integrons are colored both purple and green. 16S rRNA genes from all integron positive (functional and pseudo) and integron negative organisms in the eighteen bacterial orders in which integrons are found were aligned using the NAST aligner [64]. The tree was constructed in PAUP [61] using the NJ algorithm using the best-fit model of sequence evolution calculated in MODELTEST [66]. Orders and major bacterial phyla are indicated. Tree was rooted using five archaeal 16S rRNA gene sequences: <it>Pyrobaculum clidifontis</it>, <it>Aeropyrum pernix</it>, soil clone cren34kb, <it>Methanobacterium thermoautotrophicum</it>, and <it>Methanosphaera stadtmanae </it>from the Greengenes [64] alignment.</p>
               </text>
               <graphic file="1471-2164-9-261-1"/>
            </fig>
            <p>The current phylogenetic distribution of integrons supports that loss may be an important feature of integron evolution (Figure <figr fid="F1">1</figr>). In some cases this pattern is apparent even within strains of the same species: for example, <it>Shewanella baltica </it>OS195 contains two <it>intI </it>genes, while <it>Shewanella baltica </it>OS155 lacks integrons. Holmes et al. <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> also reported on the spotty nature of integron distribution, finding that two of three isolates of <it>Pseudomonas stutzeri </it>contained <it>intI </it>genes. Although some of these differences may also be due to horizontal transfer of integrons between closely related species <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, the abundance of pseudogenes suggests that decay and subsequent loss are also important processes. While intra-species differences in gene cassette composition is predicted and often observed <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp> due to the inherent integration/excision activity of integrons, these phylogenetically small-scale differences in the presence or absence of integrons is not an expected feature of a non-mobile gene element. Again, the genus <it>Vibrio </it>is a notable exception to this pattern, as all species examined to date harbor integrons.</p>
         </sec>
         <sec>
            <st>
               <p>What is the evolutionary history of intI genes?</p>
            </st>
            <p>We performed extensive phylogenetic analyses using a variety of optimality criteria (maximum likelihood, Bayesian inference, maximum parsimony and minimum evolution) on all <it>intI </it>genes and the predicted protein sequences listed in Additional File <supplr sid="S1">1</supplr> as well as on the typically mobile <it>intI </it>genes (Figure <figr fid="F2">2</figr>). For all methods, <it>intI </it>genes formed a supported clade, exclusive of the XerC/XerD outgroup <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B41">41</abbr></abbrgrp>, with a total of fifteen <it>intI </it>clades that were supported by nearly every analysis (Figure <figr fid="F2">2</figr>). Although our tree structure hints at the recently described soil/freshwater and marine groups <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, these clades are not supported by all of our phylogenetic analyses. Each clade of integrase genes primarily originates from organisms within a specific bacterial lineage (clades 1&#8211;15, with taxon names in Figure <figr fid="F2">2</figr>; colored boxes in Additional File <supplr sid="S1">1</supplr>). The vast majority of these <it>intI </it>genes have been identified by sequence similarity only; thus their functions remain unknown. However, the seven <it>intI </it>genes that have been functionally characterized <it>in vivo </it>are widely distributed throughout the <it>intI </it>phylogeny (highlighted red in Figure <figr fid="F2">2</figr>), supporting a similar role for the uncharacterized genes in the integration and excision of gene cassettes <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B38">38</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>].</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Representative phylogram showing the relationship between <it>intI </it>genes</p>
               </caption>
               <text>
                  <p><b>Representative phylogram showing the relationship between <it>intI </it>genes.</b> Genes colored in purple are putative pseudogenes, sequences in black are putative functional integron integrase genes, and genes colored in red have been shown to be functional <it>in vivo</it>. Circled genes are from mobile integrons. IntI proteins were aligned in MAFFT [58] and used to generate a DNA alignment. We used MrModelTest [60] to select the best-fit model of sequence evolution for MrBayes [59]. Rapid likelihood analysis trees where constructed in RAxML-VI-HPC [63]. PHYLIP [62] was used for constructing parsimony trees. Amino acid phylogenies also support the clades indicated (data not shown). The tree shown is the maximum likelihood estimation, and numbers for each clade indicate the maximum likelihood bootstrap, Bayesian posterior probability and maximum parsimony bootstrap values, respectively. The phyla names indicate the dominant organism type found within each clade. The tree was rooted with <it>xerC </it>and <it>xerD </it>genes from <it>Escherichia coli </it>and <it>Thiobacillus denitrificans</it>. Gene names and accession numbers are provided in Additional File <supplr sid="S1">1</supplr>.</p>
               </text>
               <graphic file="1471-2164-9-261-2"/>
            </fig>
            <p>The integrase gene tree suggests some similarities between integron and organismal evolutionary relationships, and in many cases the <it>intI </it>genes found within supported clades derive largely from a related group of bacteria (Figure <figr fid="F2">2</figr>). However, there is some evidence for incongruent <it>intI </it>and organismal phylogenies, and some of these inconsistencies may be due to horizontal gene transfer events. As stated above, many organisms contain multiple <it>intI </it>genes, and in some cases, the <it>intI </it>phylogeny suggests that a horizontal gene transfer event may have given rise to one of the genes (Figure <figr fid="F2">2</figr>, Additional File <supplr sid="S1">1</supplr>). For example, a gene from each of the deltaproteobacterial species <it>Geobacter metallireducens </it>and <it>Geobacter lovleyi </it>falls just outside of integrase clade 2, which contains genes primarily from within the Betaproteobacteria. These organisms harbor at least two <it>intI </it>genes (<it>G. lovleyi </it>is not completely sequenced yet and therefore may contain more), but the other genes fall into a supported clade with sequences from the Deltaproteobacteria (clade 15, Figure <figr fid="F2">2</figr>). This is also true for <it>Shewanella baltica </it>and <it>Shewanella woodyi</it>, which both contain two <it>intI </it>genes, one which falls in clade 9 and the other which falls in clade 8.</p>
            <p>Indeed, <it>intI </it>mobility is a character state that is widely distributed across the integrase phylogenetic tree &#8211; found in clades 5, 7, 8, and 9 &#8211; supporting that the mobile phenotype has arisen multiple times (Figure <figr fid="F2">2</figr>, circled genes) <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Additionally, although at first glance, integron integrase genes roughly fall into supported clades reflecting the organisms from which they originated (Figure <figr fid="F2">2</figr>), there are significant differences between integrase and 16S rRNA gene phylogenies <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We performed likelihood-based tests for congruence <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> on the <it>intI </it>and 16S rRNA gene phylogenies for Deltaproteobacteria (clade 15) and <it>Shewanella </it>(clade 10). In all cases, there were significant differences between the two phylogenies (Figure <figr fid="F3">3</figr>). However, these data are somewhat difficult to interpret because in many cases the relationships among several "housekeeping" genes (e.g., <it>fusA</it>, <it>rpoA</it>, <it>recA</it>, <it>gyrA</it>) were also determined to be significantly different from the 16S rRNA phylogenies, and from one another (Figure <figr fid="F3">3</figr>). Assuming that the 16S rRNA gene acts as the most accurate metric of genomic evolution, several factors could lead to the lack of concordance between 16S rDNA and integrase relationships. The presence of paralogous <it>intI </it>genes, selection, and horizontal gene transfer events could all lead to a lack of agreement between organismal and integrase phylogenies. Paralogous genes are certainly a factor, as many species contain multiple genes, some which appear to have arisen recently within the genome from gene duplication events (Additional File <supplr sid="S1">1</supplr>, Figure <figr fid="F2">2</figr>). However, even when these events are ignored, and all nodes that are shared between the 16S rRNA gene tree and any paralog are identified, there is a notable lack of concordance. Genetic exchange may also provide a plausible explanation for these patterns, and the association of integrons with transposons and transposase genes provides a possible mechanism for transfer <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Finally, differential selection on the 16S rRNA gene and the integrase gene could also contribute to a lack of agreement between phylogenetic reconstructions, and is an interesting possibility given the evidence for positive selection on IntI proteins (see below).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Likelihood-based tests (KH and SH, [46]) for phylogenetic congruence between <it>intI </it>and a variety of housekeeping genes</p>
               </caption>
               <text>
                  <p><b>Likelihood-based tests (KH and SH, [46]) for phylogenetic congruence between <it>intI </it>and a variety of housekeeping genes.</b> Comparisons indicate if there is a significant difference between the fit of the data to trees generated from the different datasets. Boxes above the diagonal represent comparisons using the genes listed above as the reference dataset, while boxes below the diagonal represent comparisons using the genes listed on the side as the reference dataset. The three boxes for each comparison represent distance, parsimony and likelihood based analyses, respectively. White boxes indicate no significant difference, light yellow boxes indicate that one test revealed significant (p &lt; 0.05) differences and yellow boxes indicate that both tests revealed significant differences.</p>
               </text>
               <graphic file="1471-2164-9-261-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Pseudogenes</p>
            </st>
            <p>As mentioned above, pseudogenes make up more than one third of all integrase genes uncovered in this study. Others have found <it>intI </it>pseudogenes in a number of different bacteria <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B24">24</abbr><abbr bid="B39">39</abbr></abbrgrp>. For example, the typically mobile class 2 integron integrase, <it>intI2</it>, is a non-functional pseudogene, the activity of which can be recovered by suppressing the internal stop codon <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Additionally, <it>intI </it>pseudogenes were prevalent in a recent survey of integrons from <it>Xanthomonas </it>strains <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Pseudogenes are also abundant in molecular phylogenetic surveys of <it>intI </it>genes from environmental samples, comprising 4&#8211;20% of total genes in these collections (Nemergut and Schmidt, unpublished data; Rodriguez-Minguela et al., unpublished data, accession numbers DQ282376&#8211;DQ2822194). In general, pseudogenes are relatively rare features of bacterial genomes, as non-functional genes typically comprise only 2&#8211;8% of all genes in free-living bacteria <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. Therefore, the high percentage of pseudo-integrase genes, or "pseudo-integrons", that we uncovered is surprising. Recent analyses suggest that pseudogenes are likely to be of contemporary origin, as the same pseudogenes are typically not found in the genomes of closely related bacteria <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. Indeed, orthologs of both functional and non-functional integrases are found in different strains of the same species of <it>Xanthomonas </it><abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. The widespread phylogenetic distribution of pseudointegrons within the bacteria (Additional File <supplr sid="S1">1</supplr>, Figure <figr fid="F1">1</figr>) implies that decay is a common feature across lineages.</p>
            <p>Likewise, the scattered distribution of pseudogenes on the <it>intI </it>phylogenetic tree (Figure <figr fid="F2">2</figr>), suggests that gene decay is a frequent process across different classes of integrase genes. For the most part, these appear to be randomly arrayed on the tree, supporting the hypothesis that pseudogenes decay rapidly and are quickly removed from genomes <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp>. However, integrases within clades 4 and 15 are notable exceptions. Interestingly, despite the minimal evolutionary distance among the pseudogenes found in clade 4, these mutations all appear to have originated independently by unique frameshift mutations. This may be an artifact of biased sampling, or it may suggest that strong purifying selection is acting on genes in clade 4, contributing to their rapid decay. In contrast, a single gene fusion event characterizes an entire group of integrase genes within clade 15, and these proteins are an average of ~120 amino acids longer than other IntI proteins. As mentioned previously, pseudogenes are not generally conserved between closely related species <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp> as they are typically lost at rates higher than speciation. Thus, the shared gene fusion event within clade 15 may suggest that these are actually functional genes.</p>
            <p>Pseudogenes are relatively common features of some eucaryotic genomes, particularly vertebrates <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, where they arise from either retrotransposition or DNA duplication events. In bacteria, duplication processes give rise to pseudogenes as well; however, they are also thought to develop via the decay of native single-copy genes and following failed horizontal transfers <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. Our phylogenetic analysis suggests that many of the pseudointegrons that we identified appear to have arisen through intragenomic duplication events (a nearly identical functional integrase and pseudogene within the same genome), while others appear to have entered the cell through gene transfer processes (a pseudogene from a different clade in the same genome with a functional gene from the "expected" clade), and still others appear to have been resident genes that are undergoing decay (only one pseudogene from the "expected" clade in the genome) (Figure <figr fid="F2">2</figr>, Additional File <supplr sid="S1">1</supplr>). In a recent multigenome analysis of bacterial and archaeal pseudogenes, Liu and coworkers revealed that while pseudogenes occur in only approximately 1&#8211;5% of total genes, the proportion of integrase pseudogenes are significantly higher, suggesting a different evolutionary dynamic for this class of genes <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. While, in some cases, the ability to integrate and excise foreign DNA may be selectively advantageous, the gain, loss or rearrangment of gene cassettes could also be deleterious, selecting against functional integrases. Thus, such a high fraction of pseudogenes may suggest that the selective impact of integrons on genomes is variable, oscillating between beneficial and deleterious, possibly depending on environmental conditions. Other interpretations are possible as well, and, as noted by one insightful reviewer, the formation of integrase pseudogenes may result when organisms inhabit environments without gene cassettes, or when recombination processes result in the deletion of gene cassettes that maintain the array (e.g., toxin genes <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>).</p>
            <p>Finally, there is mounting evidence that not all sequences which have been identified as pseudogenes are without a biological function. In some organisms, for example, pseudogenes appear to be important in the generation of variation within multigene families, including both antibody and antigen determinants <abbrgrp><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>. These pseudogenes act as a reservoir of sequence diversity and promote the rapid diversification of gene families through intragenomic recombination events. In many of these cases, genes are under "positive selection" pressures, and mutation (i.e., diversification) is favored. A hallmark of genes undergoing positive selection is an increased ratio of non-synonymous (protein altering) to synonymous (non-protein altering) mutations <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> within groups of closely related genes.</p>
            <p>Thus, we examined the selection dynamics across integrase proteins (Figure <figr fid="F4">4A</figr>). Heuristic dN/dS (also called K<sub>A</sub>/K<sub>S</sub>) pairwise comparisons were made between proteins within integrase clade 7 to the functionally characterized <it>intI </it>from <it>Vibrio cholerae</it>. This comparison reveals two excursions in the dN/dS ratios above 1, suggesting mutational hotspots ranging from codon ~1 to ~50 and from codon ~160 to ~220. However, when these regions were further examined under the best-fit evolutionary model <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, only the N-terminal domain showed evidence for positive selection (data not shown). This motif is thought to be important in the recognition of attC and attI sites <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, and it is possible that selection for mutation in this region would expand or change the consensus sequences that an integrase is able to recognize. Alternatively, as pointed out by one reviewer, this site is also involved in the expression of gene cassettes <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>; thus, mutation in this region may modulate gene cassette mRNA levels in the host cell. There was a suggestion of a similar pattern for the <it>Shewanella </it>proteins in clade 10; however, no pairwise comparisons were significantly greater than 1 when variable evolutionary rates were considered (Figure <figr fid="F4">4B</figr>).</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The ratio of non-synonymous to synonymous mutations over the integrase sequence for <it>Vibrio </it>(A) from clade 7 and <it>Shewanella </it>(B) from clade 10</p>
               </caption>
               <text>
                  <p><b>The ratio of non-synonymous to synonymous mutations over the integrase sequence for <it>Vibrio </it>(A) from clade 7 and <it>Shewanella </it>(B) from clade 10.</b> Proteins were aligned in MAFFT [58] and used to generate a DNA alignment. DNA alignments were used to generate pairwise comparisons with the <it>Vibrio cholerae intI </it>(A), or <it>Shewanella oneidensis intI </it>(B) and the ratio of nonsynonymous and synonymous substitutions at each codon position was calculated. The figure legends show a consensus cladogram (maximum likelihood, maximum parsimony and distance) for the integrase genes.</p>
               </text>
               <graphic file="1471-2164-9-261-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Characterization of new cassettes</p>
            </st>
            <p>Finally, we characterized the integron-associated gene cassettes in two species: <it>Methylobacillus flagellatus </it>KT, an obligate methylotroph; and <it>Dechloromonas aromatica </it>RCB, a perchlorate reducing bacterium (Figure <figr fid="F5">5</figr>). These organisms are found in different orders of the subphylum &#946;-proteobacteria and contain <it>intI </it>genes that group with clade 2 (Figure <figr fid="F2">2</figr>). Several of the cassettes that we identified were classified as hypothetical proteins, and pBLAST provided no further clues as to the potential functions of the proteins coded for by these genes. However, several interesting putative functions for gene cassettes were assigned, including a heavy metal (Co, Zn, Cd) pump (CAT) and a heat shock protein (HSP). Additionally, both integron arrays contained genes related to the c-type peptidyl prolyl cis-trans isomerase family, which are involved in protein folding.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p><it>Methylobacillus flagellatus </it>KT (A) and <it>Dechloromonas aromatica </it>RCB (B) integrons</p>
               </caption>
               <text>
                  <p><it><b>Methylobacillus flagellatus</b></it><b>KT (A) and <it>Dechloromonas aromatica </it>RCB (B) integrons.</b> Numbers indicate locus tags; abbreviations: CAT, cation (Co/Zn/Cd) efflux; LRR, leucine rich repeat; BLEO<sup>R</sup>, bleomycin resistance; PPI, c-type peptidyl prolyl cis-trans isomerase; HSP70, heat shock protein 70; OMP, outer membrane protein; HYPR, hypothetical protein.</p>
               </text>
               <graphic file="1471-2164-9-261-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>These data reveal several interesting and unusual features of integron evolution, some that appear to be consistent across <it>intI </it>and/or organismal clades. First, the present distribution of integrons is likely due to multiple loss events (Figure <figr fid="F1">1</figr>) as well as horizontal gene transfers (Figure <figr fid="F3">3</figr>). Second, many organisms contain multiple, sometimes phylogenetically distinct, integrons (Additional File <supplr sid="S1">1</supplr>, Figure <figr fid="F2">2</figr>). Third, a high percentage of <it>intI </it>genes are non-functional (Additional File <supplr sid="S1">1</supplr>; Figures <figr fid="F1">1</figr>, <figr fid="F2">2</figr>). Fourth, positive selection seems to be occurring on one IntI domain, at least in the vibrios and perhaps in <it>Shewanella </it>(Figure <figr fid="F4">4</figr>). And, finally, <it>intI </it>genes form supported clades that roughly reflect the order of the species from which they originated, but in some cases there are obvious exceptions (Figure <figr fid="F2">2</figr>, Additional File <supplr sid="S1">1</supplr>). Together, these results highlight the dynamic nature of integron evolution, and provide further support that these genetic elements may be important in the genomic fluidity of a large number of diverse bacteria.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Integrase phylogeny</p>
            </st>
            <p>The <it>Enterobacter cloacae </it>IntI1 protein sequence (GenBank accession <ext-link ext-link-type="gen" ext-link-id="ABO46012">ABO46012</ext-link>) was used to search the nr GenBank database with the PSI-BLAST algorithm and to query the NCBI microbial genome database using the tBLASTn algorithm on April 20, 2007 <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. All coding region matches that contained the <it>intI </it>integrase additional domain <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, or <it>intI </it>patch <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> were selected for subsequent analysis, as were the corresponding putative translated sequences. Pseudogenes were identified as those coding regions that were either interrupted by a frameshift mutation or a stop codon, or as proteins that were at least 20% larger or smaller than the <it>intI1 </it>gene. For phylogenetic analyses of pseudogenes, we selected only the translated region that aligned to the IntI1 protein and the corresponding DNA sequence. For pseudogenes that contained multiple mutations, alignable regions of both DNA and proteins were concatenated so that their sequences were contiguous. It is of note that NB231_00025, an <it>intI </it>sequence from <it>Nitrococcus mobilis </it>Nb-231, may not be a pseudogene, as it was found on the end of a shotgun sequence and may be complete within the contiguous genome.</p>
            <p>The online version of the sequence alignment program MAFFT <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> was employed to align the amino acid sequences and four outgroup sequences (XerC and XerD from <it>Escherichia coli </it>and <it>Thiobacillus denitrificans </it><abbrgrp><abbr bid="B17">17</abbr><abbr bid="B41">41</abbr></abbrgrp>) using the following settings: FFT-NS-I, and the BLOSUM45 model. Amino acid alignments were manually adjusted, and any unalignable regions, particularly from within pseudogenes, were trimmed and removed from further analysis. Next, a Python script was developed to guide the insertion of gaps into the corresponding coding DNA sequence for the production of a DNA-based codon alignment (Robeson, unpublished program).</p>
            <p>These coding DNA data sets were analyzed using the parallel version of MrBayes <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> and run anywhere from two million to eight million generations in order to achieve convergence under the GTR+I+G model of evolution, as selected via MrModelTest <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>. Consensus trees from the analyses were constructed in PAUP*4.0 <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> after typically removing ~10% of the burnin trees. Parsimony phylogenetic reconstructions were performed using the PHYLIP <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> software package and subjected to 100 bootstrap replicates with 10 randomizations of taxa input order for each bootstrap. Maximum Likelihood analysis used the GTRGAMMA model and was subjected to 100 boostrapped replicates in RAxML-VI-HPC <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>.</p>
            <p>We performed likelihood-based tests for congruence <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> on the <it>intI</it>, 16S rRNA, <it>fusA</it>, <it>rpoA</it>, <it>recA</it>, <it>gyrA </it>gene trees for Deltaproteobacteria (clade 15) and <it>Shewanella </it>(clade 10). For each comparison, we downloaded DNA sequences from Genbank for all available phyla, and generated trees using distance, parsimony and likelihood optimality criteria. Likelihood tests were executed in PAUP*4.0 as described previously <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>16S rRNA gene phylogeny</p>
            </st>
            <p>For all organisms found to contain integrons, available 16S rRNA gene sequences were obtained from GenBank. We were unable to obtain 16S rRNA gene sequences for all species with integrons; specifically: Alteromonadales bacterium TW-7, <it>Vibrio sp</it>. DAT722 <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>, <it>Xanthomonas sp</it>. CIP 102397 <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, <it>Vibrio cholerae </it>MAK 757, <it>Vibrio cholerae </it>MZO-3, the <it>Xanthomonas </it>DAR strains <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>, <it>Shewanella putrefaciens </it>CIP 69.34 <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. In addition, because of the lack of availability, several of the 16S rRNA genes that we obtained were not from the same strain that the integron was identified in, specifically: <it>Listonella anguillarum</it>, <it>Listonella pelagia</it>, <it>Vibrio mimicus</it>, <it>Vibrio metschnikovii</it>, <it>Vibrio natriegens</it>, <it>Vibrio salmonicida</it>, and <it>Xanthomonas badrii</it>. All told, we found integrons in eighteen different bacterial orders. 16S rRNA genes from integron containing lineages were aligned with 16S rRNA genes from completely sequenced bacteria from within these same eighteen orders that lacked integrons, as well as with five archaeal 16S rRNA gene outgroup sequences: <it>Pyrobaculum clidifontis</it>, <it>Aeropyrum pernix</it>, soil clone cren34kb, <it>Methanobacterium thermoautotrophicum</it>, and <it>Methanosphaera stadtmanae</it>. We used the online NAST-based algorithm available from the Greengenes website <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> to align the 16S rRNA genes, imported alignments into ARB and exported the alignment using lanemaskPH to remove hypervariable regions <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. MODELTEST <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> was used to estimate the best-fit model of sequence evolution for 16S rRNA gene alignments, and a minimum evolution-based phylogenetic reconstruction was generated in PAUP 4.0. Maximum parsimony and maximum likelihood methods yielded largely similar phylogenic reconstructions but were not shown.</p>
         </sec>
         <sec>
            <st>
               <p>dN/dS ratios and mutational frequencies</p>
            </st>
            <p>Integrase protein alignments for sequences in clades 7 and 10 were selected for pairwise comparisons to the functionally characterized <it>Vibrio cholerae </it>and <it>Shewanella oneidensis </it>IntI proteins, respectively. Only putative functional integrase proteins that were >95% different from all others and that originated from within the genus <it>Vibrio </it>or <it>Shewanella </it>were analyzed. Amino acid alignments were generated using MAFFT with the parameters described above. Next, a codon-based DNA alignment was constructed using our Python script. DNA alignments were then formatted to compare two sequences at a time, using the <it>Shewanella oneidensis </it>and <it>Vibrio cholerae intI </it>genes as reference sequences. DNA alignments were submitted to the online program SNAP <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> which calculates the change in nonsynonymous (dN) and synonymous (dS) substitutions at each codon position. dN/dS ratios were calculated by integrating over 20 nucleotide positions, and the two-times greater frequency of synonymous sites in a codon was corrected for by dividing this value by 2. dN/dS ratios were plotted against codon position integrated over twenty nucleotide positions. Finally, we used PAML <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> to calculate the dN/dS ratio and likelihood values of a variety of evolutionary models <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DRN conceived of the study and drafted the manuscript. MSR carried out the molecular phylogenetic studies. RFK conceived of and carried out the dN/dS ratio comparisons. APM, SKS and RK helped to draft the manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors wish to thank Alan Townsend, Elizabeth Costello, Jesse Zaneveld and Sean O'Neill for helpful discussions, and members of the Nemergut lab for critically reviewing the manuscript. Computer time was provided by NSF ARI Grant CDA-9601817, NSF MRI Grant CNS-0420873, NASA AIST grant NAG2-1646, DOE SciDAC grant DE-FG02-04ER63870, NSF sponsorship of the National Center for Atmospheric Research, and a grant from the IBM Shared University Research (SUR) program.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Horizontal gene transfer: perspectives at a crossroads of scientific disciplines</p>
            </title>
            <aug>
               <au>
                  <snm>Smets</snm>
                  <fnm>BF</fnm>
               </au>
               <au>
                  <snm>Barkay</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature Reviews Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>675</fpage>
            <lpage>678</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1253</pubid>
                  <pubid idtype="pmpid" link="fulltext">16145755</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Whole-genome analysis of photosynthetic prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Raymond</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhaxybayeva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Gerdes</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Blankenship</snm>
                  <fnm>RE</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>298</volume>
            <issue>5598</issue>
            <fpage>1616</fpage>
            <lpage>1620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1075558</pubid>
                  <pubid idtype="pmpid" link="fulltext">12446909</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Evidence for lateral gene transfer between Archaea and Bacteria from genome sequence of Thermotoga maritima</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Clayton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gill</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Gwinn</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Dodson</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Haft</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Ketchum</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Utterback</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Malek</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Linher</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Garrett</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Cotton</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Pratt</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Heidelberg</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sutton</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Eisen</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>HO</fnm>
               </au>
               <au>
                  <snm>Venter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1999</pubdate>
            <volume>399</volume>
            <issue>6734</issue>
            <fpage>323</fpage>
            <lpage>329</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/20601</pubid>
                  <pubid idtype="pmpid" link="fulltext">10360571</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Studying plasmid horizontal transfer in situ: a critical review</p>
            </title>
            <aug>
               <au>
                  <snm>Sorensen</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hansen</snm>
                  <fnm>LH</fnm>
               </au>
               <au>
                  <snm>Kroer</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wuertz</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature Reviews Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <fpage>700</fpage>
            <lpage>710</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1232</pubid>
                  <pubid idtype="pmpid" link="fulltext">16138098</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Mechanisms of, and barriers to, horizontal gene transfer between bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Thomas</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>KM</fnm>
               </au>
            </aug>
            <source>Nature Reviews Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>9</issue>
            <fpage>711</fpage>
            <lpage>721</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1234</pubid>
                  <pubid idtype="pmpid" link="fulltext">16138099</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A broad-host-range, generalized transducing phage (SN-T) acquires 16S rRNA genes from different genera of bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Beumer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>JB</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>71</volume>
            <issue>12</issue>
            <fpage>8301</fpage>
            <lpage>8304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1317401</pubid>
                  <pubid idtype="pmpid" link="fulltext">16332816</pubid>
                  <pubid idtype="doi">10.1128/AEM.71.12.8301-8304.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A novel family of potentially mobile DNA elements encoding site-specific gene-integration functions - integrons</p>
            </title>
            <aug>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1989</pubdate>
            <volume>3</volume>
            <issue>12</issue>
            <fpage>1669</fpage>
            <lpage>1683</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1989.tb00153.x</pubid>
                  <pubid idtype="pmpid">2560119</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Site-specific insertion of genes into integrons - role of the 59-base element and determination of the recombination cross-over point</p>
            </title>
            <aug>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Brookes</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1991</pubdate>
            <volume>5</volume>
            <issue>8</issue>
            <fpage>1941</fpage>
            <lpage>1959</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1991.tb00817.x</pubid>
                  <pubid idtype="pmpid">1662753</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Gene cassettes from the insert region of integrons are excised as covalently closed circles</p>
            </title>
            <aug>
               <au>
                  <snm>Collis</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1992</pubdate>
            <volume>6</volume>
            <issue>19</issue>
            <fpage>2875</fpage>
            <lpage>2885</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1992.tb01467.x</pubid>
                  <pubid idtype="pmpid">1331702</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Site-specific deletion and rearrangement of integron insert genes catalyzed by the integron DNA integrase</p>
            </title>
            <aug>
               <au>
                  <snm>Collis</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>1992</pubdate>
            <volume>174</volume>
            <issue>5</issue>
            <fpage>1574</fpage>
            <lpage>1585</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">206553</pubid>
                  <pubid idtype="pmpid" link="fulltext">1311297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Structure and function of 59-base element recombination sites associated with mobile gene cassettes</p>
            </title>
            <aug>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>O'Gorman</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Recchia</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Parsekhian</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>26</volume>
            <issue>4</issue>
            <fpage>731</fpage>
            <lpage>745</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1997.6091980.x</pubid>
                  <pubid idtype="pmpid">9427403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Gene cassettes: A new class of mobile element</p>
            </title>
            <aug>
               <au>
                  <snm>Recchia</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Microbiol -UK</source>
            <pubdate>1995</pubdate>
            <volume>141</volume>
            <fpage>3015</fpage>
            <lpage>3027</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Expression of antibiotic resistance genes in the integrated cassettes of integrons</p>
            </title>
            <aug>
               <au>
                  <snm>Collis</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Antimicrob Agents Chemother</source>
            <pubdate>1995</pubdate>
            <volume>39</volume>
            <issue>1</issue>
            <fpage>155</fpage>
            <lpage>162</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">162502</pubid>
                  <pubid idtype="pmpid" link="fulltext">7695299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Resistance integrons and super-integrons</p>
            </title>
            <aug>
               <au>
                  <snm>Fluit</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Schmitz</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Clinical Microbiology and Infection</source>
            <pubdate>2004</pubdate>
            <volume>10</volume>
            <issue>4</issue>
            <fpage>272</fpage>
            <lpage>288</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1198-743X.2004.00858.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Gram-positive bacteria are a major reservoir of Class 1 antibiotic resistance integrons in poultry litter</p>
            </title>
            <aug>
               <au>
                  <snm>Nandi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Maurer</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Hofacre</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Summers</snm>
                  <fnm>AO</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>18</issue>
            <fpage>7118</fpage>
            <lpage>7122</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">406475</pubid>
                  <pubid idtype="pmpid" link="fulltext">15107498</pubid>
                  <pubid idtype="doi">10.1073/pnas.0306466101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>OXA-46, a new class D beta-lactamase of narrow substrate specificity encoded by a (bla)(vim-1)-containing integron from a Pseudomonas aeruginosa clinical isolate</p>
            </title>
            <aug>
               <au>
                  <snm>Giuliani</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Docquier</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Riccio</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Pagani</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rossolini</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Antimicrob Agents Chemother</source>
            <pubdate>2005</pubdate>
            <volume>49</volume>
            <issue>5</issue>
            <fpage>1973</fpage>
            <lpage>1980</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1087641</pubid>
                  <pubid idtype="pmpid" link="fulltext">15855521</pubid>
                  <pubid idtype="doi">10.1128/AAC.49.5.1973-1980.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Integrons: agents of bacterial evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature Reviews Microbiology</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <issue>8</issue>
            <fpage>608</fpage>
            <lpage>620</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1462</pubid>
                  <pubid idtype="pmpid" link="fulltext">16845431</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Molecular analysis of antibiotic resistance gene clusters in Vibrio cholerae O139 and O1SXT constins</p>
            </title>
            <aug>
               <au>
                  <snm>Hochhut</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lotfi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Faruque</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Woodgate</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Waldor</snm>
                  <fnm>MK</fnm>
               </au>
            </aug>
            <source>Antimicrob Agents Chemother</source>
            <pubdate>2001</pubdate>
            <volume>45</volume>
            <issue>11</issue>
            <fpage>2991</fpage>
            <lpage>3000</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">90773</pubid>
                  <pubid idtype="pmpid" link="fulltext">11600347</pubid>
                  <pubid idtype="doi">10.1128/AAC.45.11.2991-3000.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>IntI2 integron integrase in Tn7</p>
            </title>
            <aug>
               <au>
                  <snm>Hansson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sundstrom</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Pelletier</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>PH</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <issue>6</issue>
            <fpage>1712</fpage>
            <lpage>1721</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">134885</pubid>
                  <pubid idtype="pmpid" link="fulltext">11872723</pubid>
                  <pubid idtype="doi">10.1128/JB.184.6.1712-1721.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Characterization of the class 3 integron and the site-specific recombination system it determines</p>
            </title>
            <aug>
               <au>
                  <snm>Collis</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Partridge</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <issue>11</issue>
            <fpage>3017</fpage>
            <lpage>3026</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">135066</pubid>
                  <pubid idtype="pmpid" link="fulltext">12003943</pubid>
                  <pubid idtype="doi">10.1128/JB.184.11.3017-3026.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Integrons or super integrons?</p>
            </title>
            <aug>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>Microbiol -SGM</source>
            <pubdate>2004</pubdate>
            <volume>150</volume>
            <fpage>3</fpage>
            <lpage>4</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1099/mic.0.26854-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>What are superintegrons?</p>
            </title>
            <aug>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>PH</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>Nature Reviews Microbiology</source>
            <pubdate>2007</pubdate>
            <volume>5</volume>
            <issue>2</issue>
            <fpage>1</fpage>
            <lpage>2</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1462-c1</pubid>
                  <pubid idtype="pmpid">1722535</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Integrons: mobilizable platforms that promote genetic diversity in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Boucher</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Labbate</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Koenig</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2007</pubdate>
            <volume>15</volume>
            <issue>7</issue>
            <fpage>301</fpage>
            <lpage>309</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tim.2007.05.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">17566739</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The evolutionary history of chromosomal super-integrons provides an ancestry for multiresistant integrons</p>
            </title>
            <aug>
               <au>
                  <snm>Rowe-Magnus</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Guerout</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Ploncard</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dychinco</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <issue>2</issue>
            <fpage>652</fpage>
            <lpage>657</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">14643</pubid>
                  <pubid idtype="pmpid" link="fulltext">11209061</pubid>
                  <pubid idtype="doi">10.1073/pnas.98.2.652</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Integron diversity in heavy-metal-contaminated mine tailings and inferences about integron evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Nemergut</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>70</volume>
            <issue>2</issue>
            <fpage>1160</fpage>
            <lpage>1168</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">348930</pubid>
                  <pubid idtype="pmpid" link="fulltext">14766601</pubid>
                  <pubid idtype="doi">10.1128/AEM.70.2.1160-1168.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>A distinctive class of integron in the Vibrio cholerae genome</p>
            </title>
            <aug>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dychinco</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Webb</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>280</volume>
            <issue>5363</issue>
            <fpage>605</fpage>
            <lpage>608</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.280.5363.605</pubid>
                  <pubid idtype="pmpid" link="fulltext">9554855</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The gene cassette metagenome is a basic resource for bacterial genome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Gillings</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Nield</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Mabbutt</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Nevalainen</snm>
                  <fnm>KMH</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>Environ Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <issue>5</issue>
            <fpage>383</fpage>
            <lpage>394</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1462-2920.2003.00429.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12713464</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Discovery and distribution of super-integrons among Pseudomonads</p>
            </title>
            <aug>
               <au>
                  <snm>Vaisvila</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Morgan</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Posfai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Raleigh</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>42</volume>
            <issue>3</issue>
            <fpage>587</fpage>
            <lpage>601</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2001.02604.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11722728</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Mobile gene cassettes: A fundamental resource for bacterial evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Michael</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Gillings</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Andrew</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Holley</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>American Naturalist</source>
            <pubdate>2004</pubdate>
            <volume>164</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1086/421733</pubid>
                  <pubid idtype="pmpid" link="fulltext">15266366</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Gene cassette PCR: sequence-independent recovery of entire genes from environmental DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Nield</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Holley</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Nevalainen</snm>
                  <fnm>KMH</fnm>
               </au>
               <au>
                  <snm>Mabbutt</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Gillings</snm>
                  <fnm>MR</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>67</volume>
            <issue>11</issue>
            <fpage>5240</fpage>
            <lpage>5246</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">93296</pubid>
                  <pubid idtype="pmpid" link="fulltext">11679351</pubid>
                  <pubid idtype="doi">10.1128/AEM.67.11.5240-5246.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Recovery and evolutionary analysis of complete integron gene cassette arrays from Vibrio</p>
            </title>
            <aug>
               <au>
                  <snm>Boucher</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nesbo</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Joss</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mabbutt</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Gillings</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1382264</pubid>
                  <pubid idtype="pmpid" link="fulltext">16417647</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Comparative genome analysis of Vibrio vulnificus, a marine pathogen</p>
            </title>
            <aug>
               <au>
                  <snm>Chen</snm>
                  <fnm>CY</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>YC</fnm>
               </au>
               <au>
                  <snm>Chang</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>HC</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>YM</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Shen</snm>
                  <fnm>ABT</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Shao</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Hor</snm>
                  <fnm>LI</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>SF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>12</issue>
            <fpage>2577</fpage>
            <lpage>2587</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403799</pubid>
                  <pubid idtype="pmpid" link="fulltext">14656965</pubid>
                  <pubid idtype="doi">10.1101/gr.1295503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The Vibrio cholerae O1 chromosomal integron</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Purins</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kaewrakon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Focareta</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Manning</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Microbiol -UK</source>
            <pubdate>2000</pubdate>
            <volume>146</volume>
            <fpage>2605</fpage>
            <lpage>2612</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Secondary sites for integration mediated by the Tn21 integrase</p>
            </title>
            <aug>
               <au>
                  <snm>Francia</snm>
                  <fnm>MV</fnm>
               </au>
               <au>
                  <snm>Delacruz</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lobo</snm>
                  <fnm>JMG</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1993</pubdate>
            <volume>10</volume>
            <issue>4</issue>
            <fpage>823</fpage>
            <lpage>828</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.1993.tb00952.x</pubid>
                  <pubid idtype="pmpid">7934844</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Characterization of specific and secondary recombination sites recognized by the integron DNA integrase</p>
            </title>
            <aug>
               <au>
                  <snm>Recchia</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <issue>11</issue>
            <fpage>2071</fpage>
            <lpage>2078</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308123</pubid>
                  <pubid idtype="pmpid" link="fulltext">8029014</pubid>
                  <pubid idtype="doi">10.1093/nar/22.11.2071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>UniFrac - An online tool for comparing microbial community diversity in a phylogenetic context</p>
            </title>
            <aug>
               <au>
                  <snm>Lozupone</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hamady</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1564154</pubid>
                  <pubid idtype="pmpid" link="fulltext">16893466</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>UniFrac: a new phylogenetic method for comparing microbial communities</p>
            </title>
            <aug>
               <au>
                  <snm>Lozupone</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Knight</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>71</volume>
            <issue>12</issue>
            <fpage>8228</fpage>
            <lpage>8235</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1317376</pubid>
                  <pubid idtype="pmpid" link="fulltext">16332807</pubid>
                  <pubid idtype="doi">10.1128/AEM.71.12.8228-8235.2005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Recombination activity of a distinctive integron-gene cassette system associated with Pseudomonas stutzeri populations in soil</p>
            </title>
            <aug>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Holley</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Mahon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nield</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gillings</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2003</pubdate>
            <volume>185</volume>
            <issue>3</issue>
            <fpage>918</fpage>
            <lpage>928</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">142810</pubid>
                  <pubid idtype="pmpid" link="fulltext">12533467</pubid>
                  <pubid idtype="doi">10.1128/JB.185.3.918-928.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Integrons in Xanthomonas: A source of species genome diversity</p>
            </title>
            <aug>
               <au>
                  <snm>Gillings</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Holley</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>4419</fpage>
            <lpage>4424</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">555480</pubid>
                  <pubid idtype="pmpid" link="fulltext">15755815</pubid>
                  <pubid idtype="doi">10.1073/pnas.0406620102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Comparative analysis of superintegrons: Engineering extensive genetic diversity in the Vibrionaceae</p>
            </title>
            <aug>
               <au>
                  <snm>Rowe-Magnus</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Guerout</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Biskri</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bouige</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>3</issue>
            <fpage>428</fpage>
            <lpage>442</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430272</pubid>
                  <pubid idtype="pmpid" link="fulltext">12618374</pubid>
                  <pubid idtype="doi">10.1101/gr.617103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The integrase family of tyrosine recombinases: evolution of a conserved active site domain</p>
            </title>
            <aug>
               <au>
                  <snm>Esposito</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Scocca</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>18</issue>
            <fpage>3605</fpage>
            <lpage>3614</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146934</pubid>
                  <pubid idtype="pmpid" link="fulltext">9278480</pubid>
                  <pubid idtype="doi">10.1093/nar/25.18.3605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The intI-like tyrosine recombinase of Shewanella oneidensis is active as an integron integrase</p>
            </title>
            <aug>
               <au>
                  <snm>Drouin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Melancon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>PH</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <issue>6</issue>
            <fpage>1811</fpage>
            <lpage>1815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">134884</pubid>
                  <pubid idtype="pmpid" link="fulltext">11872738</pubid>
                  <pubid idtype="doi">10.1128/JB.184.6.1811-1815.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Excision and integration of cassettes by an integron integrase of Nitrosomonas europaea</p>
            </title>
            <aug>
               <au>
                  <snm>Leon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>PH</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2003</pubdate>
            <volume>185</volume>
            <issue>6</issue>
            <fpage>2036</fpage>
            <lpage>2041</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">150120</pubid>
                  <pubid idtype="pmpid" link="fulltext">12618471</pubid>
                  <pubid idtype="doi">10.1128/JB.185.6.2036-2041.2003</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The native Pseudomonas stutzeri strain Q chromosonal integron can capture and express cassette-associated genes</p>
            </title>
            <aug>
               <au>
                  <snm>Coleman</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>151</volume>
            <fpage>1853</fpage>
            <lpage>1864</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/mic.0.27854-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15941993</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Structural basis for broad DNA-specificity in integron recombination</p>
            </title>
            <aug>
               <au>
                  <snm>MacDonald</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Demarre</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bouvier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gopaul</snm>
                  <fnm>DN</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>440</volume>
            <issue>7088</issue>
            <fpage>1157</fpage>
            <lpage>1162</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature04643</pubid>
                  <pubid idtype="pmpid" link="fulltext">16641988</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Multiple comparisons of log-likelihoods with applications to phylogenetic inference</p>
            </title>
            <aug>
               <au>
                  <snm>Shimodaira</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <issue>8</issue>
            <fpage>1114</fpage>
            <lpage>1116</lpage>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Recognizing the pseudogenes in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>10</issue>
            <fpage>3125</fpage>
            <lpage>3132</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1142405</pubid>
                  <pubid idtype="pmpid" link="fulltext">15933207</pubid>
                  <pubid idtype="doi">10.1093/nar/gki631</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Psi-Phi: Exploring the outer limits of bacterial pseudogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>11</issue>
            <fpage>2273</fpage>
            <lpage>2278</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">525686</pubid>
                  <pubid idtype="pmpid" link="fulltext">15479949</pubid>
                  <pubid idtype="doi">10.1101/gr.2925604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Vertebrate pseudogenes</p>
            </title>
            <aug>
               <au>
                  <snm>Mighell</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>NR</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Markham</snm>
                  <fnm>AF</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2000</pubdate>
            <volume>468</volume>
            <issue>2-3</issue>
            <fpage>109</fpage>
            <lpage>114</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(00)01199-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10692568</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Comprehensive analysis of pseudogenes in prokaryotes: widespread gene decay and failure of putative horizontally transferred genes</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Harrison</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Kunin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>9</issue>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">15345048</pubid>
                  <pubid idtype="pmcid">522871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Chromosomal toxin-antitoxin loci can diminish large-scale genome reductions in the absence of selection</p>
            </title>
            <aug>
               <au>
                  <snm>Szekeres</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dauti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wilde</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mazel</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rowe-Magnus</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2007</pubdate>
            <volume>63</volume>
            <issue>6</issue>
            <fpage>1588</fpage>
            <lpage>1605</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2007.05613.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17367382</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Pseudogenes: Are they "Junk" or functional DNA?</p>
            </title>
            <aug>
               <au>
                  <snm>Balakirev</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Ayala</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2003</pubdate>
            <volume>37</volume>
            <fpage>123</fpage>
            <lpage>151</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.37.040103.103949</pubid>
                  <pubid idtype="pmpid" link="fulltext">14616058</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Biological applications of the theory of birth-and-death processes</p>
            </title>
            <aug>
               <au>
                  <snm>Novozhilov</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Karev</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Briefings In Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>70</fpage>
            <lpage>85</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/bbk006</pubid>
                  <pubid idtype="pmpid" link="fulltext">16761366</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Codon-substitution models for heterogeneous selection pressure at amino acid sites</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZH</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>AMK</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>155</volume>
            <issue>1</issue>
            <fpage>431</fpage>
            <lpage>449</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461088</pubid>
                  <pubid idtype="pmpid" link="fulltext">10790415</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <issue>3</issue>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Integron integrases possess a unique additional domain necessary for activity</p>
            </title>
            <aug>
               <au>
                  <snm>Messier</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>PH</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2001</pubdate>
            <volume>183</volume>
            <issue>22</issue>
            <fpage>6699</fpage>
            <lpage>6706</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">95504</pubid>
                  <pubid idtype="pmpid" link="fulltext">11673443</pubid>
                  <pubid idtype="doi">10.1128/JB.183.22.6699-6706.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Recovery of new integron classes from environmental DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Nield</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Gillings</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Recchia</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Mabbutt</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Nevalainen</snm>
                  <fnm>KMH</fnm>
               </au>
               <au>
                  <snm>Stokes</snm>
                  <fnm>HW</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>2001</pubdate>
            <volume>195</volume>
            <issue>1</issue>
            <fpage>59</fpage>
            <lpage>65</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1574-6968.2001.tb10498.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11166996</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform</p>
            </title>
            <aug>
               <au>
                  <snm>Katoh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Misawa</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Miyata</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>14</issue>
            <fpage>3059</fpage>
            <lpage>3066</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">135756</pubid>
                  <pubid idtype="pmpid" link="fulltext">12136088</pubid>
                  <pubid idtype="doi">10.1093/nar/gkf436</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Parallel metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference</p>
            </title>
            <aug>
               <au>
                  <snm>Altekar</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dwarkadas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>3</issue>
            <fpage>407</fpage>
            <lpage>415</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg427</pubid>
                  <pubid idtype="pmpid" link="fulltext">14960467</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Bayesian phylogenetic analysis of combined data</p>
            </title>
            <aug>
               <au>
                  <snm>Nylander</snm>
                  <fnm>JAA</fnm>
               </au>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Nieves-Aldrey</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2004</pubdate>
            <volume>53</volume>
            <issue>1</issue>
            <fpage>47</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150490264699</pubid>
                  <pubid idtype="pmpid">14965900</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Phylogenetic analysis using parsimony (*and other methods)</p>
            </title>
            <aug>
               <au>
                  <snm>Swofford</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <publisher>Sunderland, MA , Sinauer Associates</publisher>
            <edition>4 ed.</edition>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B62">
            <title>
               <p>PHYLIP -- Phylogeny Inference Package (Version 3.2)</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Cladistics</source>
            <pubdate>1989</pubdate>
            <volume>5</volume>
            <fpage>164</fpage>
            <lpage>166</lpage>
         </bibl>
         <bibl id="B63">
            <title>
               <p>RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models</p>
            </title>
            <aug>
               <au>
                  <snm>Stamatakis</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>21</issue>
            <fpage>2688</fpage>
            <lpage>2690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl446</pubid>
                  <pubid idtype="pmpid" link="fulltext">16928733</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB</p>
            </title>
            <aug>
               <au>
                  <snm>DeSantis</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Hugenholtz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rojas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brodie</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Keller</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dalevi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>GL</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>72</volume>
            <issue>7</issue>
            <fpage>5069</fpage>
            <lpage>5072</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1489311</pubid>
                  <pubid idtype="pmpid" link="fulltext">16820507</pubid>
                  <pubid idtype="doi">10.1128/AEM.03006-05</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>ARB: a software environment for sequence data</p>
            </title>
            <aug>
               <au>
                  <snm>Ludwig</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Strunk</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Westram</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Richter</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Meier</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <cnm>Yadhukumar</cnm>
               </au>
               <au>
                  <snm>Buchner</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Steppi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jobb</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Forster</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Brettske</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Gerber</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ginhart</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Gross</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Grumann</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hermann</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jost</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Konig</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Liss</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lussmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>May</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nonhoff</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Reichel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Strehlow</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stamatakis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stuckmann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Vilbig</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lenke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ludwig</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Bode</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schleifer</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>4</issue>
            <fpage>1363</fpage>
            <lpage>1371</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">390282</pubid>
                  <pubid idtype="pmpid" link="fulltext">14985472</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh293</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>MODELTEST: testing the model of DNA substitution</p>
            </title>
            <aug>
               <au>
                  <snm>Posada</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Crandall</snm>
                  <fnm>KA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>817</fpage>
            <lpage>818</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/14.9.817</pubid>
                  <pubid idtype="pmpid" link="fulltext">9918953</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p/>
            </title>
            <aug>
               <au>
                  <cnm>SNAP</cnm>
               </au>
            </aug>
            <url>http://www.es.embnet.org/Doc/SNAP/</url>
         </bibl>
         <bibl id="B68">
            <title>
               <p>PAML: a program package for phylogenetic analysis by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>ZH</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <issue>5</issue>
            <fpage>555</fpage>
            <lpage>556</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9367129</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
