<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-6-89</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>SmTRC1, a novel <it>Schistosoma mansoni </it>DNA transposon, discloses new families of animal and fungi transposons belonging to the CACTA superfamily</p>
         </title>
         <aug>
            <au id="A1">
               <snm>DeMarco</snm>
               <fnm>Ricardo</fnm>
               <insr iid="I1"/>
               <email>rdemarco@iq.usp.br</email>
            </au>
            <au id="A2">
               <snm>Venancio</snm>
               <mi>M</mi>
               <fnm>Thiago</fnm>
               <insr iid="I2"/>
               <email>venancio@iq.usp.br</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Verjovski-Almeida</snm>
               <fnm>Sergio</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>verjo@iq.usp.br</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Laboratory of Gene Expression in Eukaryotes; Departamento de Bioqu&#237;mica, Instituto de Qu&#237;mica, Universidade de S&#227;o Paulo, Brazil</p>
            </ins>
            <ins id="I2">
               <p>Laboratory of Bioinformatics; Departamento de Bioqu&#237;mica, Instituto de Qu&#237;mica, Universidade de S&#227;o Paulo, Brazil</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2006</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>89</fpage>
         <url>http://www.biomedcentral.com/1471-2148/6/89</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17090310</pubid>
               <pubid idtype="doi">10.1186/1471-2148-6-89</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>26</day>
               <month>6</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>07</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>07</day>
               <month>11</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>DeMarco et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Elements of the CACTA (also called En/Spm) superfamily of DNA-only transposons contain the core sequence CACTA in their Terminal Inverted Repeats (TIRs) and so far have only been described in plants. Large transcriptome and genome sequence data have recently become publicly available for <it>Schistosoma mansoni</it>, a digenetic blood fluke that is a major causative agent of schistosomiasis in humans, and have provided a comprehensive repository for the discovery of novel genes and repetitive elements. Despite the extensive description of retroelements in <it>S. mansoni</it>, just a single DNA-only transposon belonging to the Merlin family has so far been reported in this organism.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We describe a novel <it>S. mansoni </it>transposon named SmTRC1, for <it>S. mansoni </it><ul>T</ul>ransposon <ul>R</ul>elated to <ul>C</ul>ACTA <ul>1</ul>, an element that shares several characteristics with plant CACTA transposons. Southern blotting indicates approximately 30&#8211;300 copies of SmTRC1 in the <it>S. mansoni </it>genome. Using genomic PCR followed by cloning and sequencing, we amplified and characterized a full-length and a truncated copy of this element. RT-PCR using <it>S. mansoni </it>mRNA followed by cloning and sequencing revealed several alternatively spliced transcripts of this transposon, resulting in distinct ORFs coding for different proteins. Interestingly, a survey of complete genomes from animals and fungi revealed several other novel TRC elements, indicating new families of DNA transposons belonging to the CACTA superfamily that have not previously been reported in these kingdoms. The first three bases in the <it>S. mansoni </it>TIR are CCC and they are identical to those in the TIRs of the insects <it>Aedes aegypti </it>and <it>Tribolium castaneum</it>, suggesting that animal TRCs may display a CCC core sequence.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The DNA-only transposable element SmTRC1 from <it>S. mansoni </it>exhibits various characteristics, such as generation of multiple alternatively-spliced transcripts, the presence of terminal inverted repeats at the extremities of the elements flanked by direct repeats and the presence of a Transposase_21 domain, that suggest a distant relationship to CACTA transposons from Magnoliophyta. Several sequences from other Metazoa and Fungi code for proteins similar to those encoded by SmTRC1, suggesting that such elements have a common ancestry, and indicating inheritance through vertical transmission before separation of the Eumetazoa, Fungi and Plants.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transposable elements constitute a large portion of the genomes of eukaryotes and play an important role in genome structure and evolution <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. They can be assigned to two broad groups, retroelements (Class I) and DNA-only transposable elements (Class II). Unlike the retroelements, DNA-only transposons do not rely on an RNA intermediate, but transpose directly from DNA using a multi-step cut and paste mechanism catalyzed by a transposase that recognizes the transposon DNA by its Terminal Inverted Repeats (TIRs) (reviewed in <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>). Class II elements in eukaryotes can be categorized on the basis of sequence similarities into nine superfamilies: Mariner-TC1; hAT; P; Mutator; CACTA; PIF/Harbinger; Transib; <it>piggyBac</it>; and Merlin <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>Most elements of the CACTA superfamily of DNA-only transposons (also called En/Spm) contain the core sequence CACTA in their TIRs and so far have only been described in plants <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr></abbrgrp>. The prototypical CACTA maize Suppressor-mutator (Spm) transposon was one of the first transposons described by Barbara McClintock <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, and subsequent studies have shown that most of its length is occupied by a single transcription unit, which can be alternatively spliced to generate four distinct transcripts (<it>tnpA </it>to <it>tnpD</it>) encoding different proteins <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Two of these transcripts, <it>tnpA </it>and <it>tnpD</it>, encode proteins that have been shown to be essential for transposition <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. TNPA protein has been shown to reactivate the methylated transposon promoter and to repress the active unmethylated promoter <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>, while TNPD protein interacts directly with TNPA and stabilizes its binding to DNA <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p><it>Schistosoma mansoni</it>, a digenetic blood fluke, is a major cause of schistosomiasis in humans and an important source of morbidity on a global scale. The disease is endemic in 74 developing countries, infecting about 200 million individuals, and an additional 500&#8211;600 million are estimated to be at risk <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. The <it>S. mansoni </it>genome is approximately 270 Mbp long <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and 55% of its content is expected to comprise mobile elements or other repetitive sequences <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Recently, independent transcriptome and genome sequencing initiatives have provided an extensive repository for the discovery of novel genes and repetitive elements <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Despite the extensive description of retroelements in <it>S. mansoni </it><abbrgrp><abbr bid="B15">15</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>, so far the presence of just a single DNA-only transposon, belonging to the Merlin family, has been reported in this organism <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>Using the public repository of <it>S. mansoni </it>sequence data as a starting point, we describe a novel <it>S. mansoni </it>transposon named SmTRC1, an element that shares several characteristics with plant CACTA transposons, which suggests a distant relationship between these elements. A survey of complete genomes from the Animal and Fungi kingdoms revealed novel families of DNA transposons belonging to the CACTA superfamily.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Isolation of SmTRC1 clones from <it>S. mansoni </it>genome</p>
            </st>
            <p>Our attention was drawn to the <it>S. mansoni </it>genomic sequence of Supercontig_0018735 available at the Wellcome Trust Sanger Institute <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> while we were manually examining the splicing pattern of the gene represented by transcript SmAE C610100.1 <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Three exons of the latter mapped to bases 2032 to 2160, 6905 to 6961 and 7001 to 7231 of Supercontig_0018735. Upon examining the intron formed between bases 2160 and 6905 we detected a 4.5 kbp element that extended between bases 2164 and 6907 with one Open Reading Frame (referred to as SmTRC1-ORF in the following text) of 1,683 bp, which codes for a sequence with similarity (E-value 10<sup>-5</sup>) to a DNA-only transposon. An inverted repeat motif was found at both extremities (the left with 54 bp and the right missing only one base), suggesting that this intron is an inserted mobile element (Figure <figr fid="F1">1B</figr>). Although the element may be considered large (4.5 kbp) in comparison, for example, to Mariner family transposons (1.3 to 2.4 kbp) <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, it is considerably smaller than CACTA elements such as Spm/En (8.3 kbp) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and Rim coding elements (14.1 kbp on average) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We detected several other copies of the element in the <it>S. mansoni </it>genome sequence dataset displaying a perfect 54 bp inverted repeat at both extremities, confirming the sequence and correct length of the Terminal Inverted Repeat (TIR) (Figure <figr fid="F2">2A</figr>). We named these elements SmTRC1 (an abbreviation for <b><it>S</it></b><it>chistosoma </it><b><it>m</it></b><it>ansoni </it><b>T</b>ransposon <b>R</b>elated to <b>C</b>ACTA transposons) because of their several similarities to transposons of the CACTA superfamily, as described in the text below.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>SmTRC1 elements</p>
               </caption>
               <text>
                  <p><b>SmTRC1 elements</b>. <b>A: </b>Agarose gel electrophoresis of typical PCR amplification products of <it>S. mansoni </it>genomic DNA with primers designed from the sequence of SmTRC1 extremities. <b>B: </b>Schematic representation of the SmTRC1 element derived <it>in silico </it>from <it>S. mansoni </it>shotgun genomic sequencing and assembly data obtained from the Sanger Institute (Supercontig 0018735) or from direct sequencing of clones amplified by PCR from genomic DNA obtained in this work (SmTRC1f1 and SmTRC1d1). Black boxes indicate the Terminal Inverted Repeats (TIR). Light gray boxes indicate the predicted SmTRC1-ORF and the dark gray box indicates the Transposase_21 domain within this ORF. The hatched box indicates a region with tandem repeats.</p>
               </text>
               <graphic file="1471-2148-6-89-1"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Transposon inverted and direct repeats</p>
               </caption>
               <text>
                  <p><b>Transposon inverted and direct repeats</b>. <b>A: </b>the complete sequence of SmTRC1 TIR is shown in this panel. Dots represent the transposon sequence not shown in the figure. <b>B: </b>blue boxes show direct repeats flanking <it>S. mansoni </it>and other animal TIRs (in gray). Only part of the <it>S. mansoni </it>TIR is represented in the figure. The dots represent the transposon sequence not shown in the figure. <b>C</b>: examples of target-site duplication created upon SmTRC1 insertion. Examples of alignments of sequences flanking SmTRC1 insertions (S-0000026, S-0000464 and S-0000144) with paralogous genomic sequences lacking transposon insertions (BH202328.1, BN000802.1 and AL620357.1) that were found in the <it>S. mansoni </it>public sequences database. The paralogous "gap" sequence (marked as &#8211;) presumably corresponds to the genomic target sequence before a transposon insertion event. Blue boxes indicate the target-site duplication in the flanking sequence. The number on the side of each sequence represents the supercontig from which it was derived (in the case of transposon inserted sequences) or GenBank accession numbers (in the case of paralogous sequences). <b>D: </b>TIR sequences from diverse CACTA superfamily animal and plant elements. The regions with high and medium levels of identity among the sequences are shown as black and gray columns, respectively.</p>
               </text>
               <graphic file="1471-2148-6-89-2"/>
            </fig>
            <p>We found a 2 bp direct repeat suggestive of target site duplication flanking the inverted repeat in 7 out of 14 (50%) of the copies of these elements analyzed (Figure <figr fid="F2">2B</figr>). Three of these copies were found to be inserted into repetitive elements of the <it>S. mansoni </it>genome. Two of them were inserted into copies of the <it>S. mansoni </it>LTR retrotransposons Saci-1 and Saci-4 <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp> and one into an unidentified repetitive element of which there are at least 40 copies in the preliminary assembly of the <it>S. mansoni </it>genome. When the flanking sequences of these three elements were aligned with paralogous sequence copies of the respective repetitive element [GenBank:<ext-link ext-link-type="gen" ext-link-id="BH202328.1">BH202328.1</ext-link>, GenBank:<ext-link ext-link-type="gen" ext-link-id="BN000802.1">BN000802.1</ext-link>, GenBank:<ext-link ext-link-type="gen" ext-link-id="AL620357.1">AL620357.1</ext-link>] obtained from either the GSS or the nr databases at GenBank, it was clear that the original repetitive element (not having an inserted transposon) contained only one of the direct repeats flanking the SmTRC1 elements (Figure <figr fid="F2">2C</figr>); one of the 2 bp direct repeat motifs was missing from the original repetitive element. This suggests that the direct repeats are in fact target-site duplications created by insertion of the SmTRC1 element. It is well known that transposon integration results in the duplication of a short host sequence at the insertion site and that the length of the target-site duplication is determined by the properties of each transposase <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>; these data therefore provide further evidence for the mobility of SmTRC1 elements.</p>
            <p>Using different combinations of a set of primers designed from the extremities of the transposon sequence from Supercontig_0018735, we performed several PCR reactions to amplify genomic copies of SmTRC1. A typical result is shown in Figure <figr fid="F1">1A</figr>; the major products are approximately 2.5 kbp. Cloning and sequencing of this major band revealed a 2274 bp clone that represented a truncated copy of SmTRC1, which we designated SmTRC1d1 (Figure <figr fid="F1">1B</figr>, bottom). This element displays a truncated ORF that codes for only 98 amino acids out of the 560 deduced from the full-length transposon. The entire sequence of SmTRC1d1 aligns with the full-length genomic transposon, but it lacks part of the ORF and its 3' tandem repeat (Figure <figr fid="F1">1B</figr>), hence its short length. The diversity of lower molecular weight bands generated by PCR (Figure <figr fid="F1">1A</figr>) suggests that SmTRC1 copies of several sizes, differently truncated, must exist in the <it>S. mansoni </it>genome.</p>
            <p>For one set of PCR amplifications with genomic DNA as template, we used a 16 bp region downstream from the left TIR as forward primer, and a sequence composed of 8 bp overlapping the 3'-end of the right TIR plus a 12 bp region immediately downstream as reverse primer. This primer set permitted a copy of approximately 4.7 kb to be amplified; this copy was cloned and sequenced (Figure <figr fid="F1">1B</figr>, middle). The sequence was named SmTRC1f1; it lacked 71 bp at its 5'-end, including the left TIR, owing to the design of the primer used in the amplification reaction, but otherwise it appears to represent an integral copy of SmTRC1 (Figure <figr fid="F1">1B</figr>, middle). In fact, this clone displays 99.8% nucleotide identity (only 9 mismatches over 4675 nucleotides) with the element described in Supercontig_0018735, and one base in the right TIR is deleted in both sequences. This suggests that we had cloned from the PCR products a copy representing the sequence contained in Supercontig_0018735; the nine mismatched bases may have arisen from mutations generated either naturally in the field among the copies from individual parasites, or artificially from the <it>Taq </it>polymerase during the PCR amplification step.</p>
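            <p>The identity figure quoted above follows directly from the mismatch count. A minimal sketch of that arithmetic, assuming an ungapped alignment in which every aligned position is either a match or a mismatch (the function name is illustrative and not part of the original analysis):</p>
            <p>
```python
def percent_identity(mismatches, aligned_length):
    """Percent identity of an ungapped alignment (assumption: no gaps)."""
    if aligned_length == 0:
        raise ValueError("aligned_length must be positive")
    matches = aligned_length - mismatches
    return 100.0 * matches / aligned_length


# Values reported for SmTRC1f1 versus the Supercontig_0018735 copy
identity = percent_identity(9, 4675)
print(f"{identity:.1f}%")  # prints 99.8%
```
            </p>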
         </sec>
         <sec>
            <st>
               <p>Identification of TRC transposons in other species</p>
            </st>
            <p>A BLASTP search against the nr database at GenBank was performed using the protein encoded by SmTRC1-ORF as query. The highest hits produced were with two proteins of unknown function from <it>Schistosoma japonicum </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="AAW24935.1">AAW24935.1</ext-link>] (with E-value 10<sup>-20</sup>) and <it>Anopheles gambiae </it>[GenBank:<ext-link ext-link-type="gen" ext-link-id="EAA01922.3">EAA01922.3</ext-link>] (with E-value 10<sup>-9</sup>). The next best hits (E-value 10<sup>-5</sup>) were with the TNPD proteins of CACTA transposons from <it>Oryza sativa</it>. It is worth noting that the region displaying similarity corresponds exactly to the Transposase_21 domain (Pfam 02992). In fact, as described below, global alignment of this region of the SmTRC1-ORF product with the Transposase_21 domains of CACTA transposons indicates several conserved residues.</p>
            <p>The SmTRC1-ORF sequence was used as query to perform an additional TBLASTN search directly against several complete animal and fungal genomes using the Genomic Blast tool at NCBI. This search produced hits indicating high similarity (E-value 10<sup>-88 </sup>to 10<sup>-30</sup>) between the deduced SmTRC1-ORF translated sequence and translated sequences from genomes of such diverse animals as <it>Strongylocentrotus purpuratus</it>, <it>Ciona intestinalis</it>, <it>Danio rerio </it>and <it>Aedes aegypti</it>. In all these cases, practically the whole protein was aligned, not only the Transposase_21 domain. The search against fungal genomes produced hits indicating moderate similarity (E-values 10<sup>-16 </sup>to 10<sup>-4</sup>) between the deduced SmTRC1-ORF translated sequence and translated sequences from the genomes of e.g. <it>Rhizopus oryzae</it>, <it>Coprinopsis cinerea </it>and <it>Phanerochaete chrysosporium</it>. For several of the above organisms, multiple hits were generated in the TBLASTN searches against the genomic sequence, indicating that more than one copy must be present in their genomes.</p>
            <p>As noted earlier, SmTRC1 has a 54 bp TIR sequence at its extremities, which is consistently present in all copies examined. Also, the <it>S. mansoni </it>TIR sequence has an internal repeat of the motif AAAGGGGAAATAAAG. A TRC element of approximately 5.2 kb was detected in the <it>Tribolium castaneum </it>genomic sequence [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAJJ01002287.1">AAJJ01002287.1</ext-link>], and displays at its extremities an inverted repeat of 27 bp with the sequence CCCTAGTAGCACCGAATATTTGTAAAA. In addition, we found a 10 bp inverted repeat (CCCAGTCAAC) flanking a TRC in an <it>A. aegypti </it>genomic segment [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAGE02020512">AAGE02020512</ext-link>] that delimits a 9.2 kb element. Both elements have a 2 bp direct repeat adjacent to each inverted repeat (Figure <figr fid="F2">2B</figr>), suggesting that target site duplication occurred when the mobile elements were inserted, analogous to the situation in the <it>S. mansoni </it>genomic sequences.</p>
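            <p>The two structural hallmarks discussed above, a terminal inverted repeat and a short flanking direct repeat, can be checked computationally. The following is a minimal sketch only and not the procedure used in this work: it assumes perfect (mismatch-free) repeats, the helper names are hypothetical, and the interior of the toy element and the flanking sequences are invented for illustration. Only the 27 bp <it>T. castaneum </it>TIR is taken from the text.</p>
            <p>
```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA string (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def has_perfect_tir(element, tir_length):
    """True if the last tir_length bases are the reverse complement of the first."""
    left = element[:tir_length].upper()
    right = element[-tir_length:].upper()
    return right == reverse_complement(left)


def flanking_direct_repeat(upstream, downstream, size=2):
    """Return the candidate target-site duplication, or None if flanks differ."""
    left_flank = upstream[-size:].upper()
    right_flank = downstream[:size].upper()
    return left_flank if left_flank == right_flank else None


TIR = "CCCTAGTAGCACCGAATATTTGTAAAA"  # 27 bp T. castaneum TIR quoted in the text
# Toy element: the interior sequence is made up for illustration
toy_element = TIR + "ACGTTGCAAGGT" + reverse_complement(TIR)

print(has_perfect_tir(toy_element, len(TIR)))      # True
print(flanking_direct_repeat("GGATTA", "TACCGT"))  # TA (invented flanks)
```
            </p>
            <p>Real copies often carry mismatches in their TIRs, so a practical search would tolerate a small number of differences instead of requiring exact equality.</p>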
            <p>It is interesting to note that the first 3 bases in the TIRs from the <it>T. castaneum </it>and <it>A. aegypti </it>elements are identical to those in the TIR of <it>S. mansoni </it>(Figure <figr fid="F2">2D</figr>), suggesting that animal TRCs may display a CCC core sequence in their TIRs analogous to the CACTA core sequence in TIRs from CACTA transposons. Comparison of these animal TIRs with those of the plant CACTA transposons showed a high similarity between the <it>T. castaneum </it>TIR core sequence and the CACTA core sequence of plants, with only one mismatched base (Figure <figr fid="F2">2D</figr>). No such level of similarity is found in <it>S. mansoni </it>or <it>A. aegypti </it>TIR core sequences, which display only 3 and 2 coincident bases, respectively. Owing to the low numbers of bases and examples involved, it is difficult to determine whether the matching bases in <it>T. castaneum </it>are coincidental or reflect an evolutionary process.</p>
            <p>We have not been able to characterize the TIRs flanking the ORFs of TRC elements in organisms other than those described above, namely <it>S. mansoni</it>, <it>A. aegypti </it>and <it>T. castaneum</it>. In some animal species, the TIRs may have become unrecognizable because of mutations or have been lost through further recombination. Another possibility is that some of these elements may represent domesticated transposases <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr></abbrgrp>, which no longer transpose but perform another cellular function instead. Further analyses of these animal elements are warranted to determine which is the case for each particular element.</p>
         </sec>
         <sec>
            <st>
               <p>Estimation of the number of copies of <it>Sm</it>TRC1 by Southern blotting</p>
            </st>
            <p>Southern blot analysis using a probe from the 5' end region of SmTRC1 detected multiple bands when hybridized to <it>EcoR</it>I-digested genomic DNA, showing the existence of multiple copies (Figure <figr fid="F3">3</figr>). In parallel experiments, we used the same number of radioactive counts and probes of approximately the same size for SmTRC1 and Saci-2, a previously described <it>S. mansoni </it>retrotransposon <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. This allowed us to compare the two hybridization signals directly to estimate the number of SmTRC1 copies. The hybridization image was processed to measure the signal intensity in each lane. For each digestion, we divided the intensity from SmTRC1 by that from Saci-2. The results from both <it>EcoR</it>I and <it>Stu</it>I digestions show that SmTRC1 has approximately 1/3 of the number of copies of Saci-2 (estimated at 85&#8211;850 by DeMarco <it>et al</it>. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>) (Figure <figr fid="F3">3</figr>). Extrapolation of these data indicates that there are approximately 30&#8211;300 copies of SmTRC1 in the <it>S. mansoni </it>genome. Amplification of SmTRC1 genomic DNA by PCR suggested that most of the genomic copies are not integral, implying that there are only a few full-length copies of SmTRC1 in the genome.</p>
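            <p>The extrapolation above amounts to scaling the published Saci-2 copy-number range by the measured signal ratio. A minimal sketch of that calculation, assuming that hybridization signal is proportional to copy number and taking the ratio as exactly 1/3 (the function name is illustrative):</p>
            <p>
```python
def extrapolate_copies(signal_ratio, reference_range):
    """Scale a reference copy-number range by a hybridization signal ratio."""
    low, high = reference_range
    return signal_ratio * low, signal_ratio * high


# SmTRC1/Saci-2 ratio of about 1/3; Saci-2 estimated at 85 to 850 copies
low, high = extrapolate_copies(1 / 3, (85, 850))
print(round(low), round(high))  # prints 28 283
```
            </p>
            <p>Rounded to one significant figure, these bounds give the range of approximately 30 to 300 copies stated in the text.</p>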
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Southern blotting of SmTRC1</p>
               </caption>
               <text>
                  <p><b>Southern blotting of SmTRC1</b>. <it>S. mansoni </it>genomic DNA (5 &#956;g) digested with the indicated restriction enzyme was loaded in each lane and analyzed by Southern blotting with a specific radiolabeled probe for SmTRC1. A parallel experiment was run with a probe for the Saci-2 retrotransposon <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, which was used as a benchmark. Probes of similar sizes and the same number of radioactive counts were used for each of the two hybridizations. Below the figure, the ratio between the total intensities of the SmTRC1/Saci-2 signals is indicated. This value was calculated for each digestion by integrating the signal from all the bands, and the average and total deviation were obtained by computing data from the two different digestions.</p>
               </text>
               <graphic file="1471-2148-6-89-3"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>SmTRC1 produces multiple spliced transcripts</p>
            </st>
            <p>Mapping of the <it>S. mansoni </it>ESTs available at GenBank dbEST to the full-length genomic copy of SmTRC1 produced a discontinuous alignment of 64 ESTs (Figure <figr fid="F4">4B</figr> shows some representative examples), indicating that messages transcribed from this transposon are subject to alternative splicing. Moreover, several of the predicted introns contain the canonical GT-AG splicing sites at their extremities (Figure <figr fid="F4">4B</figr>). The diversity of patterns obtained for different ESTs mapping to the same <it>locus </it>indicates that a number of variant messages are produced by alternative splicing.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Alternatively spliced forms of SmTRC1 transcripts</p>
               </caption>
               <text>
                  <p><b>Alternatively spliced forms of SmTRC1 transcripts</b>. <b>A: </b>agarose gel electrophoresis of products from an RT-PCR reaction with <it>S. mansoni </it>mRNA using primers designed from the sequence of the extremities of previously deposited ESTs mapping to the full-length SmTRC1 sequence. The "no RT" lane indicates a control in which no reverse transcriptase was added to the reaction medium. <b>B: </b>full-length SmTRC1 sequence (top scheme) and relative mapping positions of five existing ESTs from GenBank (Accession numbers shown next to each) and of three newly sequenced transcripts obtained by cloning the major band derived from the RT-PCR shown in panel A (Clones B1, B2 and B4 as indicated). Black boxes in the top scheme indicate the Terminal Inverted Repeats (TIR); light gray boxes indicate a predicted SmTRC1-ORF and the dark gray box indicates a Transposase_21 domain within this ORF; the hatched box indicates a region with tandem repeats. Thin black bars below the top scheme indicate mapped exons derived from each transcript; a white box indicates a region of a particular transcript not mapped to this specific copy of the SmTRC1 genomic sequence. Thin continuous lines represent junctions between interconnected exons in the transcripts, defining an intron with the canonical GT-AG splicing sites. Dashed lines represent junctions between interconnected exons in the transcripts, defining an intron without the canonical GT-AG splicing sites. Two "A"s indicate the presence of a poly-A tail. <b>C: </b>schematic representation of 3 clones of SmTRC1 transcripts. The scale in this part of the figure is expanded in comparison to that used in part B above. Light gray boxes indicate predicted ORF. Names inside the boxes indicate different hypothetical protein products coded by those transcripts. The asterisk indicates a stop codon present in the transcript but not in the equivalent genomic sequence of the full-length SmTRC1 element.</p>
               </text>
               <graphic file="1471-2148-6-89-4"/>
            </fig>
            <p>Intriguingly, no <it>S. japonicum </it>ESTs with homology to SmTRC1 were found by either BLASTN or TBLASTN searches using the full-length <it>S. mansoni </it>element as query against GenBank dbEST. This contrasts with the 70 <it>S. mansoni </it>ESTs (64 with discontinuous and 6 with continuous alignments) that were found with an E-value lower than 10<sup>-10 </sup>by a BLASTN search against the same database. This leads us to hypothesize that these elements either have a much lower transcriptional activity in <it>S. japonicum </it>than in <it>S. mansoni </it>or have been eliminated from the <it>S. japonicum </it>genome.</p>
            <p>To document the existence of alternatively spliced transcripts in <it>S. mansoni </it>better, we used the sequences at the extremities of many of the 70 ESTs known to align to SmTRC1 to design primers for amplifying additional SmTRC1 transcripts by RT-PCR using mRNA as template. The RT-PCR reaction produced a strong amplification band of approximately 900 bp and sub-products of lower molecular weight, plus a faint amplification band of higher molecular weight, which suggests that full-length transcripts may be expressed at a low level (Figure <figr fid="F4">4A</figr>). No amplification was seen in the control without reverse transcription, indicating that the amplification products were in fact derived from <it>bona fide </it>transcribed messages.</p>
            <p>We cloned and sequenced these different RT-PCR products. Mapping the different sequences to the full-length SmTRC1 genomic sequence confirmed a diversity of splicing patterns, as already suggested by mapping of previously existing GenBank ESTs (Figure <figr fid="F4">4B</figr>). Two of the sequenced clones (Clones B1 and B2) are similar in size to the major band from the RT-PCR reaction, indicating that this band may contain more than one product. These two products did not overlap the ORF region, and the intron formed between their exons 1 and 2 does not display the canonical GT-AG splicing sites. However, mapping of these transcripts to the entire <it>S. mansoni </it>genome showed that canonical splicing sites are present at the extremities of the same intron in some truncated forms of the element (data not shown). This indicates that these transcripts probably originated from truncated copies that are apparently transcriptionally active, being responsible for a considerable fraction of the SmTRC1 transcripts detected.</p>
            <p>Clone B1 has an ORF of 681 bp that is interrupted by a TGA stop codon at bases 439 to 441. However, alignment with the SmTRC1f1 sequence shows substitution of the TGA by CTA, which codes for leucine (Figure <figr fid="F4">4C</figr>). We named this hypothetical protein product SmTRC-PrA (<it>S. mansoni </it>TRC Protein A). In addition, a 4 bp deletion at base 37 of this clone in relation to SmTRC1f1 produces a frame shift at the beginning of the message. This suggests that the transcript was generated from a truncated copy of this element and that degeneration has produced the stop codon interrupting the ORF. Clone B2 has a shorter ORF that codes for a product very similar to the deduced N-terminal amino acids of the ORF product of clone B1; this hypothetical protein product was named SmTRC-PrB.</p>
            <p>Although most of the ESTs exhibit a splicing pattern that does not include the SmTRC1-ORF sequence in any exon, the presence of a few ESTs mapped in the region of the SmTRC1-ORF suggests that it is actually transcribed. We were not able to clone the high molecular weight products directly from the first RT-PCR using primers designed from the extremities of EST sequences, but we designed a primer from the 5'-end of the SmTRC1-ORF sequence and used a primer from the 3' extremity of the transcripts. The ensuing RT-PCR resulted in the amplification of a partial transcript of 1.3 kb (data not shown). Cloning and sequencing of this message showed that it was indeed derived from exons mapping to the SmTRC1-ORF as well as from other exons in the 3' region (Figure <figr fid="F4">4B</figr>, clone B4). It codes for an incomplete hypothetical protein product of 125 amino acids named SmTRC-PrC. A longer, complete version of the message encoding a longer SmTRC-PrC protein is expected to exist, including the Transposase_21 domain in its amino-terminal portion. Clone B4 also exhibits a second ORF coding for a protein of 299 amino acids, 175 of which are shared with SmTRC-PrA (76% of the SmTRC-PrA amino acids). In view of this level of conservation, we named this protein SmTRC-PrA2. Characterization of additional clones could eventually identify further alternatively spliced forms of SmTRC1-derived transcripts.</p>
            <p>The Spm element of maize has been shown to produce four different transcripts (<it>TnpA-D</it>) generated by alternative splicing, each coding for a different protein product <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. One of these transcripts (<it>TnpD</it>) spans the entire region comprising ORFs 1 and 2 predicted in the <it>Spm </it>DNA, which explains the selection pressure that maintains such ORFs in the transposon structure. Similarly, we predict the existence of a spliced transcript spanning the entire SmTRC1-ORF to explain the maintenance of such a large ORF and the conserved amino acid sequence observed on comparison with elements from other animals. The diverse splicing pattern of the transcribed <it>S. mansoni </it>messages results in distinct ORFs coding for different proteins; only the Transposase_21 domain of proteins encoded by these transposons has a detectable similarity with proteins from CACTA transposons. The other portions of the proteins encoded by these alternatively spliced <it>S. mansoni </it>transcripts appear not to be detectably conserved.</p>
            <p>Spm <it>TnpA</it>, a short alternatively spliced transcript that lacks the Transposase_21 domain, is apparently more abundantly transcribed in plants than the other, longer transcripts. This parallels the results of our RT-PCR experiments, which suggest that shorter TRC transcripts are more abundantly transcribed in <it>S. mansoni </it>than the longer alternatively spliced transcript that includes the Transposase_21 domain. <it>Spm TnpD </it>displays two ORFs in tandem, one coding for TNPD and the other for TNPA <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Likewise, clone B4 also exhibits two ORFs in tandem, one coding for SmTRC-PrC and the other for a SmTRC-PrA-like protein.</p>
         </sec>
         <sec>
            <st>
               <p>SmTRC1 ORF contains a conserved Transposase_21 domain</p>
            </st>
            <p>Multiple protein sequence alignment (Figure <figr fid="F5">5</figr>) of the Transposase_21 domain (PFAM# PF02992) was performed using sequences from known CACTA transposons from several plants <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>, together with the related domain in the deduced ORFs from <it>S. mansoni </it>and several other metazoan and fungal elements identified in the present work by BLAST analysis (as described above). Although there is a visible divergence between the domains from elements derived from different phyla, conservation of several residues is apparent (Figure <figr fid="F5">5</figr>). It is also interesting to note that the TRC-like sequence derived from the fungus <it>Cryptococcus neoformans </it>has several characteristics that distinguish it from the other fungal sequences, which are apparently very similar to one another.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Multiple alignment of the Transposase_21 domains of proteins of CACTA related transposons from diverse organisms</p>
               </caption>
               <text>
                  <p><b>Multiple alignment of the Transposase_21 domains of proteins of CACTA related transposons from diverse organisms</b>. Typical plant CACTA transposon sequences from six Magnoliophyta were included in the alignment. In addition, eleven novel CACTA-related elements identified here were included: seven from Eumetazoa and four from Fungi. Shading indicates the level of conservation of each residue. Boxes with Roman numerals I to III indicate conserved motifs of the Transposase_21 domain in all organisms. The box marked A indicates a Transposase_21 motif displayed only by Eumetazoa and Fungi proteins.</p>
               </text>
               <graphic file="1471-2148-6-89-5"/>
            </fig>
            <p>Three different conserved motifs can be discerned, marked I-III in the aligned proteins (Figure <figr fid="F5">5</figr>). In most of the elements, a first conserved motif I/L/V-X-I/L/V/F-X-I/L/V/F-X<sub>2</sub>-D-G-X<sub>3</sub>-F/Y-X<sub>7&#8211;9</sub>-W-P-I/L/V of the Transposase_21 domain is present (Figure <figr fid="F5">5</figr>, box I); however, in 3 out of 4 fungal sequences this motif shows an interchange between tryptophan and glycine residues. This suggests that these two residues have an important role and that only the position of one residue relative to the other is essential for activity of this protein. In the second conserved domain (Figure <figr fid="F5">5</figr>, box II), the fungal proteins show a level of conservation comparable to Magnoliophyta and higher than Eumetazoa. Specifically, only proteins from Fungi and Magnoliophyta have a proline immediately after the conserved glycine residue (Figure <figr fid="F5">5</figr>, box II), and an additional P-L/I conserved motif in the middle of this domain; both are absent from Eumetazoa. Interestingly, we identified a L/V/I-D-X-L/M-H-X<sub>3</sub>-L-G motif in the Transposase_21 domain that is present in the eumetazoan TRC elements (Figure <figr fid="F5">5</figr>, box A) and also in 3 out of the 4 fungal TRC proteins, but is absent in proteins from Magnoliophyta.</p>
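            <p>The motif notation used above translates directly into regular expressions, which can be convenient for screening deduced protein sequences for motif I and motif A. The following is a minimal sketch under stated assumptions: the names MOTIF_I, MOTIF_A and find_motif are hypothetical, X is taken to mean any single residue, and the test sequence is synthetic rather than a real transposase. It is a literal pattern match, not a substitute for the Pfam PF02992 profile.</p>

```python
import re

# Motif I as written in the text (box I), with X read as "any residue":
# I/L/V-X-I/L/V/F-X-I/L/V/F-X2-D-G-X3-F/Y-X7-9-W-P-I/L/V
MOTIF_I = re.compile(r"[ILV].[ILVF].[ILVF].{2}DG.{3}[FY].{7,9}WP[ILV]")

# Motif A as written in the text (box A): L/V/I-D-X-L/M-H-X3-L-G
MOTIF_A = re.compile(r"[LVI]D.[LM]H.{3}LG")


def find_motif(protein, pattern):
    """Return (0-based start, matched text) for each non-overlapping hit."""
    return [(m.start(), m.group()) for m in pattern.finditer(protein.upper())]


# Synthetic peptide constructed only to satisfy motif I (hypothetical data).
SYNTHETIC_CORE = "LAIAFAADGAAAY" + "A" * 8 + "WPV"

if __name__ == "__main__":
    print(find_motif("MK" + SYNTHETIC_CORE + "KR", MOTIF_I))
    print(find_motif("VDALHAAALG", MOTIF_A))
```

            <p>Because the fungal sequences with the tryptophan/glycine interchange deviate from the consensus, a literal pattern of this kind would be expected to miss them; a profile-based search is the more tolerant choice for such cases.</p>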
            <p>Among the conserved residues there are two aspartyl residues (DD), one in the first and one in the third conserved domain, separated by 80 residues; this is very similar to the distance between the two aspartyl residues in the DDE motifs in Tn3 transposases <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. There is also a glutamyl residue (E) in the conserved domain 2 in all but three sequences, two of which have this residue in adjacent positions and one in a position 2 residues away. It is possible that conservation of such an amino acid triad reflects a catalytic mechanism similar to that of the DDE motif despite the different arrangement of residues. In this case its function would be similar to that described for the DDE motif, which is presumed to coordinate divalent metal ions to promote catalysis of DNA cleavage and ligation <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Moreover, there is a conserved CXXC motif in the conserved domain 3 that is identical to the configuration of cysteines in the zinc-finger-like motifs (HHCC domains) of retroviral integrases, suggesting that TRC cysteines may also be involved in DNA binding.</p>
            <p>A phylogenetic tree (Figure <figr fid="F6">6</figr>) was generated from the three conserved regions (I-III) shown in the alignment of Figure <figr fid="F5">5</figr>. Although the analysis does not permit a clear inference of phylogeny within the Eumetazoan elements, it clearly shows the separation of the Eumetazoa and Plant branches (Figure <figr fid="F6">6</figr>). Interestingly, three of the Fungi elements appear to be distinct from the others, but one of them, the <it>C. neoformans </it>element, appears to be at a basal position in relation to plant transposons. With the exception of the latter, the analysis clearly shows the separation between Plants, Fungi and Eumetazoa.</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Phylogenetic tree for the Transposase_21 domains of CACTA-like transposases</p>
               </caption>
               <text>
                  <p><b>Phylogenetic tree for the Transposase_21 domains of CACTA-like transposases</b>. The tree was constructed by the neighbor-joining method using the three conserved regions indicated by boxes I to III in Figure 5 and excluding positions with gaps. Numbers represent the confidence of the branches assigned by bootstrap analysis (in 1,000 samplings); bootstrap values lower than 500 are omitted from the figure. The names indicate a transposon member of the CACTA or of the <ul>T</ul>ransposon <ul>R</ul>elated to <ul>C</ul>ACTA (TRC) family belonging to the organism indicated. Circles indicate the 3 different proposed families of transposons within the CACTA superfamily.</p>
               </text>
               <graphic file="1471-2148-6-89-6"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The DNA-only transposable element SmTRC1 from <it>S. mansoni </it>exhibits various characteristics, such as generation of multiple spliced transcripts, the presence of terminal inverted repeats at the extremities of the elements flanked by direct repeats and the presence of a Transposase_21 domain, that suggest a distant relationship to CACTA transposons from Magnoliophyta. Despite these similarities, conservation of proteins deduced from this new family of transposons in relation to CACTA transposons is restricted to the Transposase_21 domain. The presence in <it>S. mansoni </it>of multiple transcripts and a higher expression level of a shorter alternatively spliced transcript coding for an ORF lacking the Transposase_21 domain suggests that similar strategies are employed by transposons from animals and plants. The absence of conservation between SmTRC-PrA/SmTRC-PrB and plant TNPA protein indicates that these <it>S. mansoni </it>proteins either have a different evolutionary origin or are very divergent. The latter is more probable since we could detect no similarity between the <it>S. mansoni </it>proteins SmTRC-PrA and SmTRC-PrB and any sequences encoded in other animal or fungal TRC elements by BLASTP or TBLASTN comparison to the genomes of these organisms, suggesting that this protein is rapidly evolving. Nevertheless, SmTRC-PrA may perform a role analogous to the DNA binding function described for TNPA in plant transposons, and experimental verification of this hypothesis is warranted.</p>
         <p>Several sequences from Metazoa and Fungi code for proteins similar to those encoded by SmTRC, providing evidence that this superfamily exists in branches other than Plants. Data from phylogenetic analysis of the Transposase_21 domain suggest a common ancestry for such elements, and indicate inheritance through vertical transmission before the separation of Eumetazoa, Fungi and Plants. This organization permits a division of the CACTA superfamily into 3 different families, each represented by one of these branches (circles in Figure <figr fid="F6">6</figr>). The <it>C. neoformans </it>element appears to be an exception, being more closely related to the plant transposon family than to the Fungi family.</p>
         <p>In view of the evolutionary distance between these related elements, the few conserved amino acids of the Transposase_21 domain are likely to be essential for TNPD function. These conserved residues are preferential targets for future mutagenesis experiments to determine the importance of the Transposase_21 domain for TNPD function, as mutations at these positions are expected to abolish or significantly alter the functionality of the domain.</p>
         <p>Discovery of this new transposable element in <it>S. mansoni </it>should help in obtaining a more complete annotation of the genome of this parasite. Apparently DNA transposons are not as widespread as retroelements in <it>S. mansoni</it>, since only 2 elements of the former type (<abbrgrp><abbr bid="B4">4</abbr></abbrgrp>; this report) and 28 of the latter type <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B19">19</abbr></abbrgrp> have been described. Nevertheless, DNA transposons may have a significant impact on the biology of the parasite. Comparison between Merlin and SmTRC1 shows that these elements have very distinct characteristics, the former being more compact (1.4 kbp <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> as opposed to 4.5 kbp) and presenting a slightly higher copy number, with an estimated 500 copies <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> compared to 30&#8211;300 for SmTRC1. On the other hand, most of the copies from both SmTRC1 and Merlin elements appear to be internal deletion derivatives. Several transcripts of both elements have been detected in the <it>S. mansoni </it>EST database, suggesting that both are transcriptionally active.</p>
         <p>In addition, it is interesting to consider the SmTRC1 element as a potential new tool for insertional mutagenesis experiments in <it>Schistosoma </it>and other platyhelminths; CACTA elements have been successfully used for this purpose in plants <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. Moreover, other superfamilies of transposons have been widely used for invertebrate and vertebrate transgene experiments and constitute a valuable tool for analyzing gene function and effects on phenotype <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. Further studies on other Metazoa and Fungi elements from the family described here may provide candidate vectors for several other organisms.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>RT-PCR and genomic DNA PCR</p>
            </st>
            <p>The BH isolates of <it>S. mansoni </it>were maintained in the laboratory by routine passage through mice and snails. Adult parasites were obtained by portal perfusion of hamsters 7 to 8 weeks after infection. Tissues were conserved in RNALater (Ambion) according to the manufacturer's instructions. Tissue mRNAs were extracted using MAC isolation kits (Miltenyi Biotec). mRNA samples were treated with RQ1 RNase-free DNase (1 U/10 &#956;l; Promega) for 30 minutes at 37&#176;C. cDNAs were prepared using the Superscript II first strand synthesis system for RT-PCR (Invitrogen), following the manufacturer's instructions. A parallel control reaction was performed without the addition of reverse transcriptase and used as template for a PCR reaction to detect any genomic DNA contamination. The PCR step was performed using Advantage II polymerase (Clontech) with the buffer supplied by the manufacturer, 200 &#956;M dNTPs, and 200 nM of each primer using the following program: 95&#176;C (1 min); 35 cycles of 95&#176;C (30 s), 55&#176;C (30 s) and 68&#176;C (4 min); and final extension at 68&#176;C (3 min). The reaction products were cloned into pGem-T vector (Promega) and sequenced.</p>
            <p>Genomic DNA was extracted from 800 mg of adult worms (approximately 3,000 worms) using the protocol described by Ausubel et al. <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. PCR reactions were performed using the same conditions as described in the previous paragraph.</p>
         </sec>
         <sec>
            <st>
               <p>Southern Blot</p>
            </st>
            <p>Southern blot experiments for estimating the number of copies of SmTRC1 were performed according to the protocol described by DeMarco et al. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, except that <it>Stu</it>I was used in place of <it>BamH</it>I and that Saci-2 was used as a benchmark. The total signal intensity for each lane was calculated using ImageQuant v5.1 (Molecular Dynamics), with a fixed area rectangle to delimit the area for each sample analyzed.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence alignment and construction of phylogenetic trees</p>
            </st>
            <p>Using the deduced protein sequence of the Transposase_21 domain from SmTRC1, we retrieved several other sequences that showed significant similarity (E-values less than 10<sup>-4</sup>) in a TBLASTN search against genomes of Metazoa and Fungi at NCBI. We used these sequences along with the sequences of known CACTA transposons from Magnoliophyta to perform an alignment using Clustal X v1.83 <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Alignments were imported to the GeneDoc program v2.6.002 for shading of conserved residues. Further analysis with Clustal X and the neighbor-joining method, using the three conserved regions indicated by boxes I to III in Figure <figr fid="F5">5</figr> and excluding positions with gaps, resulted in the phylogenetic tree shown in Figure <figr fid="F6">6</figr>. The confidence of the branches was evaluated by bootstrap analysis using 1,000 samplings. Phylogenetic trees were drawn using Treeview (version 1.6.6) <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The GenBank sequences and accession numbers utilized for construction of alignments and phylogenetic trees are as follows: (1) Transposase family tnp2 members &#8211; <it>Arabidopsis thaliana</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="CAB80813.1">CAB80813.1</ext-link>]; <it>Brassica rapa</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="BAA85462">BAA85462</ext-link>]; <it>Daucus carota</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="BAA20532.1">BAA20532.1</ext-link>]; <it>Oryza sativa</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="DAA02106.1">DAA02106.1</ext-link>]; <it>Zea mays</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAA66266.1">AAA66266.1</ext-link>]; <it>Ipomoea trifida</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAS79612.1">AAS79612.1</ext-link>]; (2) Whole genome DNA sequences &#8211; <it>Aedes aegypti</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAGE02020512">AAGE02020512</ext-link>]; <it>Danio rerio</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="CAAK02057130.1">CAAK02057130.1</ext-link>]; <it>Strongylocentrotus purpuratus</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAGJ01221697.1">AAGJ01221697.1</ext-link>]; <it>Ciona intestinalis</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AABS01001120.1">AABS01001120.1</ext-link>]; <it>Tribolium castaneum</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AAJJ01002287.1">AAJJ01002287.1</ext-link>]; <it>Drosophila pseudoobscura</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AADE01004520.1">AADE01004520.1</ext-link>]; <it>Glomus intraradices</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AC156590">AC156590</ext-link>]; <it>Phakopsora pachyrhizi</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AC149399">AC149399</ext-link>]; <it>Cryptococcus neoformans</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="EAL18770.1">EAL18770.1</ext-link>]; <it>Rhizopus oryzae</it>, [GenBank:<ext-link ext-link-type="gen" ext-link-id="AACW02000214.1">AACW02000214.1</ext-link>].</p>
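            <p>The exclusion of positions with gaps before tree construction can be illustrated with a short sketch. It assumes the alignment is held as a mapping from sequence name to aligned string with "-" as the gap character; the function name drop_gapped_columns and the toy alignment are hypothetical. It illustrates the idea only and is not the Clustal X implementation used in this work.</p>

```python
def drop_gapped_columns(aligned):
    """Remove every alignment column in which any sequence has a gap.

    `aligned` maps sequence names to aligned strings of equal length,
    with "-" marking a gap (an assumption of this sketch).
    """
    names = list(aligned)
    rows = [aligned[name] for name in names]
    if len({len(row) for row in rows}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    # zip(*rows) walks the alignment column by column.
    kept = [column for column in zip(*rows) if "-" not in column]
    return {
        name: "".join(column[i] for column in kept)
        for i, name in enumerate(names)
    }


if __name__ == "__main__":
    toy = {"seq1": "AC-GT", "seq2": "ACTG-", "seq3": "A-TGT"}
    print(drop_gapped_columns(toy))
```

            <p>Only columns that are fully occupied in every sequence survive, so distances are computed over the same set of positions for all pairs of sequences.</p>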
         </sec>
         <sec>
            <st>
               <p>Accession numbers of sequences identified in this work</p>
            </st>
            <p>We have deposited all sequences obtained in this work at EMBL under the following numbers: SmTRC1f1, [EMBL:<ext-link ext-link-type="embl" ext-link-id="AM268206">AM268206</ext-link>]; SmTRC1d1, [EMBL:<ext-link ext-link-type="embl" ext-link-id="AM268205">AM268205</ext-link>]; SmTRC-PrA, [EMBL:<ext-link ext-link-type="embl" ext-link-id="AM268207">AM268207</ext-link>]; SmTRC-PrB, [EMBL:<ext-link ext-link-type="embl" ext-link-id="AM268208">AM268208</ext-link>]; SmTRC-PrC, [EMBL:<ext-link ext-link-type="embl" ext-link-id="AM268209">AM268209</ext-link>]. We have also deposited Third Party Annotations (TPAs) at EMBL for Transposase family Tnp2 members, under the following TPA numbers: <it>Aedes aegypti</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000947">BN000947</ext-link>]; <it>Danio rerio</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000951">BN000951</ext-link>]; <it>Strongylocentrotus purpuratus</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000955">BN000955</ext-link>]; <it>Ciona intestinalis</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000948">BN000948</ext-link>]; <it>Tribolium castaneum</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000946">BN000946</ext-link>]; <it>Drosophila pseudoobscura</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000950">BN000950</ext-link>]; <it>Glomus intraradices</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000952">BN000952</ext-link>]; <it>Phakopsora pachyrhizi</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000953">BN000953</ext-link>]; <it>Rhizopus oryzae</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000954">BN000954</ext-link>]; <it>Cryptococcus neoformans</it>, [EMBL:<ext-link ext-link-type="embl" ext-link-id="BN000949">BN000949</ext-link>].</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>RdeM conceived of the study, carried out the molecular genetic experiments, participated in the sequence alignment and drafted the manuscript. TMV participated in the sequence alignment. SVA participated in the design of the study and coordination and drafted the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by a grant from FAPESP, Funda&#231;&#227;o de Amparo a Pesquisa do Estado de S&#227;o Paulo, and by fellowships from FAPESP and CNPq, Conselho Nacional de Desenvolvimento Cient&#237;fico e Tecnol&#243;gico, Brasil. The technical assistance of Renato Alvarenga is acknowledged.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The evolutionary dynamics of repetitive DNA in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Charlesworth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sniegowski</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Stephan</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1994</pubdate>
            <volume>371</volume>
            <issue>6494</issue>
            <fpage>215</fpage>
            <lpage>220</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/371215a0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8078581</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Transposable elements as sources of variation in animals and plants</p>
            </title>
            <aug>
               <au>
                  <snm>Kidwell</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Lisch</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1997</pubdate>
            <volume>94</volume>
            <issue>15</issue>
            <fpage>7704</fpage>
            <lpage>7711</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">33680</pubid>
                  <pubid idtype="pmpid" link="fulltext">9223252</pubid>
                  <pubid idtype="doi">10.1073/pnas.94.15.7704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Mobile DNA II</p>
            </title>
            <aug>
               <au>
                  <snm>Craig</snm>
                  <fnm>NL</fnm>
               </au>
               <au>
                  <snm>Craigie</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gellert</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lambowitz</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <publisher>Washington, D.C., ASM Press</publisher>
            <pubdate>2002</pubdate>
            <fpage>xviii, 1204 p., [32] p. of plates</fpage>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Merlin, a new superfamily of DNA transposons identified in diverse animal genomes and related to bacterial IS1016 insertion sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <issue>9</issue>
            <fpage>1769</fpage>
            <lpage>1780</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh188</pubid>
                  <pubid idtype="pmpid" link="fulltext">15190130</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Dynamics and evolution of transposable elements</p>
            </title>
            <aug>
               <au>
                  <snm>Capy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bazin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higuet</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Langin</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Molecular biology intelligence unit</source>
            <publisher>Austin, Tex; New York, Landes Bioscience; North American distributor Chapman &amp; Hall</publisher>
            <pubdate>1998</pubdate>
            <fpage>197 p.</fpage>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The origin and behavior of mutable loci in maize</p>
            </title>
            <aug>
               <au>
                  <snm>McClintock</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1950</pubdate>
            <volume>36</volume>
            <issue>6</issue>
            <fpage>344</fpage>
            <lpage>355</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1063197</pubid>
                  <pubid idtype="pmpid">15430309</pubid>
                  <pubid idtype="doi">10.1073/pnas.36.6.344</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Essential large transcripts of the maize Spm transposable element are generated by alternative splicing</p>
            </title>
            <aug>
               <au>
                  <snm>Masson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Banks</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Fedoroff</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1989</pubdate>
            <volume>58</volume>
            <issue>4</issue>
            <fpage>755</fpage>
            <lpage>765</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(89)90109-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">2548734</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The tnpA and tnpD gene products of the Spm element are required for transposition in tobacco</p>
            </title>
            <aug>
               <au>
                  <snm>Masson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Strem</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fedoroff</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1991</pubdate>
            <volume>3</volume>
            <issue>1</issue>
            <fpage>73</fpage>
            <lpage>85</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">159980</pubid>
                  <pubid idtype="pmpid" link="fulltext">1668614</pubid>
                  <pubid idtype="doi">10.1105/tpc.3.1.73</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Excision of the En/Spm transposable element of Zea mays requires two element-encoded proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Frey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Reinecke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grant</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gierl</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>EMBO Journal</source>
            <pubdate>1990</pubdate>
            <volume>9</volume>
            <issue>12</issue>
            <fpage>4037</fpage>
            <lpage>4044</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">552176</pubid>
                  <pubid idtype="pmpid">2174354</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Epigenetic regulation of the maize Spm transposable element: novel activation of a methylated promoter by TnpA</p>
            </title>
            <aug>
               <au>
                  <snm>Schlappi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Raina</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fedoroff</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1994</pubdate>
            <volume>77</volume>
            <issue>3</issue>
            <fpage>427</fpage>
            <lpage>437</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0092-8674(94)90157-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">8181061</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Inducible DNA demethylation mediated by the maize Suppressor-mutator transposon-encoded TnpA protein</p>
            </title>
            <aug>
               <au>
                  <snm>Cui</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fedoroff</snm>
                  <fnm>NV</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2002</pubdate>
            <volume>14</volume>
            <issue>11</issue>
            <fpage>2883</fpage>
            <lpage>2899</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">152734</pubid>
                  <pubid idtype="pmpid" link="fulltext">12417708</pubid>
                  <pubid idtype="doi">10.1105/tpc.006163</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Concerted formation of macromolecular Suppressor-mutator transposition complexes</p>
            </title>
            <aug>
               <au>
                  <snm>Raina</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Schlappi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Karunanandaa</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Elhofy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fedoroff</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <issue>15</issue>
            <fpage>8526</fpage>
            <lpage>8531</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">21109</pubid>
                  <pubid idtype="pmpid" link="fulltext">9671711</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.15.8526</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>WHO Technical Report Series 912: prevention and control of schistosomiasis and soil-transmitted helminthiasis</p>
            </title>
            <aug>
               <au>
                  <cnm>WHO-Geneve</cnm>
               </au>
            </aug>
            <publisher>Geneva, World Health Organization</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The genome of Schistosoma mansoni: isolation of DNA, its size, bases and repetitive sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Sher</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>McCutchan</snm>
                  <fnm>TF</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>1982</pubdate>
            <volume>6</volume>
            <issue>2</issue>
            <fpage>125</fpage>
            <lpage>137</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0166-6851(82)90070-6</pubid>
                  <pubid idtype="pmpid">6182465</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Mobile genetic elements colonizing the genomes of metazoan parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Brindley</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Laha</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>McManus</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Loukas</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Trends Parasitol</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>2</issue>
            <fpage>79</fpage>
            <lpage>87</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1471-4922(02)00061-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12586476</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Transcriptome analysis of the acoelomate human parasite Schistosoma mansoni</p>
            </title>
            <aug>
               <au>
                  <snm>Verjovski-Almeida</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>DeMarco</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Martins</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Guimaraes</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Ojopi</snm>
                  <fnm>EP</fnm>
               </au>
               <au>
                  <snm>Paquola</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Piazza</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Nishiyama</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Kitajima</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Adamson</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Ashton</snm>
                  <fnm>PD</fnm>
               </au>
               <au>
                  <snm>Bonaldo</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Dillon</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Farias</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Gregorio</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Leite</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Malaquias</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Marques</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Miyasato</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Nascimento</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Ohlweiler</snm>
                  <fnm>FP</fnm>
               </au>
               <au>
                  <snm>Reis</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Ribeiro</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Sa</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Stukart</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Gargioni</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kawano</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rodrigues</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Madeira</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Menck</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Setubal</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Leite</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Dias-Neto</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2003</pubdate>
            <volume>35</volume>
            <issue>2</issue>
            <fpage>148</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1237</pubid>
                  <pubid idtype="pmpid" link="fulltext">12973350</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Advances in schistosome genomics</p>
            </title>
            <aug>
               <au>
                  <snm>El-Sayed</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Bartholomeu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ivens</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>LoVerde</snm>
                  <fnm>PT</fnm>
               </au>
            </aug>
            <source>Trends Parasitol</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>4</issue>
            <fpage>154</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pt.2004.02.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15099549</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Saci-1, -2 and -3 and Perere, four novel retrotransposons with high transcriptional activities from the human parasite Schistosoma mansoni</p>
            </title>
            <aug>
               <au>
                  <snm>DeMarco</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kowaltowski</snm>
                  <fnm>AT</fnm>
               </au>
               <au>
                  <snm>Machado</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Soares</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Gargioni</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kawano</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rodrigues</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Madeira</snm>
                  <fnm>AMBN</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Menck</snm>
                  <fnm>CFM</fnm>
               </au>
               <au>
                  <snm>Setubal</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Dias-Neto</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Leite</snm>
                  <fnm>LCC</fnm>
               </au>
               <au>
                  <snm>Verjovski-Almeida</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Virol</source>
            <pubdate>2004</pubdate>
            <volume>78</volume>
            <issue>6</issue>
            <fpage>2967</fpage>
            <lpage>2978</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353769</pubid>
                  <pubid idtype="pmpid" link="fulltext">14990715</pubid>
                  <pubid idtype="doi">10.1128/JVI.78.6.2967-2978.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Identification of 18 new transcribed retrotransposons in Schistosoma mansoni</p>
            </title>
            <aug>
               <au>
                  <snm>DeMarco</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Machado</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Bisson-Filho</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Verjovski-Almeida</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Biochem Biophys Res Commun</source>
            <pubdate>2005</pubdate>
            <volume>333</volume>
            <issue>1</issue>
            <fpage>230</fpage>
            <lpage>240</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.bbrc.2005.05.080</pubid>
                  <pubid idtype="pmpid" link="fulltext">15939396</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Wellcome Trust Sanger Institute S. mansoni genome project</p>
            </title>
            <url>http://www.sanger.ac.uk/Projects/S_mansoni/</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Resident aliens: the Tc1/mariner superfamily of transposable elements</p>
            </title>
            <aug>
               <au>
                  <snm>Plasterk</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Izsvak</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Ivics</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <issue>8</issue>
            <fpage>326</fpage>
            <lpage>332</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(99)01777-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">10431195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Genomic characterization of Rim2/Hipa elements reveals a CACTA-like transposon superfamily with unique features in the rice genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Tian</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>ZK</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>ZH</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2003</pubdate>
            <volume>270</volume>
            <issue>3</issue>
            <fpage>234</fpage>
            <lpage>242</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00438-003-0918-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">14513364</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Molecular domestication--more than a sporadic episode in evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Miller</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Nouaud</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Anxolabehere</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>1999</pubdate>
            <volume>107</volume>
            <issue>1-3</issue>
            <fpage>197</fpage>
            <lpage>207</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1004070603792</pubid>
                  <pubid idtype="pmpid" link="fulltext">10952213</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>An Arabidopsis hAT-like transposase is essential for plant development</p>
            </title>
            <aug>
               <au>
                  <snm>Bundock</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hooykaas</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>436</volume>
            <issue>7048</issue>
            <fpage>282</fpage>
            <lpage>284</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03667</pubid>
                  <pubid idtype="pmpid" link="fulltext">16015335</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Homologs of Drosophila P transposons were mobile in zebrafish but have been domesticated in a common ancestor of chicken and human</p>
            </title>
            <aug>
               <au>
                  <snm>Hammer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Strehl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hagemann</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>4</issue>
            <fpage>833</fpage>
            <lpage>844</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi068</pubid>
                  <pubid idtype="pmpid" link="fulltext">15616143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Birth of a chimeric primate gene by capture of the transposase gene from a mobile element</p>
            </title>
            <aug>
               <au>
                  <snm>Cordaux</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Udit</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Batzer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>21</issue>
            <fpage>8101</fpage>
            <lpage>8106</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0601161103</pubid>
                  <pubid idtype="pmpid" link="fulltext">16672366</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Molecular analysis of the En/Spm transposable element system of Zea mays</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cuypers</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gierl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schwarz-Sommer</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>EMBO Journal</source>
            <pubdate>1986</pubdate>
            <volume>5</volume>
            <issue>5</issue>
            <fpage>835</fpage>
            <lpage>841</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1166871</pubid>
                  <pubid idtype="pmpid">15957213</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Molecular paleontology of transposable elements from Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>1999</pubdate>
            <volume>107</volume>
            <issue>1-3</issue>
            <fpage>27</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1004030922447</pubid>
                  <pubid idtype="pmpid" link="fulltext">10952195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Catalytic center quest: comparison of transposases belonging to the Tn3 family reveals an invariant triad of acidic amino acid residues</p>
            </title>
            <aug>
               <au>
                  <snm>Yurieva</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Nikiforov</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Biochem Mol Biol Int</source>
            <pubdate>1996</pubdate>
            <volume>38</volume>
            <issue>1</issue>
            <fpage>15</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8932514</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The behaviour of the autonomous maize transposable element En/Spm in Arabidopsis thaliana allows efficient mutagenesis</p>
            </title>
            <aug>
               <au>
                  <snm>Wisman</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Cardon</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Fransz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>1998</pubdate>
            <volume>37</volume>
            <issue>6</issue>
            <fpage>989</fpage>
            <lpage>999</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1006082009151</pubid>
                  <pubid idtype="pmpid" link="fulltext">9700071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Multiple independent defective suppressor-mutator transposon insertions in Arabidopsis: a tool for functional genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Tissier</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Marillonnet</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Klimyuk</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Torres</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>1999</pubdate>
            <volume>11</volume>
            <issue>10</issue>
            <fpage>1841</fpage>
            <lpage>1852</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">144107</pubid>
                  <pubid idtype="pmpid" link="fulltext">10521516</pubid>
                  <pubid idtype="doi">10.1105/tpc.11.10.1841</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A current perspective on insect gene transformation</p>
            </title>
            <aug>
               <au>
                  <snm>Handler</snm>
                  <fnm>AM</fnm>
               </au>
            </aug>
            <source>Insect Biochem Mol Biol</source>
            <pubdate>2001</pubdate>
            <volume>31</volume>
            <issue>2</issue>
            <fpage>111</fpage>
            <lpage>128</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0965-1748(00)00159-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">11164334</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Mammalian germ-line transgenesis by transposition</p>
            </title>
            <aug>
               <au>
                  <snm>Dupuy</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Carlson</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Fritz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Davidson</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Markley</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Finley</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fletcher</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Ekker</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Hackett</snm>
                  <fnm>PB</fnm>
               </au>
               <au>
                  <snm>Horn</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Largaespada</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>7</issue>
            <fpage>4495</fpage>
            <lpage>4499</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">123676</pubid>
                  <pubid idtype="pmpid" link="fulltext">11904379</pubid>
                  <pubid idtype="doi">10.1073/pnas.062630599</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Sleeping beauty transposition: biology and applications for molecular therapy</p>
            </title>
            <aug>
               <au>
                  <snm>Izsvak</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Ivics</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Mol Ther</source>
            <pubdate>2004</pubdate>
            <volume>9</volume>
            <issue>2</issue>
            <fpage>147</fpage>
            <lpage>156</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ymthe.2003.11.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">14759798</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Preparation of genomic DNA from mammalian tissue</p>
            </title>
            <aug>
               <au>
                  <snm>Ausubel</snm>
                  <fnm>FM</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kingston</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Seidman</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Current Protocols in Molecular Biology</source>
            <publisher>Hoboken, John Wiley &amp; Sons</publisher>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>24</issue>
            <fpage>4876</fpage>
            <lpage>4882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9396791</pubid>
                  <pubid idtype="doi">10.1093/nar/25.24.4876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>TreeView: an application to display phylogenetic trees on personal computers</p>
            </title>
            <aug>
               <au>
                  <snm>Page</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1996</pubdate>
            <volume>12</volume>
            <issue>4</issue>
            <fpage>357</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8902363</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
