<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-409</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Diversity and structure of <it>PIF/Harbinger</it>-like elements in the genome of <it>Medicago truncatula</it></p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Grzebelus</snm>
               <fnm>Dariusz</fnm>
               <insr iid="I1"/>
               <email>dgrzebel@ogr.ar.krakow.pl</email>
            </au>
            <au id="A2">
               <snm>Lasota</snm>
               <fnm>Slawomir</fnm>
               <insr iid="I2"/>
               <email>S.Lasota@mimuw.edu.pl</email>
            </au>
            <au id="A3">
               <snm>Gambin</snm>
               <fnm>Tomasz</fnm>
               <insr iid="I3"/>
               <email>tgambin@gmail.com</email>
            </au>
            <au id="A4">
               <snm>Kucherov</snm>
               <fnm>Gregory</fnm>
               <insr iid="I4"/>
               <email>Gregory.Kucherov@lifl.fr</email>
            </au>
            <au id="A5">
               <snm>Gambin</snm>
               <fnm>Anna</fnm>
               <insr iid="I2"/>
               <email>A.Gambin@mimuw.edu.pl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Genetics, Plant Breeding and Seed Science, Agricultural University of Krakow, Al. 29 Listopada 54, 31-425 Krakow, Poland</p>
            </ins>
            <ins id="I2">
               <p>Institute of Informatics, Warsaw University, Banacha 2, 02-097 Warsaw, Poland</p>
            </ins>
            <ins id="I3">
               <p>Institute of Computer Science, Warsaw University of Technology, Nowowiejska 15/19, 00-665 Warsaw, Poland</p>
            </ins>
            <ins id="I4">
               <p>LIFL/CNRS/INRIA, Bat. M3 59655 Villeneuve d'Ascq, Lille, France</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>409</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/409</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17996080</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-409</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>11</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>09</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>09</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Grzebelus et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Transposable elements constitute a significant fraction of plant genomes. The <it>PIF/Harbinger </it>superfamily includes DNA transposons (class II elements) carrying terminal inverted repeats and producing a 3 bp target site duplication upon insertion. The presence of an ORF coding for the DDE/DDD transposase, required for transposition, is characteristic of the autonomous <it>PIF/Harbinger</it>-like elements. Based on the above features, <it>PIF/Harbinger</it>-like elements were identified in several plant genomes and divided into several evolutionary lineages. The availability of a significant portion of the <it>Medicago truncatula </it>genomic sequence allowed for mining <it>PIF/Harbinger</it>-like elements, starting from a single previously described element, <it>MtMaster</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Twenty-two putative autonomous elements, i.e. those carrying an ORF coding for TPase and complete terminal inverted repeats, and 67 non-autonomous <it>PIF/Harbinger</it>-like elements were found in the genome of <it>M. truncatula</it>. They were divided into five families, <it>MtPH-A5</it>, <it>MtPH-A6</it>, <it>MtPH-D</it>, <it>MtPH-E</it>, and <it>MtPH-M</it>, corresponding to three previously identified and two new lineages. The largest families, <it>MtPH-A6 </it>and <it>MtPH-M</it>, were further divided into four and three subfamilies, respectively. Non-autonomous elements were usually direct deletion derivatives of the putative autonomous element; however, other types of rearrangements, including inversions and nested insertions, were also observed. An interesting structural characteristic &#8211; the presence of 60 bp tandem repeats &#8211; was observed in a group of elements of subfamily <it>MtPH-A6-4</it>. Some families could be related to miniature inverted repeat elements (MITEs). The presence of empty <it>loci </it>(RESites), paralogous to those flanking the identified transposable elements, both autonomous and non-autonomous, as well as the presence of transposon insertion-related size polymorphisms, confirmed that some of the mined elements were capable of transposition.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The population of <it>PIF/Harbinger</it>-like elements in the genome of <it>M. truncatula </it>is diverse. A detailed intra-family comparison of the elements' structure showed that they proliferated in the genome generally following the model of abortive gap repair. However, the presence of tandem repeats facilitated more pronounced rearrangements of the elements' internal regions. The insertion polymorphism of the <it>MtPH </it>elements and related MITE families in different populations of <it>M. truncatula</it>, if further confirmed experimentally, could be used as a source of molecular markers complementary to other marker systems.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transposable elements (TEs) are dispersed repetitive sequences constituting a major fraction of plant genomes, ranging from 10% of the <it>Arabidopsis thaliana </it>genome <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> to an estimated value of over 70% of the maize genome <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Class I elements (retrotransposons), transposing via an RNA intermediate, form the most abundant fraction, while class II elements (DNA transposons) use a 'cut and paste' mechanism for transposition and are usually less numerous.</p>
         <p>Advances in genome sequencing of model plant species enabled systematic, computer-based studies towards the identification of repetitive sequences, including those representing putative TEs. The presence of certain structural characteristics of particular groups of TEs allowed the development of a range of strategies for <it>de novo </it>or homology-based identification of novel elements. A number of methods for automatic mining of transposable elements were developed <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. To date, two model plant genomes, i.e. <it>A. thaliana </it>and <it>Oryza sativa </it>(rice), have been extensively studied <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>.</p>
         <p>Founder members of the <it>PIF/Harbinger </it>superfamily of class II TEs were identified in maize <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and <it>A. thaliana </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Other full-length elements were subsequently found in rice (<it>Pong </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp>), carrot, and <it>M. truncatula </it>(<it>Master </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>). Autonomous <it>PIF/Harbinger</it>-like elements carry 14&#8211;25 bp long terminal inverted repeats (TIRs) flanked by 3 bp long (TTA/TAA) target site duplications (TSDs), and encode a DDD/DDE transposase showing similarity to that of the bacterial IS5 insertion sequence. The group of <it>PIF/Harbinger</it>-like elements was shown to be widespread in the plant kingdom and composed of two easily distinguishable subgroups, i.e. <it>PIF </it>and <it>Pong </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Elements representing both subgroups were related to certain miniature inverted repeat elements (MITEs), like <it>Tourist </it>in maize <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B16">16</abbr></abbrgrp> and <it>mPING </it>in rice <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p><it>Medicago truncatula </it>(barrel medic) has been chosen as a model plant for the Fabaceae family, primarily to study relationships between plants and their symbiotic microbes. It has a relatively small genome of 500 to 600 Mbp <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, has an annual growth habit, and is self-fertile. The genome of <it>M. truncatula </it>has not been extensively analysed with respect to TE identification. A MITE element, <it>Bigfoot</it>, was reported in the genomes of <it>M. truncatula </it>and <it>M. sativa </it><abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, a set of <it>Ty3</it>/gypsy-like <it>Ogre </it>elements characteristic of legume species was described in <it>M. truncatula </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, and several other <it>M. truncatula </it>elements were briefly characterized in the Repbase Update database <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. A recent study of another model legume, <it>Lotus japonicus</it>, identified a number of <it>PIF</it>- and <it>Pong</it>-like elements and strong evidence for their recent amplification in the host genome <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>.</p>
         <p>In this paper we used the accumulated <it>M. truncatula </it>genomic sequence data to identify putative TEs belonging to the <it>PIF/Harbinger </it>superfamily and related to a previously characterized <it>MtMaster </it>element <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Our study therefore focused on the identification and in-depth characterization of a strictly defined group of full-length (putative autonomous and non-autonomous) TEs carrying not only a <it>PIF/Harbinger</it>-specific transposase, but also a particular TIR motif characteristic of most of the <it>PIF</it>-like, but not of the <it>Pong</it>-like elements.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Identification and phylogeny of <it>PIF/Harbinger</it>-like elements of <it>M. truncatula</it></p>
            </st>
            <p>The initial search of the <it>M. truncatula </it>genome aimed at the identification of putative autonomous elements, i.e. those carrying an ORF showing homology to the predicted <it>MtMaster </it>TPase (transposase) protein sequence <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and flanked by terminal inverted repeats of at least 14 bp, containing the G(N)<sub>5</sub>GTT motif, and followed by a 3 bp-long TSD (TAA or TTA). This resulted in 44 sequences showing significant homology (E-value &lt; 10<sup>-20</sup>) to the TPase, after eliminating the redundancy coming from overlapping BACs. We obtained precisely the same hits using the whole TPase sequence and the DDE region, likely because of the very rigorous E-value threshold imposed during the search. Of the identified sequences, 22 were flanked by TIRs and TSDs characteristic of <it>PIF/Harbinger</it>-like elements and these were assumed to represent complete transposable elements. They ranged in length from 2,180 to 25,288 bp. In 11 of these elements, another coding region, similar to the <it>MtMaster </it>orf1 with E-value ranging from 10<sup>-4</sup> to 10<sup>-99</sup>, could be found. The relative order of both ORFs was variable &#8211; five elements had orf1 upstream and six downstream of the TPase (Table <tblr tid="T1">1</tblr>).</p>
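            <p>The structural criteria stated above (a TIR of at least 14 bp containing the G(N)<sub>5</sub>GTT motif, flanked by a TAA or TTA TSD) can be illustrated with a short check. This is a minimal sketch only and not the pipeline used in this study: it assumes a perfect, mismatch-free TIR, expects a candidate sequence that includes the 3 bp flank on each side, and accepts the motif anywhere within the TIR. All function names are hypothetical.</p>
            <p>
```python
import re

# Assumption: sequences use only the unambiguous bases A, C, G, T.
COMPLEMENT = str.maketrans("ACGT", "TGCA")
MOTIF = re.compile(r"G[ACGT]{5}GTT")   # the G(N)5GTT motif
TSDS = {"TAA", "TTA"}                  # accepted target site duplications


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def tir_length(element):
    """Length of the perfect terminal inverted repeat of an element.

    The 5' end is compared base by base with the reverse complement
    of the 3' end; counting stops at the first mismatch (assumption:
    no mismatches or gaps are tolerated in this sketch).
    """
    rc = reverse_complement(element)
    half = len(element) // 2
    n = 0
    for a, b in zip(element[:half], rc[:half]):
        if a != b:
            break
        n += 1
    return n


def looks_like_pif_element(seq, min_tir=14):
    """Check a candidate given as TSD + element + TSD."""
    seq = seq.upper()
    left_tsd, element, right_tsd = seq[:3], seq[3:-3], seq[-3:]
    # Both flanks must be the same 3 bp duplication, TAA or TTA.
    if left_tsd != right_tsd or left_tsd not in TSDS:
        return False
    n = tir_length(element)
    # The TIR must be long enough and must contain the motif.
    return n >= min_tir and MOTIF.search(element[:n]) is not None


# Toy candidate: a 14 bp TIR carrying the motif, with a short internal region.
tir = "GGGCACTGTTACAA"
candidate = "TAA" + tir + "CCCCCCCCCC" + reverse_complement(tir) + "TAA"
print(looks_like_pif_element(candidate))
```
            </p>
            <p>In practice, TIRs are often imperfect, so a realistic search would tolerate a limited number of mismatches; the strict comparison here is chosen only to keep the example short.</p>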
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Characteristics of the core <it>PIF/Harbinger</it>-like elements of <it>M. truncatula</it></p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Element</p>
                     </c>
                     <c ca="left">
                        <p>GenBank sequence no.</p>
                     </c>
                     <c ca="left">
                        <p>Position (first base-last base)</p>
                     </c>
                     <c ca="left">
                        <p>Element length</p>
                     </c>
                     <c ca="left">
                        <p>TPase/orf1 orientation</p>
                     </c>
                     <c ca="left">
                        <p>No. of introns in TPase</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A5-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC132565">AC132565</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>126754&#8211;132718</p>
                     </c>
                     <c ca="left">
                        <p>5965 bp</p>
                     </c>
                     <c ca="left">
                        <p>TP > orf1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A6-1-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC151598">AC151598</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>118204&#8211;122278</p>
                     </c>
                     <c ca="left">
                        <p>4075 bp</p>
                     </c>
                     <c ca="left">
                        <p>TP > orf1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A6-2-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC122722">AC122722</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>63283&#8211;67500</p>
                     </c>
                     <c ca="left">
                        <p>4218 bp</p>
                     </c>
                     <c ca="left">
                        <p>TP > orf1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A6-3-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC144563">AC144563</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>2339&#8211;7183 (-)*</p>
                     </c>
                     <c ca="left">
                        <p>4845 bp</p>
                     </c>
                     <c ca="left">
                        <p>TP > orf1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A6-4-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC146704">AC146704</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>67498&#8211;72196</p>
                     </c>
                     <c ca="left">
                        <p>4699 bp</p>
                     </c>
                     <c ca="left">
                        <p>TP > orf1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-D-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC135566">AC135566</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>96556&#8211;99715 (-)</p>
                     </c>
                     <c ca="left">
                        <p>3160 bp</p>
                     </c>
                     <c ca="left">
                        <p>TP > orf1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-E-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC135606">AC135606</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>48232&#8211;52188</p>
                     </c>
                     <c ca="left">
                        <p>3957 bp</p>
                     </c>
                     <c ca="left">
                        <p>no orf1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-E-IIa</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC139748">AC139748</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>47216&#8211;50597</p>
                     </c>
                     <c ca="left">
                        <p>3382 bp</p>
                     </c>
                     <c ca="left">
                        <p>no orf1</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M-1-Ia (MtMaster)</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC144478">AC144478</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>46234&#8211;51373</p>
                     </c>
                     <c ca="left">
                        <p>5140 bp</p>
                     </c>
                     <c ca="left">
                        <p>orf1 > TP</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M-1-IIa</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC146861">AC146861</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>104340&#8211;109602 (-)</p>
                     </c>
                     <c ca="left">
                        <p>5006 bp</p>
                     </c>
                     <c ca="left">
                        <p>orf1 > TP</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M-2-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC160098">AC160098</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>52670&#8211;58188</p>
                     </c>
                     <c ca="left">
                        <p>5519 bp</p>
                     </c>
                     <c ca="left">
                        <p>orf1 > TP</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M-2-IIa</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="AC149306">AC149306</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>56522&#8211;61824</p>
                     </c>
                     <c ca="left">
                        <p>5303 bp</p>
                     </c>
                     <c ca="left">
                        <p>orf1 > TP</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M-3-Ia</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <ext-link ext-link-type="gen" ext-link-id="CR962122">CR962122</ext-link>
                        </p>
                     </c>
                     <c ca="left">
                        <p>73712&#8211;77759</p>
                     </c>
                     <c ca="left">
                        <p>4048 bp</p>
                     </c>
                     <c ca="left">
                        <p>orf1 > TP</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>* (-) indicates that the sequence of the TE is reverse complement of the original BAC sequence</p>
               </tblfn>
            </tbl>
            <p>A phylogenetic analysis of the DDE domain region of the TPase revealed that the <it>M. truncatula PIF/Harbinger</it>-like elements could be divided into five lineages. Nine elements, including the previously described <it>MtMaster</it>, were grouped into lineage M, together with carrot <it>DcMaster </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. In seven of these elements the orf1 preceded the TPase as expected, while for the remaining two the orf1 was absent, most likely because of an internal deletion. Eight elements formed a new lineage designated as A6. As is typical for group A, the orf1 was located downstream of the TPase in the five elements carrying both coding regions. Another new lineage, designated as E, was formed by two elements. In neither of them could the orf1 be identified. Two other elements were included in lineage A5, together with maize <it>ZmPIF </it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, and one was placed in lineage D (Figure <figr fid="F1">1</figr>). However, in the latter case the orf1 was located downstream of the TPase, contrary to previously described elements from that lineage <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Neighbor-joining tree representing the diversity of the <it>M. truncatula PIF/Harbinger</it>-like elements in relation to other previously identified TEs</p>
               </caption>
               <text>
                  <p><b>Neighbor-joining tree representing the diversity of the <it>M. truncatula PIF/Harbinger</it>-like elements in relation to other previously identified TEs</b>. Lineages are marked with colored rectangles and letters; numbers show bootstrap values obtained using 1000 replicates.</p>
               </text>
               <graphic file="1471-2164-8-409-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Diversity and abundance of <it>PIF/Harbinger</it>-like elements in <it>M. truncatula</it></p>
            </st>
            <p>In addition to the TPase-containing elements described above, using a strategy outlined in the Methods section, we identified an additional 67 elements lacking any coding capacity and thus considered as non-autonomous. A list of all identified elements and their coordinates is given in Additional File <supplr sid="S1">1</supplr>. The grouping of the identified transposable elements was based on the full element sequence similarity or 5' and 3' terminal sequence similarity using two approaches: hierarchical clustering and multidimensional scaling (Additional File <supplr sid="S2">2</supplr>). This strategy allowed us to define families and subfamilies of <it>PIF/Harbinger</it>-like transposable elements in <it>M. truncatula </it>(Table <tblr tid="T2">2</tblr>), where families essentially reflected the lineages previously identified on the basis of the TPase phylogeny, and subfamilies grouped elements carrying homologous TIRs (Table <tblr tid="T3">3</tblr>) and showing a degree of overall DNA sequence similarity. For all but two subfamilies, one or two putatively autonomous core elements could be identified. The exception was the low-copy-number family <it>MtPH-E</it>, for which none of the elements contained a region homologous to the orf1.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>List of all <it>PIF/Harbinger</it>-like elements identified in the course of the study in the genome of <it>Medicago truncatula</it>.</p>
               </text>
               <file name="1471-2164-8-409-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Similarity-based grouping of <it>M. truncatula PIF/Harbinger</it>-like elements.</b> Results of multidimensional scaling (MDS): A. whole TE sequence, B. 5' end subterminal regions, C. 3' end subterminal regions, and hierarchical clustering (HC): D. whole TE sequence, E. 5' end subterminal regions, F. 3' end subterminal regions.</p>
               </text>
               <file name="1471-2164-8-409-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Classification and abundance of <it>M. truncatula PIF/Harbinger</it>-like elements</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Family</p>
                     </c>
                     <c ca="left">
                        <p>Subfamily</p>
                     </c>
                     <c cspan="4" ca="left">
                        <p>Number of elements</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Total</p>
                     </c>
                     <c ca="left">
                        <p>Containing TPase</p>
                     </c>
                     <c ca="left">
                        <p>Containing TPase and orf1</p>
                     </c>
                     <c ca="left">
                        <p>With no coding capacity</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="left">
                        <p>
                           <it>MtPH-A5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="left">
                        <p>
                           <it>MtPH-D</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="left">
                        <p>
                           <it>MtPH-E</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M (MtMaster)</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="right">
                        <p>Total:</p>
                     </c>
                     <c ca="left">
                        <p>89</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>67</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Consensus TIR sequences of <it>M. truncatula PIF/Harbinger</it>-like elements</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Family</p>
                     </c>
                     <c ca="left">
                        <p>Subfamily</p>
                     </c>
                     <c ca="left">
                        <p>TIR length</p>
                     </c>
                     <c ca="left">
                        <p>TIR sequence</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="left">
                        <p>
                           <it>MtPH-A5</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>21 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGGKGYGTTTGTTTGAGGGTT 3'</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-A6</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>15 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGGTCCGTTTGGTTC 3'</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>15 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGCTMTGTTTGGATT 3'</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>22 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGGTCCGTTTGGTTCGAGARTT 3'</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>17 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGCTTTGTTTGCGAGTT 3'</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="left">
                        <p>
                           <it>MtPH-D</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>12 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGCTWTGTTTGG 3'</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="left">
                        <p>
                           <it>MtPH-E</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>22 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GGGCCTGTTTGRAACACTTTTT 3'</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>MtPH-M (MtMaster)</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>14 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GTGYRTGTTTGGYA 3'</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>14 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GYRYGTGTTTGGTT 3'</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>14 bp</p>
                     </c>
                     <c ca="left">
                        <p>5' GNSYSTGTTTGGTT 3'</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
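            <p>The consensus sequences in Table 3 use IUPAC ambiguity codes (e.g. K, Y, M, R, W, S, N). As an illustration only, and not the procedure used in this study, the following minimal Python sketch expands such a consensus into a regular expression and tests whether two candidate termini match it. The function names are hypothetical, and exact matching is a simplifying assumption, since real TIRs are often imperfect.</p>

```python
import re

# Standard IUPAC nucleotide ambiguity codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def consensus_to_regex(consensus):
    """Translate an IUPAC consensus into a compiled regular expression."""
    return re.compile("".join(IUPAC[base] for base in consensus.upper()))


def reverse_complement(seq):
    """Reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def matches_tir(consensus, five_prime_end, three_prime_end):
    """True if both termini match the consensus exactly.

    The 3' terminus is read as its reverse complement, because the
    repeats are inverted relative to each other.
    """
    pattern = consensus_to_regex(consensus)
    left_ok = pattern.fullmatch(five_prime_end.upper()) is not None
    right_ok = pattern.fullmatch(reverse_complement(three_prime_end)) is not None
    return left_ok and right_ok
```

            <p>For instance, with the 12 bp <it>MtPH-D </it>consensus GGCTWTGTTTGG, the call matches_tir("GGCTWTGTTTGG", "GGCTATGTTTGG", "CCAAACAAAGCC") accepts the pair, because W allows either A or T at the fifth position.</p>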
            <p>The largest family, <it>MtPH-A6</it>, contained 54 elements, while family <it>MtPH-D </it>was represented by only a single element. The second most abundant family, containing 27 elements, was <it>MtPH-M (MtMaster)</it>, of which 18 were grouped into subfamily 3.</p>
         </sec>
         <sec>
            <st>
               <p>Detailed structure analysis of <it>MtPH </it>families</p>
            </st>
            <p><it>MtPH-A6 </it>consisted of four subfamilies represented by putative autonomous elements sharing a similar ORF organization, i.e. a TPase containing two introns, followed by orf1. <it>MtPH-A6 </it>TPases formed a well-supported clade, containing four subclades with high bootstrap values, representing the corresponding subfamilies (Figure <figr fid="F1">1</figr>).</p>
            <p>Subfamily <it>MtPH-A6-1 </it>contained nine elements ranging in length from 802 to 8,707 bp, the longest element carrying a nested insertion of the 7,555 bp long <it>RAM12 gypsy</it>-like retrotransposon.</p>
            <p>Subfamily <it>MtPH-A6-2 </it>grouped six elements, 898 to 4,218 bp long, all being simple internal deletion derivatives of the core element <it>MtPH-A6-2-Ia</it>.</p>
            <p>Sixteen elements belonged to subfamily <it>MtPH-A6-3</it>, ranging in length from 553 to 4,845 bp, except for a much larger, 23,892 bp long element <it>MtPH-A6-3-IIIa</it>, initially identified as being flanked by 15 bp TIRs unrelated to those of the <it>MtPH-A6-3 </it>subfamily. However, the element contained a 4.9 kb region 74% identical to the two elements mentioned above, but lacking the first 8 bases in the 5' TIR (Figure <figr fid="F2">2</figr>). Hence, the true boundaries of the elements could not be initially identified using our mining strategy. An interesting feature of that subfamily was the presence of a perfect microsatellite site in the first intron of the TPase. The three elements containing the region coding for the TPase, <it>MtPH-A6-3-Ia</it>, <it>MtPH-A6-3-IIa</it>, and <it>MtPH-A6-3-IIIa </it>had, respectively, 27, 8, and 21 repeats of the (TA)<sub>n </sub>core motif (Figure <figr fid="F2">2</figr>).</p>
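            <p>The repeat counts of a perfect (TA)<sub>n </sub>microsatellite can be read directly from a sequence. The short Python sketch below is one hedged way to do so, assuming an uninterrupted TA run and a plain nucleotide string as input; the function name is hypothetical and this is not necessarily how the counts reported above were obtained.</p>

```python
import re


def longest_ta_run(seq):
    """Return the repeat count of the longest perfect (TA)n run in seq."""
    # Each match is an uninterrupted string of TA dinucleotides.
    runs = re.findall(r"(?:TA)+", seq.upper())
    # Two bases per repeat unit; 0 if no TA dinucleotide is present.
    return max((len(run) // 2 for run in runs), default=0)
```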
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Structure of three elements representing family <it>MtPH-A6-3</it></p>
               </caption>
               <text>
                  <p><b>Structure of three elements representing family <it>MtPH-A6-3</it></b>. Arrows show terminal inverted repeats (TIRs), letters represent sequences of target site duplications (TSDs) and TIRs, solid lines show homologous regions with similarity rate written in italics, dotted lines show regions with no homology, numbers in bold show localization of nucleotide positions of important features, (TA) indicates presence of a microsatellite repeat, followed by the number of the core motif repeats.</p>
               </text>
               <graphic file="1471-2164-8-409-2"/>
            </fig>
            <p><it>MtPH-A6-4 </it>subfamily members ranged in length from 431 to 25,288 bp. Among the 23 members of that subfamily, 18 were characterized by the presence of imperfect 60 bp long tandem repeats, variable in number, while in the remaining five elements the core repeat was entirely absent. Each repeat itself contained a triplicated AAACNNCTTATT motif. These elements contained from 2 to 35 repeats that in extreme cases covered almost the entire region between the TIRs (Figure <figr fid="F3">3A</figr> and <figr fid="F3">3B</figr>). In some elements, tandem repeats were present only in one subterminal region, while for the others they were present in both subterminal regions in opposite orientation. The 60 bp tandem repeats were identified in 27 other sites in the <it>M. truncatula </it>genome, initially not identified as occupied by <it>MtPH-A6-4 </it>elements. However, BLAST search with the terminal 214 bp + 3 bp TSD of the <it>MtPH-A6-4-Ia </it>and <it>MtPH-A6-4-IIa </it>elements indicated that in all instances at least one of the regions flanking the repeats showed residual homology to the TE terminus (E value &lt; 1e-08). The presence of tandem repeats facilitated internal rearrangements resulting in inversions of the internal region (Figure <figr fid="F3">3C</figr>). Two nested insertions were identified in the longest element <it>MtPH-A6-4-IIa</it>, which showed three blocks of significant homology to the <it>MtPH-A6-4-Ia </it>core element, interrupted by an unidentified element of 2,191 bp carrying 15 bp TIRs and flanked by a 5 bp long TSD and a <it>gypsy</it>-like retrotransposon (Figure <figr fid="F3">3D</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>VNTR regions, inversions, and nested insertions in elements belonging to family <it>MtPH-A6-4</it></p>
               </caption>
               <text>
                  <p><b>VNTR regions, inversions, and nested insertions in elements belonging to family <it>MtPH-A6-4</it></b>. A. Consensus sequence of the 60 bp core VNTR motif, triplicated regions within the core motif are underlined, variable nucleotide positions within the triplicated motif are written in italics. B. Dot-plot and schematic representation of <it>MtPH-A6-4-XXI</it>, an example of TE carrying a large number of tandem repeats. Thick black arrowheads represent TIRs, gray arrows indicate localization and orientation of the VNTR region, number of repetitions is given below each arrow. C. Comparison of two elements containing an inversion of the internal region, thick black arrowheads show TIRs, gray arrows show localization of the VNTR, thin arrows indicate the orientation of the inverted region, solid lines represent homologous regions with similarity rates written in italics, dotted lines represent regions with no homology, numbers in bold show localization of nucleotide positions of important features. D. Organization of the long element <it>MtPH-A6-4-IIa </it>as compared to the core element <it>MtPH-A6-4-Ia</it>, thick black arrowheads show TIRs, solid lines represent homologous regions with percentages of similarity written in italics, dotted lines represent regions with no homology, numbers in bold show localization of nucleotide positions of important features, nested TEs are drawn above the <it>MtPH-A6-4-IIa </it>element.</p>
               </text>
               <graphic file="1471-2164-8-409-3"/>
            </fig>
            <p><it>MtPH-M </it>family included three subfamilies with short (14 bp), similar TIRs and orf1 followed by TPase. Subfamily <it>MtPH-M-1 </it>contained only four elements, ranging in length from 812 to 5,140 bp. Two of them, <it>MtPH-M-1-Ia </it>(previously described as <it>MtMaster </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>) and <it>MtPH-M-1-IIa </it>(showing 90% overall sequence identity to <it>MtMaster</it>) had both ORFs, and the remaining two were internally deleted derivatives.</p>
            <p>Five elements were grouped into subfamily <it>MtPH-M-2</it>, three of them carrying both orf1 and TPase. The region containing element <it>MtPH-M-2-IIa </it>proved to be a composite structure of two related TEs. The initially identified sequence flanked by TIRs and TSDs spanned 21,696 bp. The 5,303 bp element <it>MtPH-M-2-IIa </it>occupied the 5' region of that sequence; however, the downstream sequence also contained blocks of homology to the core element <it>MtPH-M-2-Ia</it>, and a nested insertion of a <it>gypsy</it>-like retrotransposon (Figure <figr fid="F4">4</figr>). This indicates that an ancient copy of a TE related to those belonging to the subfamily <it>MtPH-M-2 </it>became a target for subsequent nested insertions. Other elements from that subfamily ranged in length from 2,240 to 7,816 bp.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Mosaic structure of the <it>MtPH-M-2-IIa </it>element, as compared to the core element <it>MtPH-M-2-Ia</it></p>
               </caption>
               <text>
                  <p><b>Mosaic structure of the <it>MtPH-M-2-IIa </it>element, as compared to the core element <it>MtPH-M-2-Ia</it></b>. Solid lines represent homologous regions with similarity rates written in italics, dotted lines represent regions with no homology, numbers in bold show the localization of nucleotide positions of important features, a nested retrotransposon is drawn above the AC149306 element.</p>
               </text>
               <graphic file="1471-2164-8-409-4"/>
            </fig>
            <p>Subfamily <it>MtPH-M-3 </it>was the largest within the family and contained 18 elements, of which two carried both ORFs. Their length varied from 442 to 4,048 bp, and interestingly, two 442 bp-long elements were 100% identical. As their length resembled that of miniature inverted-repeat transposable elements (MITEs), but unlike MITEs, their number in the <it>M. truncatula </it>genome was low, it would be tempting to speculate that these copies might become founders of a new MITE family. A slightly more advanced stage of proliferation of MITE-like elements could be observed with a group of 10 short (776&#8211;905 bp) elements from the same family. A more detailed comparison of the element sequences provided further insight into the evolution of the <it>MtPH-M-3 </it>subfamily. Internal deletions were accompanied by differentiation and rearrangement of variant sequences (blocks A, B, and C in Figure <figr fid="F5">5</figr>, Additional File <supplr sid="S3">3</supplr>) in the subterminal regions. Two lineages originating from the core element <it>MtPH-M-3-Ia </it>could be traced, comprising 5 and 11 elements, respectively. The element <it>MtPH-M-3-VI </it>apparently showed a mosaic structure, as it contained the 3' subterminal region from lineage I, while the major portion of the element contained sequence variants characteristic of lineage II (Figure <figr fid="F5">5</figr>).</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>Sequence alignment of the A and B blocks differentiating individual elements belonging to the <it>MtPH-M-3 </it>family.</p>
               </text>
               <file name="1471-2164-8-409-S3.ppt">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Intra-family relationships among the <it>MtPH-M-3 </it>elements</p>
               </caption>
               <text>
                  <p><b>Intra-family relationships among the <it>MtPH-M-3 </it>elements</b>. Thick solid lines represent homologous regions, thick dotted lines represent regions with no homology, thin dashed lines represent internal deletions, blocks marked with orf1 and TPase show localization of the coding regions, blocks marked with A, B, and C show localization of sequence polymorphisms used to trace intra-family lineages, numbers show the length of the element.</p>
               </text>
               <graphic file="1471-2164-8-409-5"/>
            </fig>
            <p>Family <it>MtPH-A5 </it>was represented by four elements ranging in length from 1,182 to 6,770 bp. The two putative autonomous elements were 72% similar over the entire sequence, but within the coding region the nucleotide sequence similarity reached 95%. Two shorter elements were deletion derivatives of full-length elements. Interestingly, a recently reported <it>MITRAV </it>family of miniature elements of barrel medic <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> showed a high nucleotide sequence similarity of their termini to the <it>MtPH-A5 </it>elements, spanning ca. 40 bp at both ends of the element.</p>
            <p>Family <it>MtPH-E </it>consisted of three elements, none of which carried both ORFs. The elements ranged from 1,508 to 3,957 bp. The two largest elements were very similar, differing by one indel, while the similarity of the shortest element to the other two was restricted only to the 180 bp of the 5' terminus and 70 bp of the 3' terminus.</p>
            <p>Family <it>MtPH-D </it>was represented by a single element of 3,160 bp, carrying both ORFs. However, their orientation was opposite to that of typical <it>PIF/Harbinger</it>-like elements representing the D lineage <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Its localization in the D lineage was not strongly supported by bootstrap analysis (Figure <figr fid="F1">1</figr>). The fact that no internally truncated elements were identified could suggest that the element might be capable of perfect excision, not triggering the process of abortive gap repair.</p>
         </sec>
         <sec>
            <st>
               <p>Documentation of the mobility of the mined elements</p>
            </st>
            <p>In order to find evidence for possible mobility of the identified elements, we implemented a strategy proposed by Le et al. <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, i.e. we searched for regions, called RESites (Related to Empty Sites), paralogous to sequences flanking the insertion sites, but lacking the transposable element. We identified 11 RESites, of which five represented insertion sites of non-autonomous elements belonging to the <it>MtPH-A6-4 </it>subfamily, while two and one of them were related to non-autonomous elements of the <it>MtPH-A6-3 </it>and <it>MtPH-A6-2 </it>subfamilies, respectively. The remaining three RESites represented insertion sites of the putative autonomous (core) elements belonging to family <it>MtPH-E </it>and subfamilies <it>MtPH-M-2 </it>and <it>MtPH-M-3 </it>(Figure <figr fid="F6">6</figr>).</p>
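            <p>The idea behind a RESite search can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the implementation used here or by Le et al.: the expected empty site is modelled as the two flanks joined by a single copy of the TSD, sequences are plain strings keyed by a hypothetical record identifier, and exact string matching stands in for the similarity search that a real analysis of diverged paralogous regions would need.</p>

```python
def empty_site_query(left_flank, tsd, right_flank):
    """Sequence expected at an unoccupied site: flanks joined by one TSD copy."""
    return left_flank + tsd + right_flank


def find_resites(query, genome_records, exclude_id=None):
    """Return (record id, position) pairs where the query occurs exactly.

    genome_records maps a record identifier to its sequence; exclude_id
    can be used to skip the record that carries the insertion itself.
    """
    hits = []
    for record_id, seq in genome_records.items():
        if record_id == exclude_id:
            continue
        pos = seq.find(query)
        if pos != -1:
            hits.append((record_id, pos))
    return hits
```

            <p>In this toy model, a hit in any record other than the one carrying the insertion is a candidate RESite, to be confirmed by inspecting the alignment around the predicted insertion point.</p>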
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>RESites corresponding to mined <it>M. truncatula MtPH </it>elements</p>
               </caption>
               <text>
                  <p><b>RESites corresponding to mined <it>M. truncatula MtPH </it>elements</b>. For each group of sequences the upper one represents the insertion site and the lower one is the corresponding RESite. Numbers indicate the nucleotide position of the first and the last nucleotide of the presented sequence, related to the BAC clone from which it was extracted.</p>
               </text>
               <graphic file="1471-2164-8-409-6"/>
            </fig>
            <p>We identified several <it>M. truncatula </it>ESTs showing high similarity to putative expression products (orf1 and TPase) of the mined autonomous elements (Additional File <supplr sid="S4">4</supplr>). However, ESTs directly corresponding to the putative expression products, both to the orf1 (CX532696, 641 bp, 94% identity) and the TPase (AW686181, 304 bp, 99% identity), could be detected only in the case of elements representing the <it>MtPH-M-1 </it>subfamily (Additional File <supplr sid="S5">5</supplr>). Interestingly, a number of ESTs similar to non-coding terminal regions of the TEs could also be identified (data not presented).</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Identification of <it>M. truncatula </it>ESTs similar to putative expression products of orf1 and TPases coded by MtPH elements</b>. Sequence of the whole element was used as query against <it>M. truncatula </it>EST database, hits in orf1 and TPase coding regions with E value lower than 1e-06 were scored. Nearly identical hits to orf1 and TPase of the <it>MtPH-M-1 </it>elements are marked red.</p>
               </text>
               <file name="1471-2164-8-409-S4.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>Alignment of <it>MtPH-M-1-Ia </it>and the ESTs corresponding to orf1 (CX532696) and TPase (AW686181)</b>. Predicted exons of the orf1 and TPase are highlighted yellow and green, respectively, TSDs of the element are marked gray.</p>
               </text>
               <file name="1471-2164-8-409-S5.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The PCR assay of <it>MtPH </it>insertion polymorphism was performed on eight <it>M. truncatula </it>populations selected to represent the genetic diversity of the species, as proposed by Ronfort <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Fifty-six insertion sites identified in the reference genome of cv. Jemalong A17 were checked for the presence of the TE. Thirty-seven primer pairs yielded products of the expected size for the reference sample, while 11 generated complex profiles, likely indicating that insertions were present in repetitive regions. The remaining eight primer pairs produced ambiguous results. Of the 37 successful amplifications, 20 proved to be polymorphic. Usually, the size of the shorter amplicon corresponded to the predicted size of the product amplified from the unoccupied site. However, amplicons slightly differing from the expected size were also observed, indicating a possible imperfect excision event (Figure <figr fid="F7">7</figr>).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Insertion-related size polymorphisms of <it>MtPH-A6-3 </it>elements</p>
               </caption>
               <text>
                  <p><b>Insertion-related size polymorphisms of <it>MtPH-A6-3 </it>elements</b>. A. Long PCR amplification of the region encompassing the <it>MtPH-A6-3-IIa </it>insertion site, B. PCR amplification of the region encompassing the <it>MtPH-A6-3-VI </it>insertion site, C. PCR amplification of the region encompassing the <it>MtPH-A6-3-XVI </it>insertion site. Lanes: M &#8211; 1 kb ladder (Fermentas), 1 &#8211; Jemalong A17, 2 &#8211; L163, 3 &#8211; L174, 4 &#8211; L368, 5 &#8211; L530, 6 &#8211; L544, 7 &#8211; L651, 8 &#8211; L734, 9 &#8211; negative control. Fragments representing occupied and unoccupied sites are marked by red and green arrows, respectively. Numbers in red indicate the expected length of products representing occupied sites, predicted from the original sequence.</p>
               </text>
               <graphic file="1471-2164-8-409-7"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We developed a strategy for identification of transposable element families through <it>in silico </it>genome mining, based on initial assumptions on the type of transposase and the consensus sequences of terminal inverted repeats. It required several consecutive steps, i.e. (1) search for regions coding for the TPase, (2) identification of TIRs flanking the identified regions and matching a defined sequence motif, (3) identification of related elements with no coding capacity, and (4) grouping the identified elements into families on the basis of their sequence similarity. We applied this strategy to mine the genome of <it>Medicago truncatula </it>for <it>PIF/Harbinger</it>-like elements similar to the previously described <it>MtMaster </it>element <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. In principle, the proposed strategy can be used to mine for any other type of class II TEs, provided that at least one 'seed' element is known.</p>
         <p>Diversity of the identified <it>PIF/Harbinger</it>-like elements is high, although our search was limited by a specifically defined core TIR sequence. We focused on 22 ORFs coding for putative TPases, representing half of all initially identified ORFs; for the other half, TIRs flanking the ORF and containing the required motif could not be found. A recent broad analysis of the TE landscape in another legume, <it>Lotus japonicus </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, revealed the presence of nine putative autonomous <it>PIF</it>-like elements (besides several more distantly related <it>Pong</it>-like elements) in a ca. 32 Mb portion of the genome. This number is in agreement with our results, as we found 22 full-length elements (2.5 times more) in ca. 200 Mb of sequence, which includes a certain level of redundancy. Interestingly, all <it>PIF</it>-like TEs from <it>L. japonicus </it>represented the A3 lineage, while no A3 members were identified in <it>M. truncatula</it>, which may indicate a strikingly different evolutionary fate of that group of TEs in each of the closely related species.</p>
         <p>Detailed structure analysis of the mined element families indicates that their proliferation in the genome generally follows the model of abortive gap repair (AGR), as proposed for the <it>Ac/Ds </it>elements in maize <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Members of a particular family were usually direct deletion derivatives of the related, putative autonomous element. However, assuming that members of all <it>PIF/Harbinger</it>-like TE families in the genome of <it>M. truncatula </it>were mobilized with similar frequency, the efficiency of AGR seems to vary from one family to another. Two families, <it>MtPH-A6 </it>and <it>MtPH-M</it>, were the most numerous, while the remaining three were represented by a very small number of copies. The difference in copy number may be a result of different transposition rates, but it may also indicate that some elements trigger the process of AGR following excision less efficiently, which would result in a higher frequency of perfect excision. The latter is further supported by two observations. Firstly, the members of subfamily <it>MtPH-A6-4 </it>contain a variable number of 60 bp tandem repeats in one or both subterminal regions, serving as targets for AGR and leading to an increase in the TE copy number accompanied by changes in the number of VNTRs. The presence of 60 bp tandem repeats was inherently connected with <it>MtPH-A6-4 </it>elements throughout the <it>M. truncatula </it>genome, which implies that they likely evolved in the course of the proliferation of that subfamily. Triggering of AGR from the VNTR region probably also led to an inversion of the internal region in <it>MtPH-A6-4-XIV</it>, as compared to <it>MtPH-A6-4-Ia</it>. Secondly, at least one member of the low copy number family <it>MtPH-E </it>was transpositionally active, as confirmed by the presence of the RESite, but despite the potential for mobility, the number of <it>MtPH-E </it>elements has remained low.</p>
         <p><it>PIF/Harbinger</it>-like elements are ancestors of certain groups of miniature inverted repeat transposable elements (MITEs); the relationship between the maize <it>PIF </it>element and MITEs belonging to the <it>Tourist </it>family has been well documented <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B16">16</abbr></abbrgrp>. Also, several other MITE families, e.g. <it>Heartbreaker </it>from maize <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, <it>Kiddo </it>from rice <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, and <it>Krak </it>from carrot <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> show TIR sequence similarities to those of <it>PIF/Harbinger</it>-like elements. We were able to directly link the previously identified <it>MITRAV </it>MITE family <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> to family <it>MtPH-A5 </it>of <it>M. truncatula PIF/Harbinger</it>-like elements. This suggests that both <it>MtPH-A5 </it>and <it>MITRAV </it>originated from a recent common ancestor and that the <it>MtPH-A5 </it>TPase might be the <it>trans</it>-acting factor for <it>MITRAV </it>mobilization, as experimentally proven for the <it>Pong </it>element and the <it>mPing </it>MITE in rice <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. Also, two groups of two and ten TEs, all classified in the subfamily <it>MtPH-M-3</it>, might represent newly emerging MITE families. We performed an initial search for other MITEs showing TIR homology to the consensus motif of the <it>PIF/Harbinger </it>TIRs, which led to the identification of a few other MITE families (data not presented). Altogether, this confirms that <it>PIF/Harbinger</it>-like elements and related MITEs are present in the genome of <it>M. truncatula</it>, as in the genomes of other plant species. However, the number of MITE copies is probably much lower than that present in the grass genomes.</p>
         <p>A more detailed experimental evaluation of <it>MtPH </it>TE diversity in a range of <it>M. truncatula </it>populations should be useful to further characterize the transpositional activity and the dynamics of particular families. Analysis of RESites and a high incidence of insertion-related size polymorphisms show that a significant fraction of the mined elements was mobile in the recent past. The presence of ESTs related to ORFs of the <it>MtPH </it>elements, including those directly derived from the <it>MtPH-M-1 </it>elements, suggests that they can still be mobile. As proven previously, one transcriptionally active autonomous element can cause <it>trans</it>-mobilization of a range of related, but not directly derived elements <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>.</p>
         <p>Polymorphic insertion sites could be used as a source of molecular markers, as shown previously for other species <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>, to measure intraspecific diversity in relation to its geographic structure, complementing other molecular marker systems, e.g. those based on microsatellites <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Starting from a single previously described <it>PIF/Harbinger</it>-like TE of <it>M. truncatula</it>, we identified 89 elements representing the diversity of this superfamily in the host plant genome. They were divided into five families representing different evolutionary lineages, and further into subfamilies. Elements within each subfamily evolved essentially following the model of AGR, leading to the reconstruction of an internally deleted copy in the donor site following transposition. It is likely that different families vary in their potential to trigger the process of AGR. One peculiarity observed in a group of elements representing subfamily <it>MtPH-A6-4 </it>was the presence of 60 bp long VNTRs in one or both subterminal regions or even spanning the entire internal region of the TE. Some of the identified elements are closely related to several MITE families, including the previously described <it>MITRAV </it>family. Also, some of the newly identified short elements can be viewed as <it>in statu nascendi </it>MITEs, provided that conditions for a rapid burst of their mobility are met. Further investigation is necessary for a more detailed evaluation of the copy number, transpositional activity, and insertional polymorphism of the TEs, including MITEs, as they could be utilized as a source of molecular markers.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Semi-automated mining of <it>PIF/Harbinger</it>-like elements</p>
            </st>
            <p>The experiment was performed on the <it>M. truncatula </it>genomic DNA sequence database consisting of 1540 BACs, updated Aug 2005 <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. As the size of the whole <it>M. truncatula </it>genome ranges from 500 to 600 Mbp <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and the average non-overlapping coverage by each BAC was ca. 100 kb <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, we estimated that the input sequence data amounted to 26&#8211;30% of the whole genome.</p>
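            <p>The arithmetic behind this estimate can be retraced with a short calculation. The following Python sketch uses only the figures quoted above (1540 BACs, ca. 100 kb of non-overlapping sequence per BAC, a genome of 500 to 600 Mb); the variable and function names are illustrative and are not part of the original analysis.</p>

```python
# Back-of-the-envelope check of the genome coverage estimate.
N_BACS = 1540                # BACs in the database (from the text)
BAC_COVERAGE_MB = 0.1        # ca. 100 kb of non-overlapping sequence per BAC
GENOME_SIZE_MB = (500, 600)  # reported range of the genome size


def coverage_percent(n_bacs, bac_mb, genome_mb):
    """Percentage of the genome represented by non-overlapping BAC sequence."""
    return 100.0 * n_bacs * bac_mb / genome_mb


sequenced_mb = N_BACS * BAC_COVERAGE_MB
# The largest genome size gives the lowest coverage, and vice versa.
low = coverage_percent(N_BACS, BAC_COVERAGE_MB, max(GENOME_SIZE_MB))
high = coverage_percent(N_BACS, BAC_COVERAGE_MB, min(GENOME_SIZE_MB))
print(f"{sequenced_mb:.0f} Mb sequenced, {low:.1f}-{high:.1f}% of the genome")
```

            <p>The result, about 154 Mb or roughly 26 to 31% of the genome, is in line with the range given above.</p>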
            <p>The predicted protein sequence of the DDE domain and the whole TPase sequence of the previously identified <it>MtMaster </it>element <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> were used as the initial queries for a TBLASTN search against the BAC sequence database, using an E-value threshold of 1e-20. The output file was then processed to eliminate redundancy coming from overlapping BACs, and significant hits were extracted, along with up to 30 kb of flanking sequence. The extracted sequences were scanned for the presence of TIRs and TSDs using a newly developed tool named TIRfinder, which identifies TIRs and TSDs and returns a file listing the elements that fulfil user-defined requirements. To provide fast computation on a whole genome, the algorithm uses very efficient data structures, such as suffix trees. TIRfinder is open source software accessible online <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. The program was written in Java and can be run on Windows or Linux.</p>
            <p>We allowed up to four mismatches inside 14 bp of the TIRs and no mismatches in the TSDs. Another condition was the presence of the conserved G(N)<sub>5</sub>GTT motif at the 5' end of the TIR. <it>In silico </it>prediction of the presence of coding regions was performed for all identified sequences using FGENESH <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>.</p>
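            <p>The acceptance criteria stated above can be illustrated with a minimal Python sketch. It is not the TIRfinder implementation, which is written in Java and based on suffix trees; it only checks one candidate at a time. The function names are hypothetical, the 3 bp TSD length is taken from the terminal regions described in this section, and testing the motif on the left-hand TIR only is an assumption made for this example.</p>

```python
import re

TIR_LEN = 14            # length of TIR compared (from the text)
MAX_TIR_MISMATCHES = 4  # mismatches tolerated between the two TIRs
TSD_LEN = 3             # TSD length (assumed from the 3 bp TSD used in this section)
MOTIF = re.compile(r"G[ACGT]{5}GTT")  # G(N)5GTT at the 5' end of the TIR

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def is_candidate(left_tsd, element, right_tsd):
    """Check one candidate; `element` runs from the first TIR base to the last."""
    left_tsd, right_tsd = left_tsd.upper(), right_tsd.upper()
    element = element.upper()
    if 2 * TIR_LEN > len(element):
        return False
    # TSDs must have the expected length and be identical (no mismatch).
    if len(left_tsd) != TSD_LEN or left_tsd != right_tsd:
        return False
    left_tir = element[:TIR_LEN]
    # Read the right-hand TIR on the opposite strand so both TIRs are comparable.
    right_tir = reverse_complement(element[-TIR_LEN:])
    if hamming(left_tir, right_tir) > MAX_TIR_MISMATCHES:
        return False
    # Conserved motif at the 5' end of the (left-hand) TIR.
    return MOTIF.match(left_tir) is not None
```

            <p>For example, <it>is_candidate("TAA", element, "TAA") </it>returns True only if the outermost 14 bp of <it>element </it>form inverted repeats with at most four mismatches and the left-hand TIR starts with the motif.</p>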
            <p>To identify internally deleted copies of elements related to those found previously, 217 bp-long (3 bp TSD + 14 bp TIR + 200 bp subterminal sequence) terminal regions were extracted from all putative autonomous elements. These sequences were used to scan the <it>M. truncatula </it>genomic DNA sequence database (BLASTN, E-value threshold of 1e-10), and regions showing homology to any of the terminal regions were identified. The output was automatically filtered to find sequences of length ranging from 400 to 30,000 bp, flanked by TIRs showing homology to the same autonomous element on both ends. All newly found sequences were checked for the presence of a region coding for the TPase. All TEs were scanned using Censor <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> to identify nested elements.</p>
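            <p>As an illustration of how the 217 bp terminal regions and the length filter could be derived, a minimal Python sketch is given below. The coordinate convention (0-based, end-exclusive, element delimited by its TIRs) and the function names are assumptions made for this example and do not describe the scripts actually used.</p>

```python
TSD_LEN = 3
TIR_LEN = 14
SUBTERMINAL_LEN = 200
TERMINAL_LEN = TSD_LEN + TIR_LEN + SUBTERMINAL_LEN  # 217 bp
MIN_LEN, MAX_LEN = 400, 30000  # accepted length range of candidate elements


def terminal_regions(genome, start, end):
    """Return the left and right terminal regions of the element genome[start:end].

    Each region contains the TSD, the TIR and 200 bp of subterminal sequence.
    """
    if TSD_LEN > start or end + TSD_LEN > len(genome):
        raise ValueError("element too close to the sequence end to include both TSDs")
    if TERMINAL_LEN - TSD_LEN > end - start:
        raise ValueError("element shorter than one terminal region")
    left = genome[start - TSD_LEN:start - TSD_LEN + TERMINAL_LEN]
    right = genome[end + TSD_LEN - TERMINAL_LEN:end + TSD_LEN]
    return left, right


def length_in_range(candidate_len):
    """Length filter applied to sequences flanked by matching terminal regions."""
    return candidate_len >= MIN_LEN and MAX_LEN >= candidate_len
```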
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analyses, grouping, and visualization of TE sequence similarity</p>
            </st>
            <p>Multiple alignment of 48 transposase sequences of <it>PIF/Harbinger</it>-like transposable elements was obtained using T-Coffee <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Bootstrap analysis was performed with PHYLIP using the seqboot, neighbor, protdist and consense programs <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. The sequence similarity of 89 TEs was analyzed by hierarchical clustering and visualized with the help of multidimensional scaling. For both tasks we used the R statistical environment <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. As a measure of dissimilarity between sequences we used the BLAST E-value. Hierarchical cluster analysis of the set of dissimilarities was performed with the hclust function (complete linkage) <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. Multidimensional scaling <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> visualization relies on the analogy between similarity and proximity (and hence between dissimilarity and distance). It re-scales a set of dissimilarity data into distances and produces a low-dimensional configuration whose inter-point distances approximate them. The visualization for our data was obtained with the isoMDS R procedure.</p>
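            <p>To make the complete linkage criterion concrete, a small Python sketch is given below. It is not a substitute for the hclust function of R used in the study; it only shows the merging rule, in which the distance between two clusters is the largest pairwise dissimilarity between their members. The dissimilarity matrix is taken as given, and the function name and the <it>n_clusters </it>stopping parameter are assumptions made for this example.</p>

```python
def complete_linkage(dist, n_clusters):
    """Agglomerative clustering with complete linkage.

    dist: symmetric matrix (list of lists) of pairwise dissimilarities.
    n_clusters: number of clusters at which merging stops (at least 1).
    Returns a list of clusters, each a sorted list of item indices.
    """
    n_clusters = max(1, n_clusters)
    clusters = [[i] for i in range(len(dist))]
    while len(clusters) > n_clusters:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Complete linkage: cluster distance is the largest
                # dissimilarity between any two members.
                d = max(dist[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or best[0] > d:
                    best = (d, a, b)
        _, a, b = best
        # Merge the closest pair of clusters.
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    return [sorted(c) for c in clusters]
```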
         </sec>
         <sec>
            <st>
               <p>TE structure analysis</p>
            </st>
            <p>Sequences were visually compared, aligned, edited, and analysed using BioEdit and the included accessory applications <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Pairwise sequence comparisons were performed using 'blast 2 sequences' <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and Yass <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Dot-plots were generated using Nucleic Acid Dot Plots <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> with a window size of 25 nucleotides and a mismatch limit of 5 positions. Tandem repeat identification was performed using the 'mreps' software <abbrgrp><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>.</p>
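            <p>The effect of the window size and mismatch limit can be illustrated with a generic dot-plot routine in Python. This is a sketch of the general technique and not the algorithm of the Nucleic Acid Dot Plots tool, whose internals are not described here; the function name is hypothetical, and a dot is assumed to be drawn when two windows differ at no more than the allowed number of positions.</p>

```python
def dot_plot(seq_a, seq_b, window=25, max_mismatches=5):
    """Return the (i, j) window start positions at which a dot would be drawn.

    A dot is placed when the windows seq_a[i:i+window] and seq_b[j:j+window]
    differ at no more than max_mismatches positions.
    """
    seq_a, seq_b = seq_a.upper(), seq_b.upper()
    dots = []
    for i in range(len(seq_a) - window + 1):
        win_a = seq_a[i:i + window]
        for j in range(len(seq_b) - window + 1):
            win_b = seq_b[j:j + window]
            mismatches = sum(x != y for x, y in zip(win_a, win_b))
            if max_mismatches >= mismatches:
                dots.append((i, j))
    return dots
```

            <p>This direct comparison takes time proportional to the product of the two sequence lengths and the window size, which is acceptable for individual elements but not for whole BACs.</p>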
         </sec>
         <sec>
            <st>
               <p>Documentation of mobility</p>
            </st>
            <p>In order to find RESites (Related to Empty Sites) in the <it>M. truncatula </it>genome we performed a computer-based search, essentially as described by Le et al. <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Briefly, we extracted 1 kb of sequence flanking each side of each of the mined elements, combined the two flanks into one sequence of 2 kb, and used it as a query for a BLASTN search against the whole BAC sequence database. Hits spanning both sides of the insertion were considered to represent RESites.</p>
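            <p>The construction of the query and the test for a hit spanning the insertion point can be sketched in Python as follows. The coordinate convention (0-based, end-exclusive), the function names and the <it>min_overhang </it>parameter are assumptions made for this example; the text does not specify how far a hit had to extend on each side.</p>

```python
FLANK_LEN = 1000  # 1 kb on each side of the element (from the text)


def resite_query(genome, start, end, flank=FLANK_LEN):
    """Join the flanks of the element genome[start:end] into one query.

    Returns the query and the position of the junction between the flanks.
    """
    left = genome[max(0, start - flank):start]
    right = genome[end:end + flank]
    return left + right, len(left)


def spans_junction(hit_start, hit_end, junction, min_overhang=20):
    """True if a hit on the query covers both sides of the junction.

    hit_start and hit_end are query coordinates of the aligned region;
    min_overhang is a hypothetical minimum extension on each side.
    """
    left_overhang = junction - hit_start
    right_overhang = hit_end - junction
    return left_overhang >= min_overhang and right_overhang >= min_overhang
```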
            <p>EST search was performed using nucleotide sequences of the putative autonomous elements, using a BLAST tool run against the <it>M. truncatula </it>EST database <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>PCR conditions</p>
            </st>
            <p>The PCR assay was performed on plants representing cv. Jemalong A17 and seven populations from the core <it>M. truncatula </it>collection (CC8, as described by Ronfort et al. <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>). Primer pairs were anchored in the regions flanking the mined elements. They were designed using Primer3 <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> to obtain amplification of a ca. 600 bp long fragment for the putative empty site. Two cycling protocols were employed. For TEs of length not exceeding 2 kb a standard PCR was performed. The reaction was set up in a volume of 20 &#956;l and contained 0.25 mM each dNTP, 2 mM MgCl<sub>2</sub>, 10 pmol of each primer, 1 unit of Taq polymerase (Fermentas) and 2 &#956;l of the PCR buffer supplied by the manufacturer. The thermal profile of the reaction was as follows: 94&#176;C for 2 min., 35 cycles of: 94&#176;C for 30 s, 53&#176;C for 30 s, and 68&#176;C for 90 s, and completed with 68&#176;C for 5 min. For larger elements we used a long PCR protocol. Amplification was performed in a volume of 20 &#956;l containing 0.25 mM each dNTP, 10 pmol of each primer, 0.5 unit of long PCR enzyme mix (Fermentas) and 2 &#956;l of the Long PCR buffer supplemented with MgCl<sub>2 </sub>(Fermentas), using the following thermal profile: 94&#176;C for 2 min., 35 cycles of: 94&#176;C for 15 s, 53&#176;C for 30 s, and 68&#176;C for 7 min., and completed with 68&#176;C for 10 min. All reactions were carried out in the Mastercycler or Mastercycler Gradient (Eppendorf). Amplification products were separated on 1% agarose gels and visualized with ethidium bromide under UV.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DG developed the strategy for the study, performed the fine-scale analysis of the TEs, performed the PCR, and prepared the final version of the manuscript; SL and TG developed algorithms for TE identification; TG edited HC and MDS graphs; GK analysed tandem repeats; and AG participated in the design of the study, performed HC and MDS analyses, and participated in drafting the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The research project was funded by the Polish Ministry of Science and Higher Education grant no. N301 036 31/1203, for the years 2006&#8211;2008. SL, AG and GK were supported by the Polonium and ECO-NET programs of the French Ministry of Foreign Affairs. The authors wish to thank Dr. J-M Prosperi for donating seeds of <it>M. truncatula </it>populations used in the study, two anonymous reviewers for their helpful suggestions, and Mrs M Gladysz for her technical assistance.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Analysis of the genome sequence of the flowering plant <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <cnm>Arabidopsis Genome Initiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048692</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Abundance, distribution, and transcriptional activity of repetitive elements in the maize genome</p>
            </title>
            <aug>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Tingey</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Morgante</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>1660</fpage>
            <lpage>1676</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">311155</pubid>
                  <pubid idtype="pmpid" link="fulltext">11591643</pubid>
                  <pubid idtype="doi">10.1101/gr.188201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>De novo identification of repeat families in large genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Price</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Pevzner</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <fpage>351</fpage>
            <lpage>358</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/bioinformatics/bti1018</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>MAK, a computational tool kit for automated MITE analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>TC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3659</fpage>
            <lpage>3665</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">168938</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824388</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg531</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Automated de novo identification of repeat sequence families in sequenced genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>1269</fpage>
            <lpage>1276</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">186642</pubid>
                  <pubid idtype="pmpid" link="fulltext">12176934</pubid>
                  <pubid idtype="doi">10.1101/gr.88502</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>REPuter: fast computation of maximal repeats in complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schleiermacher</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>426</fpage>
            <lpage>427</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.5.426</pubid>
                  <pubid idtype="pmpid" link="fulltext">10366664</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Molecular paleontology of transposable elements from <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Genetica</source>
            <pubdate>1999</pubdate>
            <volume>107</volume>
            <fpage>27</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1004030922447</pubid>
                  <pubid idtype="pmpid" link="fulltext">10952195</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Transposon diversity in <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Le</snm>
                  <fnm>QH</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>7376</fpage>
            <lpage>7381</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">16553</pubid>
                  <pubid idtype="pmpid" link="fulltext">10861007</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.13.7376</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p><it>Mutator </it>-like elements in <it>Arabidopsis thaliana</it>: Structure, diversity and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wright</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>TE</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2000</pubdate>
            <volume>156</volume>
            <fpage>2019</fpage>
            <lpage>2031</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1461377</pubid>
                  <pubid idtype="pmpid" link="fulltext">11102392</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Rice transposable elements: A survey of 73,000 sequence-tagged-connectors</p>
            </title>
            <aug>
               <au>
                  <snm>Mao</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>TC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Budiman</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tomkins</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Woo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sasinowski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Presting</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Frisch</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Goff</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dean</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>982</fpage>
            <lpage>990</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310901</pubid>
                  <pubid idtype="pmpid" link="fulltext">10899147</pubid>
                  <pubid idtype="doi">10.1101/gr.10.7.982</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Survey of transposable elements from rice genomic sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Turcotte</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Srinivasan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bureau</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2001</pubdate>
            <volume>25</volume>
            <fpage>169</fpage>
            <lpage>179</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313x.2001.00945.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11169193</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>P instability factor: an active maize transposon system associated with the amplification of <it>Tourist</it>-like MITEs and a new superfamily of transposases</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Eggleston</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>12572</fpage>
            <lpage>12577</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60095</pubid>
                  <pubid idtype="pmpid" link="fulltext">11675493</pubid>
                  <pubid idtype="doi">10.1073/pnas.211442198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>An active DNA transposon family in rice</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hirochika</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>McCouch</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>163</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01214</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p><it>Master </it>: a novel family of <it>PIF/Harbinger</it>-like transposable elements identified in carrot (<it>Daucus carota </it>L.)</p>
            </title>
            <aug>
               <au>
                  <snm>Grzebelus</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yau</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2006</pubdate>
            <volume>275</volume>
            <issue>5</issue>
            <fpage>450</fpage>
            <lpage>459</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00438-006-0102-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">16482474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p><it>PIF</it>- and <it>Pong</it>-like transposable elements: distribution, evolution and relationship with <it>Tourist</it>-like miniature inverted repeat transposable elements</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Feschotte</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2004</pubdate>
            <volume>166</volume>
            <fpage>971</fpage>
            <lpage>986</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1470744</pubid>
                  <pubid idtype="pmpid" link="fulltext">15020481</pubid>
                  <pubid idtype="doi">10.1534/genetics.166.2.971</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p><it>PIFs </it>meet <it>Tourists </it>and <it>Harbingers</it>: a superfamily reunion</p>
            </title>
            <aug>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>VV</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2001</pubdate>
            <volume>98</volume>
            <fpage>12315</fpage>
            <lpage>12316</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60043</pubid>
                  <pubid idtype="pmpid" link="fulltext">11675478</pubid>
                  <pubid idtype="doi">10.1073/pnas.231490598</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Genome size and base composition in <it>Medicago sativa </it>and <it>M. truncatula </it>species</p>
            </title>
            <aug>
               <au>
                  <snm>Blondon</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Marie</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kondorosi</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome</source>
            <pubdate>1994</pubdate>
            <volume>37</volume>
            <fpage>264</fpage>
            <lpage>270</lpage>
         </bibl>
         <bibl id="B18">
            <title>
               <p><it>Bigfoot</it>: a new family of MITE elements characterized from the <it>Medicago </it>genus</p>
            </title>
            <aug>
               <au>
                  <snm>Charrier</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Foucher</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kondorosi</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>d'Aubenton-Carafa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Thermes</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kondorosi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ratet</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>1999</pubdate>
            <volume>18</volume>
            <fpage>431</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10406126</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Ogre elements &#8211; A distinct group of plant Ty3/gypsy-like retrotransposons</p>
            </title>
            <aug>
               <au>
                  <snm>Macas</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Neumann</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2007</pubdate>
            <volume>390</volume>
            <fpage>108</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2006.08.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">17052864</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Repbase Update: a database and an electronic journal of repetitive elements</p>
            </title>
            <aug>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>9</issue>
            <fpage>418</fpage>
            <lpage>420</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0168-9525(00)02093-X</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The transposable element landscape of the model legume <it>Lotus japonicus</it></p>
            </title>
            <aug>
               <au>
                  <snm>Holligan</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pritham</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2006</pubdate>
            <volume>174</volume>
            <fpage>2215</fpage>
            <lpage>2228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1698628</pubid>
                  <pubid idtype="pmpid" link="fulltext">17028332</pubid>
                  <pubid idtype="doi">10.1534/genetics.106.062752</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>MITRAV: A miniature DNA transposon from barrel medic</p>
            </title>
            <aug>
               <au>
                  <snm>Shankar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Repbase Reports</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>38</fpage>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Abortive gap repair: underlying mechanism for <it>Ds </it>element formation</p>
            </title>
            <aug>
               <au>
                  <snm>Rubin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Levy</snm>
                  <fnm>AA</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1997</pubdate>
            <volume>17</volume>
            <issue>11</issue>
            <fpage>6294</fpage>
            <lpage>6302</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">232480</pubid>
                  <pubid idtype="pmpid" link="fulltext">9343390</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The MITE family <it>Heartbreaker </it>(<it>Hbr</it>): molecular markers in maize</p>
            </title>
            <aug>
               <au>
                  <snm>Casa</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Brouwer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nagel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Kresovich</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>10083</fpage>
            <lpage>10090</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">27704</pubid>
                  <pubid idtype="pmpid" link="fulltext">10963671</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.18.10083</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p><it>Kiddo</it>, a new transposable element closely associated with rice genes</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Chandrasekharan</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>TC</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2001</pubdate>
            <volume>266</volume>
            <fpage>417</fpage>
            <lpage>424</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004380100530</pubid>
                  <pubid idtype="pmpid" link="fulltext">11713671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The plant MITE <it>mPing </it>is mobilized in anther culture</p>
            </title>
            <aug>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Terauchi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wada</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirano</snm>
                  <fnm>H-Y</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>167</fpage>
            <lpage>170</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01218</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520303</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Mobilization of a transposon in the rice genome</p>
            </title>
            <aug>
               <au>
                  <snm>Nakazaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Okumoto</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Horibata</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yamahira</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Teraishi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Inoue</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Tanisaka</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>421</volume>
            <fpage>170</fpage>
            <lpage>172</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01219</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520304</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Evaluation of <it>Hbr </it>(MITE) markers for assessment of genetic relationships among maize (<it>Zea mays </it>L.) inbred lines</p>
            </title>
            <aug>
               <au>
                  <snm>Casa</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>OS</fnm>
               </au>
               <au>
                  <snm>Register III</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Wessler</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Kresovich</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Theor Appl Genet</source>
            <pubdate>2002</pubdate>
            <volume>104</volume>
            <fpage>104</fpage>
            <lpage>110</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s001220200012</pubid>
                  <pubid idtype="pmpid" link="fulltext">12579434</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p><it>Rim2/Hipa </it>CACTA transposon display; a new genetic marker technique in <it>Oryza </it>species</p>
            </title>
            <aug>
               <au>
                  <snm>Kwon</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>NS</fnm>
               </au>
            </aug>
            <source>BMC Genetics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1079816</pubid>
                  <pubid idtype="pmpid" link="fulltext">15766385</pubid>
                  <pubid idtype="doi">10.1186/1471-2156-6-15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The <it>DcMaster </it>Transposon Display maps polymorphic insertion sites in the carrot (<it>Daucus carota </it>L.) genome</p>
            </title>
            <aug>
               <au>
                  <snm>Grzebelus</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jagosz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2007</pubdate>
            <volume>390</volume>
            <fpage>67</fpage>
            <lpage>74</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2006.07.041</pubid>
                  <pubid idtype="pmpid" link="fulltext">17011731</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Microsatellite diversity and broad scale geographic structure in a model legume: building a set of nested core collection for studying naturally occurring variation in <it>Medicago truncatula</it></p>
            </title>
            <aug>
               <au>
                  <snm>Ronfort</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bataillon</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Santoni</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Delalande</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>David</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Prosperi</snm>
                  <fnm>J-M</fnm>
               </au>
            </aug>
            <source>BMC Plant Biology</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>28</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1762007</pubid>
                  <pubid idtype="pmpid" link="fulltext">17166278</pubid>
                  <pubid idtype="doi">10.1186/1471-2229-6-28</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Medicago sequencing resources</p>
            </title>
            <url>http://www.medicago.org/genome/</url>
         </bibl>
         <bibl id="B33">
            <title>
               <p>TIRfinder</p>
            </title>
            <url>http://www.sourceforge.net/projects/TIRfinder/</url>
         </bibl>
         <bibl id="B34">
            <title>
               <p><it>Ab initio </it>gene finding in Drosophila genomic DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Salamov</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Solovyev</snm>
                  <fnm>VV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>516</fpage>
            <lpage>522</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310882</pubid>
                  <pubid idtype="pmpid" link="fulltext">10779491</pubid>
                  <pubid idtype="doi">10.1101/gr.10.4.516</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Annotation, submission and screening of repetitive elements in Repbase: RepbaseSubmitter and Censor</p>
            </title>
            <aug>
               <au>
                  <snm>Kohany</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Gentles</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Hankus</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>474</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1634758</pubid>
                  <pubid idtype="pmpid" link="fulltext">17064419</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-7-474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>T-Coffee: A novel method for multiple sequence alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>PHYLIP</p>
            </title>
            <url>http://evolution.genetics.washington.edu/phylip.html</url>
         </bibl>
         <bibl id="B38">
            <aug>
               <au>
                  <snm>Venables</snm>
                  <fnm>WN</fnm>
               </au>
               <au>
                  <snm>Ripley</snm>
                  <fnm>BD</fnm>
               </au>
            </aug>
            <source>Modern Applied Statistics with S. Springer, New York</source>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B39">
            <title>
               <p>An efficient algorithm for a complete link method</p>
            </title>
            <aug>
               <au>
                  <snm>Defays</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Comput J</source>
            <pubdate>1977</pubdate>
            <volume>20</volume>
            <fpage>364</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/comjnl/20.4.364</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <aug>
               <au>
                  <snm>Borg</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Groenen</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Modern Multidimensional Scaling: Theory and Applications. Springer-Verlag New York</source>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B41">
            <title>
               <p>BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT</p>
            </title>
            <aug>
               <au>
                  <snm>Hall</snm>
                  <fnm>TA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Symp Ser</source>
            <pubdate>1999</pubdate>
            <volume>41</volume>
            <fpage>95</fpage>
            <lpage>98</lpage>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Blast 2 sequences &#8211; a new tool for comparing protein and nucleotide sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
            </aug>
            <source>FEMS Microbiol Lett</source>
            <pubdate>1999</pubdate>
            <volume>174</volume>
            <fpage>247</fpage>
            <lpage>250</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1574-6968.1999.tb13575.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10339815</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>YASS: enhancing the sensitivity of DNA similarity search</p>
            </title>
            <aug>
               <au>
                  <snm>Noe</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kucherov</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>W540</fpage>
            <lpage>W543</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160238</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980530</pubid>
                  <pubid idtype="doi">10.1093/nar/gki478</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>genomic DNA local alignment similarity search tool</p>
            </title>
            <url>http://bioinfo.lifl.fr/yass/</url>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Nucleic Acid Dot Plots</p>
            </title>
            <url>http://www.vivo.colostate.edu/molkit/dnadot/</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p>mreps: efficient and flexible detection of tandem repeats in DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Kolpakov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bana</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kucherov</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3672</fpage>
            <lpage>3678</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169196</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824391</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg617</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>mreps</p>
            </title>
            <url>http://bioinfo.lifl.fr/mreps/</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Gene Indices &#8211; Blast Search</p>
            </title>
            <url>http://compbio.dfci.harvard.edu/tgi/cgi-bin/tgi/Blast/index.cgi</url>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Primer3</p>
            </title>
            <aug>
               <au>
                  <snm>Rozen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Skaletsky</snm>
                  <fnm>HJ</fnm>
               </au>
            </aug>
            <pubdate>1998</pubdate>
            <url>http://primer3.sourceforge.net</url>
         </bibl>
      </refgrp>
   </bm>
</art>
