<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-6-119</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Computational evidence for hundreds of non-conserved plant microRNAs</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Lindow</snm>
               <fnm>Morten</fnm>
               <insr iid="I1"/>
               <email>morten@binf.ku.dk</email>
            </au>
            <au id="A2" ca="yes">
               <snm>Krogh</snm>
               <fnm>Anders</fnm>
               <insr iid="I1"/>
               <email>krogh@binf.ku.dk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Centre, Institute of Molecular Biology, University of Copenhagen, Denmark</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2005</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>119</fpage>
         <url>http://www.biomedcentral.com/1471-2164/6/119</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16159385</pubid>
               <pubid idtype="doi">10.1186/1471-2164-6-119</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>14</day>
               <month>4</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>13</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>13</day>
               <month>9</month>
               <year>2005</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2005</year>
         <collab>Lindow and Krogh; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>MicroRNAs (miRNAs) are small (20&#8211;25 nt) non-coding RNA molecules that regulate gene expression through interaction with mRNA in plants and metazoans. A few hundred miRNAs are known or predicted, and most of those are evolutionarily conserved. In general, plant miRNAs differ from their animal counterparts: most plant miRNAs show near-perfect complementarity to their targets. Exploiting this complementarity, we have developed a method for identifying plant miRNAs that does not rely on phylogenetic conservation.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Using the presumed targets for the known miRNA as positive controls, we list and filter all segments of the genome of length ~20 that are complementary to a target mRNA-transcript. From the positive control we recover 41 (of 92 possible) of the already known miRNA-genes (representing 14 of 16 families) with only four false positives.</p>
               <p>Applying the procedure to find possible new miRNAs targeting any annotated mRNA, we predict 592 new miRNA genes, many of which are not conserved in other plant genomes. A subset of our predicted miRNAs is additionally supported by having more than one non-homologous target.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>These results indicate that it is possible to reliably predict miRNA-genes without using genome comparisons. Furthermore, they suggest that the number of plant miRNAs has been underestimated and point to the existence of recently evolved miRNAs in <it>Arabidopsis</it>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="refman"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>MicroRNAs (miRNAs), 20&#8211;25 nucleotides in length, are involved in negative post-transcriptional regulation in most multi-cellular organisms (for a review see e.g. <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>). The generality and importance of this recently discovered regulatory mechanism is gradually becoming apparent. Here we present computational evidence for new miRNAs indicating that they are more abundant than previously believed, and argue that they play a major role in evolution.</p>
         <p>Most of the miRNAs identified so far are conserved in other species, some remarkably well<abbrgrp><abbr bid="B3">3</abbr></abbrgrp>. Previous computational screens for miRNA have relied on this evolutionary conservation to identify a few hundred putative miRNAs in vertebrates<abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, <it>C. elegans</it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, and plants <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>, and many have been experimentally confirmed (reviewed in <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>). However, these screens miss all miRNAs that have diverged since the last common ancestor of the genomes under comparison. A recent study using a combined bioinformatic and high-throughput experimental approach has identified 53 miRNAs not conserved beyond primates<abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. In plants, where comparisons have been between the distantly related <it>A. thaliana </it>(thale cress) and <it>O. sativa </it>(rice) genomes that diverged some 200 million years ago<abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, it is probable that there are miRNAs which have escaped detection. Of the 112 <it>Arabidopsis </it>miRNA-genes currently registered<abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, only 56 are conserved in the monocot rice (see methods section), indicating the existence of a substantial number of unconserved miRNA-genes. miRNAs and short interfering RNAs (siRNAs) are very similar in function, but different in biogenesis. According to the current nomenclature<abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, both miRNAs and siRNAs are 20&#8211;25 nucleotides long single-stranded molecules that arise from processing of double-stranded RNA (dsRNA) precursors. They are distinguished by the type of dsRNA they are excised from. While siRNAs come from long exogenous or endogenous dsRNA molecules (very long hairpins or RNA duplexes), mature miRNAs come from the stem region of shorter hairpins.</p>
         <p>The mature miRNA or siRNA forms part of the RNA-induced silencing complex (RISC) that binds to mRNAs. miRNAs/siRNAs that bind with almost perfect complementarity to an mRNA often cause cleavage of their target. Currently it seems that the higher the degree of complementarity to a target mRNA, the greater the chance of that target being degraded. miRNAs with imperfect complementarity to a 3' untranslated region of an mRNA have been shown to inhibit translation of the mRNA<abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>When the base pairing between the miRNA and the target is incomplete it is non-trivial to identify targets for a miRNA <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. In plants, however, most of the known miRNAs pair almost perfectly with one or more mRNAs, making it straightforward to identify likely plant targets (miRNAs often have more than one target). Using this observation it is possible to predict miRNA candidates in <it>Arabidopsis </it>that exhibit near perfect base pairing with the targets, without relying on homology to other organisms<abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Here this idea is extended and refined to yield a highly specific screen that finds plant miRNAs in numbers much larger than previously thought.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Identification of non-conserved miRNAs</p>
            </st>
            <p>The general approach is outlined in Figure <figr fid="F1">1</figr>. Initially, an mRNA is compared with the genomic sequence to identify matching regions of 20&#8211;27 nucleotides with at most 2 mismatches (allowing 3 mismatches produced more than 10 000 matches per mRNA). These are called micromatches, and the genomic part is referred to as a genomic match. An average mRNA gives rise to about 1000 such micromatches, the vast majority (often all) of which we assume are spurious non-miRNA hits. However, it is possible, without comparing to other genomes, to filter the micromatches and achieve highly specific and fairly sensitive predictions of miRNA genes (Figure <figr fid="F1">1</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Procedure for miRNA prediction</p>
               </caption>
               <text>
                  <p><b>Procedure for miRNA prediction. </b>The number of matches between an mRNA and a segment of the genome (micromatches) after each step is shown in parentheses. mRNAs are compared with the genomic sequence to identify matching regions of 20&#8211;27 nucleotides with at most 2 mismatches. Matches overlapping annotated exons, repeats or low-complexity regions are discarded. Additionally, the miRNA:mRNA-duplexes must be stable and the potential miRNAs must have a structure similar to known miRNAs to be included in the base set predictions. The multi-target set is a more reliable subset consisting of those predictions that have more than one target. See text for more details.</p>
               </text>
               <graphic file="1471-2164-6-119-1"/>
            </fig>
            <p>Six filters were used to identify a base set of genomic sequences as candidate miRNAs (with percentages of the initial micromatches that were remaining after each filter given in brackets): (1) they had high sequence complexity (26.9%); (2) they had no overlap with annotated exons on the same or the opposite strand (3.3%); (3) they had no overlap with repeat sequences defined by RepeatMasker (2.6%); (4) the putative miRNA:mRNA duplex should be relatively stable<abbrgrp><abbr bid="B17">17</abbr><abbr bid="B21">21</abbr></abbrgrp> with a calculated free energy of less than -34 kcal/mol (0.20%); (5) they had no more than 10 identical copies in the genome (0.19%), to eliminate repeated sequences not detected by standard repeat-masking; and (6) the miRNA was contained within a precursor structure that was similar to those observed in known <it>Arabidopsis </it>miRNA precursors, i.e. was predicted to be largely contained (at least 16 paired bases) within the stem of a double stranded stem-loop structure whose stem was predicted to have a free energy less than -60 kcal/mol, with at least 4 paired bases flanking the putative miRNA, and an intervening loop larger than 9 but less than 130 bases (0.0002%).</p>
            <p>Although the base set predictions have a low number of false positives (see below), they can be refined further to identify a subset of the predictions with extra confidence, because the probability of more than one mRNA matching a falsely predicted miRNA is minimal, unless the matching mRNA-targets are close homologs (in which case the multiple targets do not add much extra confidence). Most of the known miRNAs in <it>Arabidopsis </it>are thought to have multiple targets, often within the same family of homologous proteins<abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. If a known miRNA only has targets in a highly conserved protein family, this filter can however be expected to falsely eliminate it.</p>
            <p>In order to check the validity of our approach we took the mRNA targets of the known miRNAs and set out to see if, using these as queries, we would be able to correctly identify the known miRNA-genes. Of the 112 precursor sequences registered in RFAM (ver 5.1), we were able to map 92 perfectly to the current RefSeq assembly (TIGR ver 5.0) of the <it>Arabidopsis </it>genome; the remaining precursors were excluded from the positive control set. Likely targets for <it>Arabidopsis </it>miRNAs have previously been predicted allowing for up to 3 mismatches<abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Repeating this procedure we find that our known miRNAs match 142 different annotated mRNA*. These are the positive control targets (referred to as 'known targets') and many have been experimentally confirmed<abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Initially, the 142 mRNAs in the positive control set yielded 359,976 micromatches after removal of low complexity sequences. However, the filtering procedure reduces this dramatically to 45 different loci (41 of which are already known) representing 16 different families (14 known). Assuming that the 'unknown' loci we find are false positives, the procedure has 91% specificity and 45% sensitivity on the level of loci identified. Using the refinement step requiring more than one non-homologous target, only true positives are found, but at the expense of halving the sensitivity to 22%. The validity of the estimates of specificity and sensitivity is discussed below.</p>
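            <p>The two performance figures follow directly from the locus counts given above. The short Python sketch below reproduces the arithmetic; the variable and function names are illustrative only. Note that 'specificity' is used here in the sense of the fraction of predicted loci that are already known (often called precision), which is our reading of how the 91% was obtained.</p>

```python
# Counts taken from the text above; the names are illustrative.
known_loci_in_control = 92   # known miRNA genes mapped to the genome
predicted_loci = 45          # loci remaining after all filters
recovered_known = 41         # predicted loci that are already known


def percentage(part, whole):
    """Return part/whole as a percentage rounded to the nearest integer."""
    return round(100 * part / whole)


# Fraction of predictions that are known miRNA genes.
specificity = percentage(recovered_known, predicted_loci)
# Fraction of known miRNA genes that are recovered.
sensitivity = percentage(recovered_known, known_loci_in_control)

print(specificity, sensitivity)
```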
         </sec>
         <sec>
            <st>
               <p>Hundreds of novel miRNAs</p>
            </st>
            <p>Applying the micromatcher procedure to all 28860 mRNAs annotated in <it>Arabidopsis </it>identifies 592 miRNA candidate loci (480 families) in the base set (<supplr sid="S1">Additional file 1</supplr>). In the final step this is reduced to a set of 90 (70 new) when more than one non-homologous target per miRNA is required. This is called the multi-target set and is a subset of the base set.</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Predicted miRNA genes. List of predicted miRNA-genes, their predicted targets, genomic location and graphics showing predicted structure of the precursors.</p>
               </text>
               <file name="1471-2164-6-119-S1.html">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>All miRNA gene predictions, their targets (with some basic annotation) and the predicted secondary structure of the precursor are available as supplementary data [<supplr sid="S1">Additional file 1</supplr>], and at our website<abbrgrp><abbr bid="B26">26</abbr></abbrgrp>.</p>
            <p>Using public databases we were able to acquire evidence for the expression of a small number of the predictions: 9 in the base set overlap with RNA molecules recently sequenced in a large scale cloning effort of <it>Arabidopsis </it>small RNA<sup>4</sup>, 109 have significant matches to <it>Arabidopsis </it>ESTs, and 52 of the predicted precursors contain a 20-mer sequence tag from the <it>Arabidopsis MPSS database</it><abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Evolutionary conservation of the predicted miRNA-genes</p>
            </st>
            <p>From an evolutionary point of view, it would seem to be a lot easier to adapt 20 bases in a miRNA for a new target than to evolve a protein for a specific regulatory task.</p>
            <p>For mammals it has been suggested that the more targets a microRNA has the more likely it is to be conserved<abbrgrp><abbr bid="B28">28</abbr></abbrgrp> because of the additional constraints of having to match multiple targets.</p>
            <p>The same holds for plants: comparison of our predictions in <it>Arabidopsis </it>to two other plant species reveals that the more targets a miRNA is predicted to have, the more likely it is to be conserved (Figure <figr fid="F2">2</figr>). Although no <it>Brassica </it>species is yet completely sequenced and we had to use the combined set of all individual <it>Brassica </it>sequence entries from GenBank, significantly more of the predicted miRNAs are conserved in <it>Brassica </it>than in rice, indicating that many miRNA-genes have diverged beyond recognition since the divergence of monocots and dicots approximately 200 million years ago.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Duplex energy is a strong discriminant between true and false micromatches</p>
               </caption>
               <text>
                  <p><b>Duplex energy is a strong discriminant between true and false micromatches. </b>The procedure was started with 142 mRNAs targeted by known miRNAs. Micromatches were filtered for low-complexity, overlap with exons and repeats. Then the remaining micromatches were divided in two bins: true positives (green trace) that overlap with known miRNA genes and false positives (red trace) that do not.</p>
               </text>
               <graphic file="1471-2164-6-119-2"/>
            </fig>
            <p>Thus, we speculate that the highly conserved miRNAs are likely to be central regulators, often of many target mRNAs (imposing the evolutionary constraint to stay conserved), and are more likely to be highly expressed, whereas more recently evolved miRNAs would have fewer targets and a more localized spatiotemporal expression, making them less likely to be detected by cloning efforts.</p>
            <p>Since evolutionary conservation is part of many of the previous discovery procedures, it is likely that the set of known miRNAs is biased towards those that are conserved, and our data suggest that in fact, miRNAs evolve fast and are less conserved than e.g. protein-coding genes.</p>
            <p>It has been proposed that some miRNAs originate from inverted duplication of target sequences, exemplified by the single locus miRNAs miR-161 and miR-163, which have precursors that show extended homology to the target mRNAs also outside the mature miRNA sequence<abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. However, our structural filters require that the match between miRNA and target is in the range 20&#8211;25, effectively eliminating such miRNA with extended homology.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison to other studies</p>
            </st>
            <p>Of the predicted 592 precursors in the base set, 29 overlap with the 92 predictions made by Bonnet et al.<abbrgrp><abbr bid="B30">30</abbr></abbrgrp>, and 4 of those by Wang et al.<abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Thus, the different methods complement each other: The present method based on matching targets and miRNA is capable of finding non-conserved miRNAs, whereas the interspecies comparisons<abbrgrp><abbr bid="B8">8</abbr><abbr bid="B31">31</abbr></abbrgrp> can find miRNAs without obvious targets.</p>
            <p>The idea to use potential targets to find miRNA-genes has recently been employed in two other studies. Xie et al. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> started by finding frequently occurring subsequences of human 3' UTR sequences conserved in other mammals and successfully searched the genome for new miRNA genes.</p>
            <p>Moreover, Adai and coworkers<abbrgrp><abbr bid="B33">33</abbr></abbrgrp> published results in <it>Arabidopsis </it>using potential targets to find new miRNA-genes. However, our approach differs significantly from theirs in the way the matches (that we term micromatches) are analysed and the kind of conclusions that can be drawn: Adai et al. look for a 'cluster' of miRNA-genes that target the same sequence of an mRNA, and then align the candidates in such a cluster, scoring the alignment high if it shows a characteristic pattern where the miRNA and miRNA* are more conserved than the intervening sequence. Thus, their method is limited to finding miRNAs that occur more than once in the genome, presumably as a result of duplication events. Moreover, as a postfilter, Adai et al. require conservation in rice to generate their short-list used for experimental validation. Also, Adai et al. do not make any estimation of the specificity of their computational procedure and are consequently unable to speculate about the number of miRNAs.</p>
            <p>In contrast, our method is independent of whether a candidate has been duplicated in the genome or is conserved across species. Instead, our aggressive filtering on the structural properties of the precursor enables us to make highly specific predictions (judging from the results using targets for known miRNAs as queries).</p>
            <p>The multi-target miRNAs have a total of 528 different mRNA targets, which are involved in a variety of functions, but there is a notable over-representation of proteins with transcription factor activity and receptor binding activity as well as involvement in developmental processes (false discovery rate &lt; 0.001, see <supplr sid="S2">Additional file 2</supplr>). The predicted miRNA-genes are generally found scattered throughout the genome (Table <tblr tid="T2">2</tblr>). Unlike in mammals, where 90 out of 232 miRNA-genes are within introns of protein coding genes <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, there is only one previously discovered <it>Arabidopsis </it>microRNA situated in an intron. This tendency of plant microRNAs to lie outside protein-coding genes also holds for our base set predictions and even more strongly for the multiple target predictions (Table <tblr tid="T2">2</tblr>).</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>Functional analysis of the predicted miRNA targets. Analysis of overrepresented Gene Ontology terms among the mRNAs predicted to be targeted by miRNAs.</p>
               </text>
               <file name="1471-2164-6-119-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>The distribution of predicted miRNA-genes in relation to genomic features. IGR, intergenic region. The ratio of the number of bases annotated as intergenic vs. intron is 3.1 in the genome as a whole.</p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c cspan="3" ca="left">
                        <p>
                           <b>Position of predicted miRNA genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Base set</p>
                     </c>
                     <c ca="center">
                        <p>>1 target</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total number of loci</p>
                     </c>
                     <c ca="center">
                        <p>592</p>
                     </c>
                     <c ca="center">
                        <p>90</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>In introns (sense strand)</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>In introns (antisense strand)</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>In intergenic regions (both strands)</p>
                     </c>
                     <c ca="center">
                        <p>550</p>
                     </c>
                     <c ca="center">
                        <p>85</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Within 500 bases upstream of gene</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Within 500 bases downstream of gene</p>
                     </c>
                     <c ca="center">
                        <p>52</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ratio IGR/introns</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>Although estimating the sensitivity and specificity on the basis of the ability to correctly identify the small set of known miRNAs carries the danger of bias, the most important concern at present must be not to massively overpredict new miRNA-genes. In constructing the filters we have therefore aimed for high specificity at the expense of sensitivity. While false positives undoubtedly remain, the fact that the predictions share the properties of functional overrepresentation and bias of genomic location (properties not selected for in the filters) with known miRNAs provides independent indication that we do not massively overpredict new miRNA-genes.</p>
            <p>It is becoming evident that many regions between protein coding genes are transcribed (e.g. <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>). Indeed, given the cases of miRNAs that have been suggested to regulate other miRNAs<abbrgrp><abbr bid="B37">37</abbr></abbrgrp> or RNAs that guide DNA methylation<abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, it would be interesting to extend our filtered intragenomic match approach to identify other possible miRNAs whose targets are not mRNAs.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The present analysis predicts 71 new <it>Arabidopsis </it>miRNA genes with very few false positives (estimated specificity is 100%) and over five hundred with an estimate of 9% false predictions. The procedure misses some real miRNAs, such as those encoded in untranslated regions of genes, those with very many targets (classified as repeats by our method), and those not fulfilling our strict structural constraints, and we believe that the real number could be several thousand. Although the predictions should eventually be confirmed in the lab, our data suggest that the <it>Arabidopsis </it>genome encodes substantially more miRNA genes than previously thought, and that the number of miRNAs is comparable to the number of protein transcription factors. Our results also indicate that many miRNAs are specific to small groups of related species and we speculate that they could play a part in speciation. Finally, we find it unlikely that these conclusions are specific to plants, and we hypothesize that they extend to most other multicellular organisms.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sequences</p>
            </st>
            <p><it>Arabidopsis </it>genome and annotation were the RefSeq sequences based on the 5.0 version released by TIGR. Known miRNAs were from the 5.1 release of the microRNA registry<abbrgrp><abbr bid="B39">39</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>The micromatcher procedure</p>
            </st>
            <sec>
               <st>
                  <p>Finding all micromatches</p>
               </st>
               <p>For each annotated spliced mRNA we exhaustively searched the genome for micromatches of length at least 20 with a maximum of 2 mismatches (no gaps allowed) using the suffix-array-based program vmatch<abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. This search took 6 days on an Intel Xeon 2.2 GHz machine running Linux.</p>
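               <p>To make the matching criterion concrete, the following Python sketch enumerates gap-free matches of a fixed length with a bounded number of mismatches. It is a naive quadratic scan for illustration only and is not how vmatch works. The function names, the fixed window length of 20, and the choice to compare each mRNA window against the reverse complement of the genomic window on a single strand are assumptions made for this example.</p>

```python
# Illustrative sketch only: a brute-force stand-in for the suffix-array search.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def hamming(a, b):
    """Count mismatching positions between two equal-length strings."""
    return sum(1 for x, y in zip(a, b) if x != y)


def find_micromatches(mrna, genome, length=20, max_mismatches=2):
    """Return (mrna_pos, genome_pos, mismatches) for every gap-free match.

    A genomic window is reported when it equals the reverse complement of
    an mRNA window at all but at most max_mismatches positions.
    """
    hits = []
    for i in range(len(mrna) - length + 1):
        wanted = reverse_complement(mrna[i:i + length])
        for j in range(len(genome) - length + 1):
            mismatches = hamming(wanted, genome[j:j + length])
            if max_mismatches >= mismatches:
                hits.append((i, j, mismatches))
    return hits


# Toy example: a 20-nt target site and a genome containing its complement.
site = "ACGTTGCAAGGCTTAACCGG"
toy_genome = "TTTT" + reverse_complement(site) + "GGGG"
print(find_micromatches(site, toy_genome))
```

               <p>A scan of this kind is only practical for toy inputs; an indexed search is needed for a complete genome.</p>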
               <p>Note about the positive control set of mRNAs: To select the positive control mRNA-targets we allow for 3 mismatches over the whole length of the mature miRNA; this potentially includes in the positive control set mRNAs that will be unable to recover the matching miRNA when allowing only 2 mismatches over a length of 20 bases (the criterion used later). This discrepancy can lead to an overly pessimistic estimate of the performance of the procedure.</p>
            </sec>
            <sec>
               <st>
                  <p>Low-complexity filter</p>
               </st>
               <p>Genomic micromatches failing a simple low-complexity filter were discarded. To pass the filter, 1) all four bases had to be present at least once, and 2) at most 11 of the three most frequent dinucleotides in the sequence were allowed.</p>
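               <p>The two conditions can be sketched in Python as follows. The second condition is interpreted here as a limit of 11 on the combined number of occurrences of the three most frequent (overlapping) dinucleotides; this reading, like the function and parameter names, is an assumption made for illustration.</p>

```python
from collections import Counter


def passes_low_complexity_filter(seq, max_top3_dinucleotides=11):
    """Return True if seq passes both conditions of the sketched filter."""
    seq = seq.upper()
    # Condition 1: all four bases must occur at least once.
    if not all(base in seq for base in "ACGT"):
        return False
    # Condition 2 (assumed reading): the three most frequent overlapping
    # dinucleotides may together occur at most max_top3_dinucleotides times.
    dinucleotides = Counter(seq[i:i + 2] for i in range(len(seq) - 1))
    top3_total = sum(count for _, count in dinucleotides.most_common(3))
    return max_top3_dinucleotides >= top3_total


print(passes_low_complexity_filter("ACGTTGCAAGGCTTAACCGG"))
print(passes_low_complexity_filter("ATATATATATATATATATAT"))
```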
            </sec>
            <sec>
               <st>
                  <p>Duplex stability</p>
               </st>
               <p>Using the program RNAcofold (Vienna RNA package<abbrgrp><abbr bid="B41">41</abbr></abbrgrp>) the free energy change when a microRNA-candidate binds to a target site was calculated. Micromatches where this duplex energy is larger than -34 kcal/mol were discarded.</p>
            </sec>
            <sec>
               <st>
                  <p>Long matches</p>
               </st>
               <p>Micromatches longer than 26 residues were discarded. To ascertain that a micromatch was not part of a longer match, the two parts of the micromatch extended by 50 bases to each side were aligned with bl2seq (two sequence NCBI blast), and those with a match longer than 26 were discarded.</p>
            </sec>
            <sec>
               <st>
                  <p>Overlaps with known features and repeats</p>
               </st>
               <p>A micromatch was discarded if it had any bases in common with annotated exons (including matches to the reverse strand of the exon) or repeats as determined by RepeatMasker<abbrgrp><abbr bid="B42">42</abbr></abbrgrp> run with <it>Arabidopsis </it>specific repeat libraries (RepBase Update 8.12, RM database version 20040306).</p>
            </sec>
            <sec>
               <st>
                  <p>Copy number</p>
               </st>
               <p>In addition to traditional repeat masking, which relies on the identification of <it>known </it>repeats, we applied a pragmatic repeat filter: we determined the number of times each candidate sequence occurs in the entire genome and removed candidates with a copy number higher than 10.</p>
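               <p>A copy-number count of this kind could look like the following Python sketch. It is an illustration only, with hypothetical function names. It assumes exact matching on both strands and counts overlapping occurrences. Whether mismatches were tolerated in the original count is not stated here.</p>

```python
def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def copy_number(candidate, genome):
    """Count exact, possibly overlapping occurrences on both strands."""
    def count(pattern):
        n, pos = 0, genome.find(pattern)
        while pos != -1:
            n += 1
            pos = genome.find(pattern, pos + 1)
        return n

    rc = reverse_complement(candidate)
    # A palindromic candidate matches the same sites on both strands,
    # so it is counted once to avoid doubling.
    if rc == candidate:
        return count(candidate)
    return count(candidate) + count(rc)
```

               <p>With such a function, the filter keeps a candidate only if its copy number is not higher than 10.</p>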
            </sec>
            <sec>
               <st>
                  <p>Filtering on properties of the possible precursor</p>
               </st>
               <p>In order to predict a possible precursor molecule, two genomic sequences around each micromatch were extracted: one starting 10 bases 5' of the micromatch and extending 240 bases 3' of the micromatch, and one with the extension lengths reversed. Each of these was treated independently in the following analysis. First the potential precursor sequence was folded with RNAfold<abbrgrp><abbr bid="B43">43</abbr></abbrgrp> to find the minimum free energy structure. The folding free energies are comparable, because all sequences are of almost equal length. Candidates with a folding free energy larger (less negative) than -60 kcal/mol were discarded. This is a highly permissive filter. The mature miRNA had to be fully contained in a double-stranded region of the precursor. The complementary part of the miRNA in this stem is denoted miRNA*. All base pairs between the miRNA and the miRNA* were required to pair in the same direction, opposite each other. The number of paired bases in the mature miRNA was required to be 16 or more.</p>
               <p>In the known miRNA precursors, the stem is always longer than the mature miRNA alone. To find how far the stem of a candidate extends from the mature miRNA, we counted how far the stem extends inward towards the loop and outward towards the ends of the RNA string using the following algorithm: moving out from the terminal base pair between miRNA and miRNA*, a score of 1 is assigned for each base pair encountered and a score of -1 for each unpaired base. The extension is stopped when the current score falls more than 5 below the maximum score so far. The last base pair is considered the terminus of the stem. Candidates with extensions of less than 4 bases on either side of the mature miRNA were discarded. It was also required that the shortest number of bases between the miRNA and miRNA* was larger than 9 and less than 130.</p>
               <p>Taken together these structural criteria constitute a highly selective, but somewhat conservative filter.</p>
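               <p>The stem-extension scoring described above can be sketched in Python as follows. This is an illustration under stated assumptions, not the program used in this study. The encoding of the stem as a string of "P" (base pair) and "U" (unpaired base) steps is a hypothetical simplification. We read the stop rule as stopping once the running score is more than 5 below the maximum so far, and we take the terminus to be the last base pair met before stopping.</p>

```python
def stem_extension(steps, max_drop=5):
    """Number of steps from the miRNA duplex to the terminus of the stem.

    steps: string over {"P", "U"} listing what is met when moving away
    from the terminal miRNA:miRNA* base pair ("P" = base pair,
    "U" = unpaired base). This encoding is a hypothetical simplification.
    """
    score = best = 0
    last_pair = 0  # step index of the last base pair seen so far
    for i, step in enumerate(steps, start=1):
        if step == "P":
            score += 1
            last_pair = i
        else:
            score -= 1
        best = max(best, score)
        if best - score > max_drop:
            break  # score fell more than max_drop below the maximum
    return last_pair
```

               <p>A candidate would then pass the extension criterion if the smaller of the inward and outward values of stem_extension is at least 4.</p>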
            </sec>
         </sec>
         <sec>
            <st>
               <p>Matches to ESTs and ASRP</p>
            </st>
            <p>BLASTN was used to search all <it>Arabidopsis </it>ESTs downloaded from GenBank on September 27, 2004. Hits longer than 70 nucleotides with more than 95% identity between a predicted precursor and an EST were considered positive. Sequences cloned and sequenced as part of the <it>Arabidopsis </it>Small RNA Project (ASRP)<abbrgrp><abbr bid="B44">44</abbr></abbrgrp> were downloaded from <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. All matches at least 15 nucleotides long with at most one mismatch to our predicted mature miRNA sequences were found using vmatch<abbrgrp><abbr bid="B46">46</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Conservation in other genomes</p>
            </st>
            <p>To determine how many of our predictions were conserved in other plant genomes, we used BLAST to search the predicted <it>Arabidopsis </it>precursors against the rice genome and <it>Brassica </it>sequence downloaded from <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. A miRNA prediction was taken to be conserved if it had a significant (e-value &lt; 0.01) BLAST hit containing the mature miRNA with no more than 2 mismatches, and the homolog had flanking sequence capable of folding back on the mature miRNA with at least 15 base pairs between the miRNA and miRNA*.</p>
         </sec>
         <sec>
            <st>
               <p>The number of non-homologous targets for a putative miRNA</p>
            </st>
            <p>For all candidate microRNAs in the baseset matching more than one mRNA, we found the number of different non-homologous targets by performing single-linkage clustering on the amino acid sequences encoded by the corresponding mRNAs using the program 'blastclust' from NCBI. Two proteins were considered homologous if they had more than 70% identity across at least 50% of their length.</p>
         </sec>
         <sec>
            <st>
               <p>Clustering of micromatches into genomic loci</p>
            </st>
            <p>Micromatches whose genomic start positions lay within 4 nucleotides of each other were grouped into the same locus.</p>
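            <p>One way to realise this grouping is sketched below in Python. It is an illustration only, with a hypothetical function name and input format. It assumes that matches are chained (a match joins a locus if its start lies within 4 nucleotides of the previous start in sorted order) and that only matches on the same chromosome and strand can share a locus.</p>

```python
from collections import defaultdict


def group_into_loci(micromatches, max_gap=4):
    """Group (chromosome, strand, start) tuples into loci.

    Assumptions: matches are chained by sorted start position, and only
    matches on the same chromosome and strand can share a locus.
    """
    by_contig = defaultdict(list)
    for chrom, strand, start in micromatches:
        by_contig[(chrom, strand)].append(start)

    loci = []
    for (chrom, strand), starts in sorted(by_contig.items()):
        starts.sort()
        current = [starts[0]]
        for start in starts[1:]:
            # A gap larger than max_gap closes the current locus.
            if start - current[-1] > max_gap:
                loci.append((chrom, strand, current))
                current = []
            current.append(start)
        loci.append((chrom, strand, current))
    return loci
```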
         </sec>
         <sec>
            <st>
               <p>Clustering of similar miRNA sequences into families</p>
            </st>
            <p>We used the program vmatch<abbrgrp><abbr bid="B48">48</abbr></abbrgrp> to align and perform single-linkage clustering of the predicted mature miRNA sequences. Candidate pairs aligning over at least 17 bases with an edit distance of at most 1 were grouped in the same family.</p>
         </sec>
         <sec>
            <st>
               <p>Functional analysis of targets</p>
            </st>
            <p>We obtained gene ontology annotation (GOSLIM) from <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. For each GOSLIM category we constructed a 2 &#215; 2 contingency table counting the number of targets vs non-targets with or without the GOSLIM annotation. We used R<abbrgrp><abbr bid="B50">50</abbr></abbrgrp> to calculate p-values with Fisher's Exact Test and employed the package 'qvalue'<abbrgrp><abbr bid="B51">51</abbr></abbrgrp> to correct for multiple testing, setting the false discovery rate at 0.001. The results are included as [<supplr sid="S2">Additional file 2</supplr>], along with the R-code used.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>ML and AK designed the study. ML wrote the programs. ML and AK drafted the manuscript. Both authors read and approved the final manuscript.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Multi target predictions tend to be better conserved</p>
            </caption>
            <text>
               <p><b>Multi target predictions tend to be better conserved</b>. The precursor sequences of the predictions were used as queries for a BLAST search against rice (downloaded from tigr.org, March 2004) or <it>Brassica </it>(downloaded from arabidopsis.org, August 2004), respectively. Columns show the proportion of miRNA predictions in <it>Arabidopsis </it>that were found to be conserved. Numbers refer to the actual number of conserved miRNA predictions.</p>
            </text>
            <graphic file="1471-2164-6-119-3"/>
         </fig>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Summary of the results, starting with 136 mRNA targets to known miRNAs or all mRNAs, respectively. Numbers in parentheses indicate the number of already known (RFAM) miRNA genes or families.</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="right">
                     <p>
                        <b>micromatches</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>miRNA genes found</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>distinct families</b>
                     </p>
                  </c>
                  <c ca="right">
                     <p>
                        <b>distinct targets</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c cspan="2" ca="left">
                     <p>
                        <b>
                           <it>Query: known targets</it>
                        </b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Baseset</p>
                  </c>
                  <c ca="right">
                     <p>176</p>
                  </c>
                  <c ca="right">
                     <p>45(41)</p>
                  </c>
                  <c ca="right">
                     <p>16(14)</p>
                  </c>
                  <c ca="right">
                     <p>51</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>>1 non-homologous target</p>
                  </c>
                  <c ca="right">
                     <p>63</p>
                  </c>
                  <c ca="right">
                     <p>20(20)</p>
                  </c>
                  <c ca="right">
                     <p>12(12)</p>
                  </c>
                  <c ca="right">
                     <p>34</p>
                  </c>
               </r>
               <r>
                  <c cspan="2" ca="left">
                     <p>
                        <b>
                           <it>Query: all mRNAs</it>
                        </b>
                     </p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Baseset</p>
                  </c>
                  <c ca="right">
                     <p>927</p>
                  </c>
                  <c ca="right">
                     <p>592</p>
                  </c>
                  <c ca="right">
                     <p>480</p>
                  </c>
                  <c ca="right">
                     <p>656</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>>1 non-homologous target</p>
                  </c>
                  <c ca="right">
                     <p>255</p>
                  </c>
                  <c ca="right">
                     <p>90</p>
                  </c>
                  <c ca="right">
                     <p>73</p>
                  </c>
                  <c ca="right">
                     <p>205</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We wish to thank anonymous reviewers for helpful comments and suggestions.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>MicroRNAs: small RNAs with a big role in gene regulation</p>
            </title>
            <aug>
               <au>
                  <snm>He</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>522</fpage>
            <lpage>531</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1379</pubid>
                  <pubid idtype="pmpid" link="fulltext">15211354</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Perspective: machines for RNAi</p>
            </title>
            <aug>
               <au>
                  <snm>Tomari</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zamore</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <fpage>517</fpage>
            <lpage>529</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gad.1284105</pubid>
                  <pubid idtype="pmpid" link="fulltext">15741316</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Gene regulation: ancient microRNA target sequences in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Floyd</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <fpage>485</fpage>
            <lpage>486</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/428485a</pubid>
                  <pubid idtype="pmpid" link="fulltext">15057819</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Vertebrate microRNA genes</p>
            </title>
            <aug>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Yekta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>299</volume>
            <fpage>1540</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1080372</pubid>
                  <pubid idtype="pmpid" link="fulltext">12624257</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The microRNAs of Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Weinstein</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Abdelhakim</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yekta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <fpage>991</fpage>
            <lpage>1008</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">196042</pubid>
                  <pubid idtype="pmpid" link="fulltext">12672692</pubid>
                  <pubid idtype="doi">10.1101/gad.1074403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Computational Identification of Plant MicroRNAs and Their Targets, Including a Stress-Induced miRNA</p>
            </title>
            <aug>
               <au>
                  <snm>Jones-Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>787</fpage>
            <lpage>799</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2004.05.027</pubid>
                  <pubid idtype="pmpid" link="fulltext">15200956</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes</p>
            </title>
            <aug>
               <au>
                  <snm>Bonnet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wuyts</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>11511</fpage>
            <lpage>11516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">509231</pubid>
                  <pubid idtype="pmpid" link="fulltext">15272084</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404025101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Prediction and identification of Arabidopsis thaliana microRNAs and their mRNA targets</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>XJ</fnm>
               </au>
               <au>
                  <snm>Reyes</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Chua</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R65</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">522872</pubid>
                  <pubid idtype="pmpid" link="fulltext">15345049</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-9-r65</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>RNA silencing in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Baulcombe</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <fpage>356</fpage>
            <lpage>363</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02874</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372043</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Identification of hundreds of conserved and nonconserved human microRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Bentwich</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Avniel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Karov</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Aharonov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gilad</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Barad</snm>
                  <fnm>O</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2005</pubdate>
            <volume>37</volume>
            <fpage>766</fpage>
            <lpage>770</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1590</pubid>
                  <pubid idtype="pmpid" link="fulltext">15965474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Comparative genomics of rice and Arabidopsis. Analysis of 727 cytochrome P450 genes and pseudogenes from a monocot and a dicot</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Paquette</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Werck-Reichhart</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>135</volume>
            <fpage>756</fpage>
            <lpage>772</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514113</pubid>
                  <pubid idtype="pmpid" link="fulltext">15208422</pubid>
                  <pubid idtype="doi">10.1104/pp.104.039826</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The microRNA Registry</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database</issue>
            <fpage>D109</fpage>
            <lpage>D111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308757</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681370</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh023</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>A uniform system for microRNA annotation</p>
            </title>
            <aug>
               <au>
                  <snm>Ambros</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Carrington</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>RNA</source>
            <pubdate>2003</pubdate>
            <volume>9</volume>
            <fpage>277</fpage>
            <lpage>279</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1261/rna.2183803</pubid>
                  <pubid idtype="pmpid" link="fulltext">12592000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>siRNAs can function as miRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Doench</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Petersen</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <fpage>438</fpage>
            <lpage>442</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">195999</pubid>
                  <pubid idtype="pmpid" link="fulltext">12600936</pubid>
                  <pubid idtype="doi">10.1101/gad.1064703</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Specificity of microRNA target selection in translational repression</p>
            </title>
            <aug>
               <au>
                  <snm>Doench</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>504</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">374233</pubid>
                  <pubid idtype="pmpid" link="fulltext">15014042</pubid>
                  <pubid idtype="doi">10.1101/gad.1184404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Specificity of microRNA target selection in translational repression</p>
            </title>
            <aug>
               <au>
                  <snm>Doench</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>PA</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2004</pubdate>
            <volume>18</volume>
            <fpage>504</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">374233</pubid>
                  <pubid idtype="pmpid" link="fulltext">15014042</pubid>
                  <pubid idtype="doi">10.1101/gad.1184404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>MicroRNA targets in Drosophila</p>
            </title>
            <aug>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>John</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gaul</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marks</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>R1</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-5-1-r1</pubid>
                  <pubid idtype="pmpid" link="fulltext">14709173</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Identification of Drosophila MicroRNA Targets</p>
            </title>
            <aug>
               <au>
                  <snm>Stark</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Brennecke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>SM</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <fpage>E60</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">270017</pubid>
                  <pubid idtype="pmpid" link="fulltext">14691535</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0000060</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Prediction of mammalian microRNA targets</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Shih</snm>
                  <fnm>IH</fnm>
               </au>
               <au>
                  <snm>Jones-Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2003</pubdate>
            <volume>115</volume>
            <fpage>787</fpage>
            <lpage>798</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(03)01018-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">14697198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Computational prediction of miRNAs in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Adai</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mlotshwa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Archer-Evans</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Manocha</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Vance</snm>
                  <fnm>V</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>78</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540280</pubid>
                  <pubid idtype="pmpid" link="fulltext">15632092</pubid>
                  <pubid idtype="doi">10.1101/gr.2908205</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Prediction of mammalian microRNA targets</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Shih</snm>
                  <fnm>IH</fnm>
               </au>
               <au>
                  <snm>Jones-Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2003</pubdate>
            <volume>115</volume>
            <fpage>787</fpage>
            <lpage>798</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(03)01018-3</pubid>
                  <pubid idtype="pmpid" link="fulltext">14697198</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Prediction of plant microRNA targets</p>
            </title>
            <aug>
               <au>
                  <snm>Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Reinhart</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>110</volume>
            <fpage>513</fpage>
            <lpage>520</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)00863-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12202040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Prediction of plant microRNA targets</p>
            </title>
            <aug>
               <au>
                  <snm>Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Reinhart</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2002</pubdate>
            <volume>110</volume>
            <fpage>513</fpage>
            <lpage>520</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(02)00863-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12202040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Cleavage of Scarecrow-like mRNA targets directed by a class of Arabidopsis miRNA</p>
            </title>
            <aug>
               <au>
                  <snm>Llave</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Kasschau</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Carrington</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>297</volume>
            <fpage>2053</fpage>
            <lpage>2056</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1076311</pubid>
                  <pubid idtype="pmpid" link="fulltext">12242443</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Regulation of flowering time and floral organ identity by a MicroRNA and its APETALA2-like target genes</p>
            </title>
            <aug>
               <au>
                  <snm>Aukerman</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Sakai</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2003</pubdate>
            <volume>15</volume>
            <fpage>2730</fpage>
            <lpage>2741</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">280575</pubid>
                  <pubid idtype="pmpid" link="fulltext">14555699</pubid>
                  <pubid idtype="doi">10.1105/tpc.016238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <url>http://www.binf.ku.dk/users/morten/mimatcher/arabidopsis/mirnapredictions.html</url>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Arabidopsis MPSS. An online resource for quantitative expression analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Meyers</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Vu</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Tej</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Edberg</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Matvienko</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>135</volume>
            <fpage>801</fpage>
            <lpage>813</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514116</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173564</pubid>
                  <pubid idtype="doi">10.1104/pp.104.039495</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Small regulatory RNAs in mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Makunin</snm>
                  <fnm>IV</fnm>
               </au>
            </aug>
            <source>Hum Mol Genet</source>
            <pubdate>2005</pubdate>
            <volume>14</volume>
            <issue>Spec No 1</issue>
            <fpage>R121</fpage>
            <lpage>R132</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/hmg/ddi101</pubid>
                  <pubid idtype="pmpid" link="fulltext">15809264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Evolution of microRNA genes by inverted duplication of target gene sequences in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Allen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Xie</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Gustafson</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Sung</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Spatafora</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Carrington</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2004</pubdate>
            <volume>36</volume>
            <fpage>1282</fpage>
            <lpage>1290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1478</pubid>
                  <pubid idtype="pmpid" link="fulltext">15565108</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes</p>
            </title>
            <aug>
               <au>
                  <snm>Bonnet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wuyts</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>11511</fpage>
            <lpage>11516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">509231</pubid>
                  <pubid idtype="pmpid" link="fulltext">15272084</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404025101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes</p>
            </title>
            <aug>
               <au>
                  <snm>Bonnet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wuyts</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>11511</fpage>
            <lpage>11516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">509231</pubid>
                  <pubid idtype="pmpid" link="fulltext">15272084</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404025101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Systematic discovery of regulatory motifs in human promoters and 3' UTRs by comparison of several mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kulbokas</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Mootha</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>434</volume>
            <fpage>338</fpage>
            <lpage>345</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03441</pubid>
                  <pubid idtype="pmpid" link="fulltext">15735639</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Computational prediction of miRNAs in Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <snm>Adai</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mlotshwa</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Archer-Evans</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Manocha</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Vance</snm>
                  <fnm>V</fnm>
               </au>
               <etal/>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>78</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540280</pubid>
                  <pubid idtype="pmpid" link="fulltext">15632092</pubid>
                  <pubid idtype="doi">10.1101/gr.2908205</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Identification of Mammalian microRNA Host Genes and Transcription Units</p>
            </title>
            <aug>
               <au>
                  <snm>Rodriguez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ashurst</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Bradley</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Transcriptional maps of 10 human chromosomes at 5-nucleotide resolution</p>
            </title>
            <aug>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dike</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brubaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>308</volume>
            <fpage>1149</fpage>
            <lpage>1154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1108625</pubid>
                  <pubid idtype="pmpid" link="fulltext">15790807</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Identification of transcribed sequences in Arabidopsis thaliana by using high-resolution genome tiling arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Samanta</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Sethi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>DC</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>4453</fpage>
            <lpage>4458</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0408203102</pubid>
                  <pubid idtype="pmpid" link="fulltext">15755812</pubid>
                  <pubid idtype="pmcid">555476</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Complementary miRNA pairs suggest a regulatory role for miRNA:miRNA duplexes</p>
            </title>
            <aug>
               <au>
                  <snm>Lai</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Wiel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2004</pubdate>
            <volume>10</volume>
            <fpage>171</fpage>
            <lpage>175</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1261/rna.5191904</pubid>
                  <pubid idtype="pmpid" link="fulltext">14730015</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>RNA-directed DNA methylation</p>
            </title>
            <aug>
               <au>
                  <snm>Mathieu</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Bender</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>J Cell Sci</source>
            <pubdate>2004</pubdate>
            <volume>117</volume>
            <fpage>4881</fpage>
            <lpage>4888</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1242/jcs.01479</pubid>
                  <pubid idtype="pmpid" link="fulltext">15456843</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The microRNA Registry</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Database</issue>
            <fpage>D109</fpage>
            <lpage>D111</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308757</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681370</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh023</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>The Vmatch large scale sequence analysis software</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Computer program</source>
            <note>4-12-2003</note>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Vienna RNA secondary structure server</p>
            </title>
            <aug>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3429</fpage>
            <lpage>3431</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169005</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824340</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg599</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <url>http://www.repeatmasker.org</url>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Vienna RNA secondary structure server</p>
            </title>
            <aug>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>3429</fpage>
            <lpage>3431</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169005</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824340</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg599</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Genetic and functional diversification of small RNA pathways in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Xie</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Johansen</snm>
                  <fnm>LK</fnm>
               </au>
               <au>
                  <snm>Gustafson</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Kasschau</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Lellis</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Zilberman</snm>
                  <fnm>D</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <fpage>E104</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">350667</pubid>
                  <pubid idtype="pmpid" link="fulltext">15024409</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <url>http://asrp.cgrb.oregonstate.edu/db</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p>The Vmatch large scale sequence analysis software</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Computer program</source>
            <note>4-12-2003</note>
         </bibl>
         <bibl id="B47">
            <url>http://www.arabidopsis.org</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The Vmatch large scale sequence analysis software</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Computer program</source>
            <note>4-12-2003</note>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Functional annotation of the Arabidopsis genome using controlled vocabularies</p>
            </title>
            <aug>
               <au>
                  <snm>Berardini</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Mundodi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reiser</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Huala</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Garcia-Hernandez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2004</pubdate>
            <volume>135</volume>
            <fpage>745</fpage>
            <lpage>755</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514112</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173566</pubid>
                  <pubid idtype="doi">10.1104/pp.104.040071</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <url>http://www.r-project.org</url>
         </bibl>
         <bibl id="B51">
            <title>
               <p>A direct approach to false discovery rates</p>
            </title>
            <aug>
               <au>
                  <snm>Storey</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Journal of the Royal Statistical Society, Series B</source>
            <pubdate>2002</pubdate>
            <volume>64</volume>
            <fpage>479</fpage>
            <lpage>498</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/1467-9868.00346</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
