<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-341</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>MiRFinder: an improved approach and software implementation for genome-wide fast microRNA precursor scans</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Huang</snm>
               <fnm>Ting-Hua</fnm>
               <insr iid="I1"/>
               <email>huangtinghua@webmail.hzau.edu.cn</email>
            </au>
            <au id="A2">
               <snm>Fan</snm>
               <fnm>Bin</fnm>
               <insr iid="I1"/>
               <email>binfan@webmail.hzau.edu.cn</email>
            </au>
            <au id="A3">
               <snm>Rothschild</snm>
               <mi>F</mi>
               <fnm>Max</fnm>
               <insr iid="I2"/>
               <email>mfrothschild@iastate.edu</email>
            </au>
            <au id="A4">
               <snm>Hu</snm>
               <fnm>Zhi-Liang</fnm>
               <insr iid="I2"/>
               <email>zhu@iastate.edu</email>
            </au>
            <au id="A5">
               <snm>Li</snm>
               <fnm>Kui</fnm>
               <insr iid="I3"/>
               <email>klzg@beijingnky.edu</email>
            </au>
            <au id="A6" ca="yes">
               <snm>Zhao</snm>
               <fnm>Shu-Hong</fnm>
               <insr iid="I1"/>
               <email>shzhao@mail.hzau.edu.cn</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Key Lab of Agricultural Animal Genetics, Breeding, and Reproduction of Ministry of Education &amp; Key Laboratory of Swine Genetics and Breeding of Ministry of Agriculture, Huazhong Agricultural University, Wuhan, 430070, P R China</p>
            </ins>
            <ins id="I2">
               <p>Department of Animal Science and Center for Integrated Animal Genomics, Iowa State University, Ames, IA, 50011, USA</p>
            </ins>
            <ins id="I3">
               <p>Department of Gene and Cell Engineering, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100094, P R China</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>341</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/341</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17868480</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-341</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>13</day>
               <month>2</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>17</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>9</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Huang et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>MicroRNAs (miRNAs) are recognized as one of the most important families of non-coding RNAs that serve as important sequence-specific post-transcriptional regulators of gene expression. Identification of miRNAs is an important requirement for understanding the mechanisms of post-transcriptional regulation. Hundreds of miRNAs have been identified by direct cloning and computational approaches in several species. However, there are still many miRNAs that remain to be identified due to lack of either sequence features or robust algorithms to efficiently identify them.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We have evaluated features valuable for pre-miRNA prediction, such as the local secondary structure differences of the stem region of miRNA and non-miRNA hairpins. We have also established correlations between different types of mutations and the secondary structures of pre-miRNAs. Utilizing these features and combining some improvements of the current pre-miRNA prediction methods, we implemented a computational learning method SVM (support vector machine) to build a high throughput and good performance computational pre-miRNA prediction tool called MiRFinder. The tool was designed for genome-wise, pair-wise sequences from two related species. The method built into the tool consisted of two major steps: 1) genome wide search for hairpin candidates and 2) exclusion of the non-robust structures based on analysis of 18 parameters by the SVM method. Results from applying the tool for chicken/human and D. melanogaster/D. pseudoobscura pair-wise genome alignments showed that the tool can be used for genome wide pre-miRNA predictions.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The MiRFinder can be a good alternative to current miRNA discovery software. This tool is available at <url>http://www.bioinformatics.org/mirfinder/</url>.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <sec>
            <st>
               <p>An overview of miRNA</p>
            </st>
            <p>MicroRNA (miRNA) is a special class of endogenic RNA molecules that can down-regulate the expression of protein coding genes at the post-transcriptional level by means of incomplete complementary interactions. The biogenesis of miRNA involves several steps: 1) The majority of long primary transcripts of the miRNA genes are transcribed by RNA polymerase II <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>; 2) The 7-methylguanosine capped and poly(A) tailed transcripts are cleaved by the nuclear RNase III Drosha to release the precursors of miRNA (pre-miRNA) in the nucleus <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>; 3) The precursors of miRNA that possess a thermodynamic stabile hairpin structure are exported into the cytoplasm by Exportin-5 or HASTY <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp> and 4) An additional cleavage in the cytoplasm yields 18&#8211;23 nt mature miRNA <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. The first two miRNAs, lin-4 and let-7, were discovered as important post-transcriptional regulators for the development of Caenorhabditis elegans in the early larval stage <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. Since then, considerable effort has been devoted to finding miRNA genes, and to date, numerous miRNAs have been identified. Recent experiments, aimed at elucidation of the function of miRNAs, have confirmed that many miRNAs are involved in potentially many developmental and physiological processes [summarized in additional file <supplr sid="S1">1</supplr> table 1].</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Supplemental document. The document provided supplemental information of the manuscript.</p>
               </text>
               <file name="1471-2105-8-341-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Existing approaches for miRNA identification</p>
            </st>
            <p>Systematic miRNA identification was first made by the cloning and sequencing of cDNAs prepared from the approximately 22-nuleotide (NT) fraction of total RNA <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. A number of miRNAs from various species have been cloned by this method. However, the expression levels of miRNAs are quite different in different tissues and at different developmental stages <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The expression levels of some miRNAs are too low for easy detection. Moreover, in many cases not all of the tissues and developmental stages were sampled. The majority of miRNAs cloned by this method are abundantly/ubiquitously expressed ones that dominate the extracted RNA products due to technical difficulties.</p>
            <p>Computational methods, using newly acquired genome sequences from a variety of species, represent another useful way to avoid these problems in miRNA identification [summarized in additional file <supplr sid="S1">1</supplr> table 2]. The conserved structure, phylogenetic shadowing and other features of miRNAs suggest that a computational approach may complement well the direct cloning method. A homology search, which can detect homologues of known miRNAs, was first successfully implemented in miRAlign <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. With a primary focus on pair-wise genome sequences, combined with some sequence features to distinguish miRNA and non-miRNA hairpins, a number of tools have successfully predicted miRNA genes that display close homology in two species <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>.</p>
            <p>Furthermore, some machine-learning methods, including the SVM method, have been introduced into miRNA prediction and have been used with some success <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. The SVM method was first introduced by Pfeffer et al. <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The features they used are simple and straightforward: the free energy of folding, the length of the longest symmetrical stem, the count of A, C, G and U nucleotides in the symmetrical stem, and the number of A-U, G-C and G-U pairs in the predicted minimal energy structure. After training they obtained a model that assigned a positive score to 71% of the true positives and to only 3% of false positives. Another set of novel secondary structure description syntaxes were developed by Xue et al. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> who used triplet elements to represent the local contiguous structure-sequence information and proposed a set of new parameters. After training with positive and negative datasets, they achieved a level of about 90% accuracy with human data.</p>
            <p>In three recent studies, RNAmicro, miRNA SVM and miPred extended the usage of SVM in miRNA prediction <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. Utilizing multiple sequence alignments, Hertel et al. developed a SVM based program, RNAmicro, to predict miRNAs in various organisms <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Descriptors introduced into the program include the properties of the hairpin, Z-score related properties and entropy related properties. The tool can be used to recognize microRNA precursors in multiple sequence alignments and has been successfully applied to recent genome-wide surveys of mammals, urochordates and nematodes. The miRNA SVM program introduced by Helvik et al. was based on prediction of 5' Drosha processing sites in hairpins, which are essential for pre-miRNA discovery <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. The classifier can correctly predict the processing site for 50% of the known human 5' miRNAs. The miRNA SVM program used 18 features including the composition properties of the hairpin and a set of processing site related properties. A definitive effort to compile 29 global intrinsic hairpin folding attributes from the pre-miRNA sequences without relying on the comparative genomic information was performed by Kwang et al. <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. They characterized a pre-miRNA at the dinucleotide sequence, hairpin folding, non-linear statistical thermodynamics and topological levels. The SVM classifier model was trained on 200 human pre-miRNAs and 400 non-miRNA hairpins, and achieved 93.50% accuracy.</p>
         </sec>
         <sec>
            <st>
               <p>Motivation of our study</p>
            </st>
            <p>It is commonly recognized that the small miRNA family is quite large. To date, 474 human and 78 fly miRNAs have been discovered, and more are likely to be identified <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. A major concern in miRNA identification now is the need to improve existing prediction methods and develop new methods for better performance and efficiency.</p>
            <p>In a large genome, there are many sequence segments that can fold into hairpin secondary structures similar to pre-miRNA. However, pre-miRNAs are only a very small proportion of these sequence segments. Therefore, distinguishing between miRNA and non-miRNA hairpins is crucial in the computational identification of miRNAs. The hairpin structure of pre-miRNA is a good feature for miRNA prediction, but hairpin structures are not unique to miRNAs. The short length of pre-miRNA sequences, with low specificity relative to the overwhelming number of genome background sequences, makes genome-wide miRNA prediction complicated. The majority of the non-miRNA hairpins residing in a genome can be removed by genome comparisons. The drawback of this method is that multiple genome alignment is computationally intensive. In addition, the existing packages using multiple alignments that detect pre-miRNA candidates may lose real pre-miRNAs that are less conserved or conserved only between two species. On the other hand, the pair-wise genome alignments are relatively easy to implement.</p>
            <p>Combining previously published work, our analyses of the pre-miRNA sequences indicated that the current knowledge of the secondary structure and the mutation characteristics of the pre-miRNAs are incomplete. Comparative analyses and computer simulation revealed a set of mutation-related features valuable for pre-miRNA prediction. Based on the evaluation of the features discovered so far, we have improved the syntax to describe the stem-loop structure for effective miRNA prediction and developed a new tool, miRFinder, which uses a comprehensive combination of many well-selected parameter measurements for improved miRNA prediction. Here we report our successful <it>in silicon </it>prediction of pre-miRNA candidates using miRFinder.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <sec>
            <st>
               <p>Vectors representing the features of pre-miRNA</p>
            </st>
            <p>The miRFinder tool improves the ability to distinguish between miRNA and non-miRNA hairpins by improving the representation of the sequence and structure features of pre-miRNA. Our investigation showed that the relationship between mutation patterns and the secondary structures of pre-miRNA are significantly distinct from that of non-miRNA hairpins. According to most literature, the pre-miRNA coding arm suffers the highest selective pressure, followed by the non-coding arm, stem region, loop region, and flanking sequence. A mutation on the stem region containing the mature miRNA seldom happens <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B27">27</abbr></abbrgrp>. Our analysis revealed that 23 out of the 72 conserved pre-miRNAs between D. melanogaster and D. pseudoobscura have mutations in the stem region. We also found a large number of similar pre-miRNAs that have mutations in the stem region in the human, the mouse and other organisms. Further analysis showed that all of these mutations have only slight changes in the secondary structure of pre-miRNA (Figure <figr fid="F1">1</figr>). We call them neutral mutations. Theoretically, mutations between A and G, and U and C suffer relatively lower selective pressure due to the compatibility of G-U base-pairing in pre-miRNA during evolution, which may increase their mutation frequency (Table <tblr tid="T1">1</tblr>, pMutFeq). Unfortunately the mutation frequencies between A and G, and U and C are not sufficiently different to distinguish the miRNA and non-miRNA hairpins. This is due mainly to the relatively short length of pre-miRNA and the masking effect of the inherent high mutation frequency between A and G, and U and C. However, in non-miRNA hairpins the disturbance of the secondary structure and MFE resulting from mutations is much higher than that of real pre-miRNAs (Table <tblr tid="T1">1</tblr>, pVStrc and pVMFE). The mutation types of pre-miRNAs and their influence on the secondary structure are valuable features for pre-miRNA prediction but have been seldom used for prediction.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Mutation profile of miRNA</p>
               </caption>
               <text>
                  <p>Mutation profile of miRNA. There are three types of mutations that cause slight disturbance of the secondary structure of pre-miRNA: (1) mutations in the loop region; (2) mutations between A and G, U and C in the stem region; (3) mutations in the interrelated region of both arms that do not break the base-pairing. The three types of mutations are marked by the numbers 1, 2 and 3, respectively, under the alignments. The conserved nucleotides are marked as "*".</p>
               </text>
               <graphic file="1471-2105-8-341-1"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Test results of the 18 parameters implemented in miRFinder</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Parameters</b>
                        </p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Pre-miRNA</b>
                        </p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Non-miRNA Hairpins</b>
                        </p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <b>Parameter Test</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>ID</p>
                     </c>
                     <c ca="center">
                        <p>Symbol</p>
                     </c>
                     <c ca="center">
                        <p>Mean</p>
                     </c>
                     <c ca="center">
                        <p>Std. Deviation</p>
                     </c>
                     <c ca="center">
                        <p>Mean</p>
                     </c>
                     <c ca="center">
                        <p>Std. Deviation</p>
                     </c>
                     <c ca="center">
                        <p>F value</p>
                     </c>
                     <c ca="center">
                        <p>T test</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>01</p>
                     </c>
                     <c ca="center">
                        <p>pMFE</p>
                     </c>
                     <c ca="center">
                        <p>-0.444</p>
                     </c>
                     <c ca="center">
                        <p>0.072</p>
                     </c>
                     <c ca="center">
                        <p>-0.218</p>
                     </c>
                     <c ca="center">
                        <p>0.086</p>
                     </c>
                     <c ca="center">
                        <p>1.44</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>02</p>
                     </c>
                     <c ca="center">
                        <p>pVMFE</p>
                     </c>
                     <c ca="center">
                        <p>0.455</p>
                     </c>
                     <c ca="center">
                        <p>0.936</p>
                     </c>
                     <c ca="center">
                        <p>0.506</p>
                     </c>
                     <c ca="center">
                        <p>0.632</p>
                     </c>
                     <c ca="center">
                        <p>0.03</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>03</p>
                     </c>
                     <c ca="center">
                        <p>pVStrc</p>
                     </c>
                     <c ca="center">
                        <p>0.903</p>
                     </c>
                     <c ca="center">
                        <p>1.812</p>
                     </c>
                     <c ca="center">
                        <p>3.635</p>
                     </c>
                     <c ca="center">
                        <p>4.703</p>
                     </c>
                     <c ca="center">
                        <p>0.41</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>04</p>
                     </c>
                     <c ca="center">
                        <p>pMatch</p>
                     </c>
                     <c ca="center">
                        <p>0.892</p>
                     </c>
                     <c ca="center">
                        <p>0.063</p>
                     </c>
                     <c ca="center">
                        <p>0.639</p>
                     </c>
                     <c ca="center">
                        <p>0.176</p>
                     </c>
                     <c ca="center">
                        <p>1.06</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>05</p>
                     </c>
                     <c ca="center">
                        <p>pDI</p>
                     </c>
                     <c ca="center">
                        <p>0.050</p>
                     </c>
                     <c ca="center">
                        <p>0.058</p>
                     </c>
                     <c ca="center">
                        <p>0.088</p>
                     </c>
                     <c ca="center">
                        <p>0.126</p>
                     </c>
                     <c ca="center">
                        <p>0.21</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>06</p>
                     </c>
                     <c ca="center">
                        <p>pMismatch</p>
                     </c>
                     <c ca="center">
                        <p>0.097</p>
                     </c>
                     <c ca="center">
                        <p>0.060</p>
                     </c>
                     <c ca="center">
                        <p>0.196</p>
                     </c>
                     <c ca="center">
                        <p>0.119</p>
                     </c>
                     <c ca="center">
                        <p>0.55</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>07</p>
                     </c>
                     <c ca="center">
                        <p>pBulge</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                     <c ca="center">
                        <p>0.036</p>
                     </c>
                     <c ca="center">
                        <p>0.169</p>
                     </c>
                     <c ca="center">
                        <p>0.209</p>
                     </c>
                     <c ca="center">
                        <p>0.64</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>08</p>
                     </c>
                     <c ca="center">
                        <p>pMutFreq</p>
                     </c>
                     <c ca="center">
                        <p>0.018</p>
                     </c>
                     <c ca="center">
                        <p>0.024</p>
                     </c>
                     <c ca="center">
                        <p>0.136</p>
                     </c>
                     <c ca="center">
                        <p>0.126</p>
                     </c>
                     <c ca="center">
                        <p>0.78</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>09</p>
                     </c>
                     <c ca="center">
                        <p>"=-"</p>
                     </c>
                     <c ca="center">
                        <p>0.043</p>
                     </c>
                     <c ca="center">
                        <p>0.029</p>
                     </c>
                     <c ca="center">
                        <p>0.035</p>
                     </c>
                     <c ca="center">
                        <p>0.031</p>
                     </c>
                     <c ca="center">
                        <p>0.13</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>"=="</p>
                     </c>
                     <c ca="center">
                        <p>0.649</p>
                     </c>
                     <c ca="center">
                        <p>0.081</p>
                     </c>
                     <c ca="center">
                        <p>0.466</p>
                     </c>
                     <c ca="center">
                        <p>0.089</p>
                     </c>
                     <c ca="center">
                        <p>1.08</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>"=:"</p>
                     </c>
                     <c ca="center">
                        <p>0.090</p>
                     </c>
                     <c ca="center">
                        <p>0.041</p>
                     </c>
                     <c ca="center">
                        <p>0.093</p>
                     </c>
                     <c ca="center">
                        <p>0.037</p>
                     </c>
                     <c ca="center">
                        <p>0.04</p>
                     </c>
                     <c ca="center">
                        <p>0.001</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>"--"</p>
                     </c>
                     <c ca="center">
                        <p>0.023</p>
                     </c>
                     <c ca="center">
                        <p>0.041</p>
                     </c>
                     <c ca="center">
                        <p>0.043</p>
                     </c>
                     <c ca="center">
                        <p>0.076</p>
                     </c>
                     <c ca="center">
                        <p>0.17</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>"-="</p>
                     </c>
                     <c ca="center">
                        <p>0.043</p>
                     </c>
                     <c ca="center">
                        <p>0.029</p>
                     </c>
                     <c ca="center">
                        <p>0.035</p>
                     </c>
                     <c ca="center">
                        <p>0.031</p>
                     </c>
                     <c ca="center">
                        <p>0.13</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>"^^"</p>
                     </c>
                     <c ca="center">
                        <p>0.008</p>
                     </c>
                     <c ca="center">
                        <p>0.021</p>
                     </c>
                     <c ca="center">
                        <p>0.090</p>
                     </c>
                     <c ca="center">
                        <p>0.130</p>
                     </c>
                     <c ca="center">
                        <p>0.54</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>"^="</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                     <c ca="center">
                        <p>0.018</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.026</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>"::"</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.102</p>
                     </c>
                     <c ca="center">
                        <p>0.086</p>
                     </c>
                     <c ca="center">
                        <p>0.45</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>":^"</p>
                     </c>
                     <c ca="center">
                        <p>0.014</p>
                     </c>
                     <c ca="center">
                        <p>0.018</p>
                     </c>
                     <c ca="center">
                        <p>0.044</p>
                     </c>
                     <c ca="center">
                        <p>0.026</p>
                     </c>
                     <c ca="center">
                        <p>0.69</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>":="</p>
                     </c>
                     <c ca="center">
                        <p>0.076</p>
                     </c>
                     <c ca="center">
                        <p>0.041</p>
                     </c>
                     <c ca="center">
                        <p>0.048</p>
                     </c>
                     <c ca="center">
                        <p>0.037</p>
                     </c>
                     <c ca="center">
                        <p>0.36</p>
                     </c>
                     <c ca="center">
                        <p>0.000</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Note: T tests are 2-tailed. The F value represents the discriminative power of the parameters. The 18 parameters were coded as. 1: Minimum Free Energy; 2: The difference of the MFE of the sequence pair; 3: The difference of the structure of the sequence pair; 4&#8211;7: Base pairing and other properties of the 22 mer hypothesized mature miRNA; 8: The mutation frequency of the sequence segment pair; 9&#8211;18: The frequency of the 10 possible secondary structure elements (combinations of 2 adjacent characters) in the pseudo code of stem region (represented by the new syntax). [See details in additional file <supplr sid="S1">1</supplr>].</p>
               </tblfn>
            </tbl>
            <p>Recent reports have shown that local sequence features, such as the distribution of the loops, are distinctly different between that of miRNA and non-miRNA hairpins. We improved the syntaxes proposed by Xue et al. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> to further elucidate the information of the local secondary structure [see additional file <supplr sid="S1">1</supplr> for details of the syntax]. We introduced five symbols "=", ":", ".", "-" and "^" (indicating states of paired, unpaired, insertion, deletion and bulge, respectively) to mark the states of each nucleotide in secondary structure prediction. The new syntax focused on the information of every two adjacent symbols. The frequency of each combination defines a set of novel and useful features (Table <tblr tid="T1">1</tblr>, Parameters 9&#8211;18). As an example, Figure <figr fid="F2">2</figr> illustrates how a hairpin is represented using the new syntax.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>An example of how a hairpin is represented using the new syntax</p>
               </caption>
               <text>
                  <p>An example of how a hairpin is represented using the new syntax. Symbols of "=", ":", ".", "-"and "^" indicate states of paired, unpaired, insertion, deletion and bulge, respectively. The frequency of each element (combinations of every two adjacent symbols) of the pseudo code of the structure will be used as input vectors for SVM.</p>
               </text>
               <graphic file="1471-2105-8-341-2"/>
            </fig>
            <p>A selection criterion, which has been used by Dror et al. is used to show the discriminative power of these parameters <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> (Table <tblr tid="T1">1</tblr>). The results show that these parameters represent important features for pre-miRNA prediction.</p>
         </sec>
         <sec>
            <st>
               <p>Dataset preparation for SVM model training and testing</p>
            </st>
            <p>Construction of the training datasets involved several steps. 1) Construction of positive training subsets. The positive training subsets contained about 4,000 pre-miRNA pairs. The pre-miRNA sequences of human, mouse, pig, cattle, dog and sheep collected from the miRBase (release 8.2) <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> were compared with each other to find the conserved pairs between any two species. The pairs of secondary structure containing multiple loops were eliminated from the datasets. 2) Construction of negative training subsets. The negative training subsets were constructed by the sequence segments extracted from UCSC genome pair-wise alignments (human, mouse) <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. We used a program that implemented the SW-like algorithm [see the algorithm in additional file <supplr sid="S1">1</supplr>] to scan the sequence segments that can fold to form hairpin secondary structures. About 10% of the sequence segments were extracted by a stratified selection to generate a subset. The sequences that contained experimentally confirmed pre-miRNAs were eliminated manually. The negative training subsets were constructed by randomly selecting about 4,000 sequence segments from the subset. [See the datasets in additional file <supplr sid="S2">2</supplr>].</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Datasets. This archive contains the training and testing datasets.</p>
               </text>
               <file name="1471-2105-8-341-S2.rar">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>We also created test datasets containing a negative subset simulating the background of the genome sequence and a positive subset containing homolog pre-miRNA pairs. The construction of the negative subset was based on earlier methods for computational problems described in the literature, co-mingling a set of non-miRNA genomic sequences from different species with a set of shuffling sequences <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. We tried to avoid an unbalanced case study by using a combination of each sequence type (6,193 chicken non-miRNA genomic sequences and 5,000 shuffling sequences). The positive subset (containing 500 homolog pre-miRNA pairs) was generated by a comparison of pre-miRNAs between different species. [See the datasets in additional file <supplr sid="S2">2</supplr>].</p>
         </sec>
         <sec>
            <st>
               <p>Development of new tool for pre-miRNA prediction</p>
            </st>
            <p>Utilizing the 18 parameters (Table <tblr tid="T1">1</tblr>), we developed a tool, called MiRFinder, to predict pre-miRNAs that are conserved in two genomes. There are three major steps built into the program (Figure <figr fid="F3">3</figr>). An algorithm based on the Smith-Waterman algorithm <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> was developed to quickly scan the genome pair-wise sequence to get the regions that have high potential to form a hairpin [see additional file <supplr sid="S1">1</supplr> for details]. The criteria for the selection were: a) a minimum length of the hairpin of 18 nucleotides (lowest number of base pairings of mature miRNA) and b) no multiple loops. The good loops were folded by a modified version of the Vienna RNA package <abbrgrp><abbr bid="B33">33</abbr></abbrgrp> to get all of the possible secondary structures. Hairpin loops were picked up, and the relevant punish scores corresponding to the 18 parameters were calculated based on the sequence information, MFE and secondary structure. The final classification of pre-mirRNAs from non-mirRNA hairpins was based on excluding non-robust structures by SVM scoring.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>The pipeline of the miRFinder</p>
               </caption>
               <text>
                  <p>The pipeline of the miRFinder. It consists of 5 steps: (1) Smith-Waterman like algorithm searches the genome of short hairpins; (2) The sequence is folded by RNALfold (Hofacker et al., 2004) to get the local structure; (3) the extended good loops is picked out by schLoop; (4) the good loops are re-folded by RNAfold (Zuker &amp; Stiegler, 1981) to get the MFE and secondary structure; (5) the Punish program calculates the punish score of each paired sequence segments; (6) the sequence is then predicted to be miRNAs or non-miRNA hairpins using the SVM (support vector machine) classifier.</p>
               </text>
               <graphic file="1471-2105-8-341-3"/>
            </fig>
            <p>The punish scores of 18 proposed parameters of the training datasets (see "dataset preparation for SVM model training and testing" section) were calculated to generate score datasets. The score datasets were split into two subsets (TS1, TS2), one for training and one for cross validation. Each subset included 1,500 positive samples and 1,500 negative samples selected from the score dataset by a random procedure. For each dataset, all parameters were scaled linearly from -1 to 1. The TS1 was used for the SVM model training. A SVM classification program, LIBSVM <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, was trained to generate a model to classify the loops as pre-miRNA or other sequences. A cross validation (CV) technique was used for the selection of the most suitable parameters for training.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>MirFinder can accurately distinguish miRNA and non-miRNA hairpins</p>
            </st>
            <p>The training of the model yielded an accuracy rate of 99.6% (radial basis function-kernel, g-0.125 c-32, five folds cross validation). The TS2 subset was subsequently used to test the model. The results show that the model could correctly assign 99.4% of the samples in TS2. The ROC cure analysis of the model showed that the AUC is approximately equal to 1 (Figure <figr fid="F4">4</figr>). The results show that our method had good performance distinguishing between miRNA and non-miRNA hairpins.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>The ROC-curve</p>
               </caption>
               <text>
                  <p>The ROC-curve. The solid line shows the ROC-curve for the miRFinder that was trained on miRNAs versus non-miRNA hairpins. The points for Sewer, ProMir, Triplet-SVM, miRNA-SVM, miPred, and RNAmicro are the sensitivities and specificities reported by Sewer et al. (2005), Nam et al. (2005), Xue et al. (2005), Hertel et al. (2006), Helvik et al. (2007) and Kwang Loong et al. (2007). The sensitivity and specificity of miRScan are 0.5 and 0.7, respectively, and are not included in the figure. Panel (A) is a detailed excerpt of Panel (B).</p>
               </text>
               <graphic file="1471-2105-8-341-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>An actual example: testing of the tool with aligned genome data from chicken/human and D. melanogaster/D. pseudoobscura comparisons</p>
            </st>
            <p>To test the performance of the tool in actual prediction, miRFinder was used to predict pre-miRNAs from chicken/human pair-wise genome alignments. The alignments were downloaded from the UCSC bioinformatics site <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The program was run on a desktop computer (1.8 GHZ CPU, WindowsXP and 256 M RAM). A total of 222 good candidates were obtained [score>0.9, see additional file <supplr sid="S1">1</supplr> figure 3A]. These candidates were aligned to the pre-miRNAs collected from miRBase <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. A total of 60 matched experimentally confirmed chicken pre-miRNAs were identified [with 86 experimentally confirmed pre-miRNAs that are highly conserved between the chicken and human genomes; the prediction match rate is 70% (60/86), see additional file <supplr sid="S1">1</supplr> figure 1A and additional file <supplr sid="S3">3</supplr> table 1]. In total, 159 sequence segments with high potential to be pre-miRNAs were detected by miRFinder [see additional file <supplr sid="S1">1</supplr> figure 1B and additional file <supplr sid="S3">3</supplr> table 1]. The prediction results of the chicken/human genome alignments showed that the tool has good performance. In our experience the tool is easy to operate and does not demand much computing power, thus it may be used for high throughput prediction.</p>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p>Supplemental table 1. Chicken/Human candidate miRNAs predicted by miRFinder.</p>
               </text>
               <file name="1471-2105-8-341-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>To test whether the miRFinder was suitable for organisms other than vertebrates, it was used to predict pre-miRNAs in D. melanogaster/D. pseudoobscura genome alignments. We obtained 188 good candidates [score>0.9, see additional file <supplr sid="S1">1</supplr> figure 3B], of which 34 matched experimentally confirmed miRNAs [see additional file <supplr sid="S4">4</supplr> table 2]. With about 73 pre-miRNAs highly conserved between the D. melanogaster and D. pseudoobscura genomes, the prediction results showed that the detection rate was 47% (34/73). Our results suggest that the tool can be implemented in the fly genome, but the performance was apparently worse than in the chicken genome.</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>Supplemental table 2. D.pseudoobscura/D.melanogaster pre-miRNA candidates detected by miRFinder.</p>
               </text>
               <file name="1471-2105-8-341-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Assessing the tool</p>
            </st>
            <p>In this study, we assessed the miRFinder along with other similar miRNA prediction tools, miRscan and triplet-SVM <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B35">35</abbr></abbrgrp>. The miRscan is one of the most well-known and widely used miRNA prediction software designed for miRNA prediction in the C. elegans/C. briggsae genomes <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. The triplet-SVM classifier is well regarded for distinguishing between miRNA and non-miRNA hairpins in animals, plants and other genomes, and was optimized for the human genome <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. These tools have relatively good performance. Some other tools also reported good performance, but they are methodologically different or not supported to scan genomes, such as ProMiR, and thus not included in this assessment.</p>
            <p>In assessing the tool, two major aspects were taken into consideration: 1) the false discriminative rates (the false positive rate) and 2) the detectable rate (the sensitivity). Each program was run with the test datasets on the default configuration settings.</p>
            <p>We used relatively small test datasets (see "dataset preparation for SVM model training and testing" section) to examine the performance of miRFinder and miRscan. The results of the miRFinder and miRscan trials are similar, to some extent. For the negative datasets the false discriminative rates of miRFinder and miRscan were 0.70% (79/11,193) and 0.23% (26/11,193), respectively. Interestingly, 11 sequences were recognized as good candidates by both of the software programs. However, for the positive datasets only 158 (158/500) sequences were recognized as good pre-miRNA candidates by miRScan, while over 99% of these pre-miRNAs were detected by miRFinder. These results are similar to the reports that the application of MiRscan for the C. elegans/C. briggsae genome analysis can detect only half of the 58 previously known miRNAs <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
            <p>For the 11,193 hairpin-like sequences derived from the partial sequences of the chicken genome, over 1,000 were recognized as good candidates by triplet-SVM. This result is similar to the evaluations of triplet-SVM classifier reported by Helvik et al. <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Compared with triplet-SVM, miRFinder reduced the number of the candidates to about 10%. Nevertheless, miRFinder was focused on the conserved pre-miRNAs and thus possibly missed the non-conserved pre-miRNAs.</p>
            <p>Noticeably, processing a large vertebrate genome for pre-miRNA prediction is time consuming. Test results revealed that miRFinder is faster than miRscan (hundreds of mega-bases per CPU hour compared to several mega-bases per CPU hour, respectively). For example, to process 530 sequences, miRFinder took only 40 seconds while miRscan took 215 seconds [see additional file <supplr sid="S1">1</supplr> figure 1E].</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>MirFinder can accurately distinguish between miRNA and non-miRNA hairpins. Compared to similar methods, our method has better performance. At sensitivity levels, mirFinder is comparable to methods, such as RNAmicro, that rely on sequence or structure conservation <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. Furthermore, our method reduces the number of candidates, which makes it more practical than others. A down side might be that the species specific pre-miRNAs could be lost since these miRNAs would be left out in the sequence alignment step before starting the prediction.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>Project Name: MiRFinder</p>
         <p>Project Home Page: <url>Http://www.bioinformatics.org/mirfinder/</url>. [Also see the application and source code in additional file <supplr sid="S5">5</supplr>].</p>
         <suppl id="S5">
            <title>
               <p>Additional file 5</p>
            </title>
            <text>
               <p>miRFinder. This archive contains application and source code of miRFinder.</p>
            </text>
            <file name="1471-2105-8-341-S5.rar">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
      <sec>
         <st>
            <p>Competing interests </p>
         </st>
         <p>The author(s) declares that there are no competing interests.</p>
         <p>Operating Systems: All platforms with GNU C++ compiler.</p>
         <p>Programming Languages: C++</p>
         <p>License: Academic Free License <url>http://www.opensource.org/licenses/academic.php</url>.</p>
         <p>Non-academics Restrictions: License needed</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>SHZ, MFR, BF and KL initiated the project and guided the forming of the ideas. THH developed the method and wrote the source code and implemented most of the experiments under the guide of SHZ. MFR and ZLH provided helpful insight in the method development and helped in the writing and assessment of the manuscript. All authors have read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>Financial support was provided by the National Natural Science Foundation of China (30300250, 30671138), Key Project of National Basic Research and Developmental Plan (2006CB102105) of China, the Hubei Province natural science creative team project (2006ABC008), and the Young Scientist Project of Wuhan. We thank Min Yao for assistance in preparing the data. We thank the editor for her help with English editing. Support for M. Rothschild and Z-L Hu was provided in part by USDA Pig Genome Coordination funds, the Iowa Agriculture and Home Economics Experiment Station, and Hatch and the State of Iowa funds.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>MicroRNA genes are transcribed by RNA polymerase II</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yeom</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Baek</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2004</pubdate>
            <volume>23</volume>
            <issue>20</issue>
            <fpage>4051</fpage>
            <lpage>4060</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">524334</pubid>
                  <pubid idtype="pmpid" link="fulltext">15372072</pubid>
                  <pubid idtype="doi">10.1038/sj.emboj.7600385</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Human microRNAs are processed from capped, polyadenylated transcripts that can also function as mRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Cai</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Hagedorn</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Cullen</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Rna</source>
            <pubdate>2004</pubdate>
            <volume>10</volume>
            <issue>12</issue>
            <fpage>1957</fpage>
            <lpage>1966</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370684</pubid>
                  <pubid idtype="pmpid" link="fulltext">15525708</pubid>
                  <pubid idtype="doi">10.1261/rna.7135204</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The nuclear RNase III Drosha initiates microRNA processing</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ahn</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Provost</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Radmark</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>VN</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>425</volume>
            <issue>6956</issue>
            <fpage>415</fpage>
            <lpage>419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01957</pubid>
                  <pubid idtype="pmpid" link="fulltext">14508493</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Nuclear export of microRNA precursors</p>
            </title>
            <aug>
               <au>
                  <snm>Lund</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Guttinger</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Calado</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dahlberg</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Kutay</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <issue>5654</issue>
            <fpage>95</fpage>
            <lpage>98</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1090599</pubid>
                  <pubid idtype="pmpid" link="fulltext">14631048</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Exportin-5 mediates the nuclear export of pre-microRNAs and short hairpin RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Yi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Qin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Macara</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Cullen</snm>
                  <fnm>BR</fnm>
               </au>
            </aug>
            <source>Genes &amp; development</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <issue>24</issue>
            <fpage>3011</fpage>
            <lpage>3016</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">305252</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681208</pubid>
                  <pubid idtype="doi">10.1101/gad.1158803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Exportin 5 is a RanGTP-dependent dsRNA-binding protein that mediates nuclear export of pre-miRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Bohnsack</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Czaplinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gorlich</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Rna</source>
            <pubdate>2004</pubdate>
            <volume>10</volume>
            <issue>2</issue>
            <fpage>185</fpage>
            <lpage>191</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370530</pubid>
                  <pubid idtype="pmpid" link="fulltext">14730017</pubid>
                  <pubid idtype="doi">10.1261/rna.5167604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Exportin-5 mediates nuclear export of minihelix-containing RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Gwizdek</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ossareh-Nazari</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brownawell</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Doglio</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bertrand</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Macara</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Dargemont</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>8</issue>
            <fpage>5505</fpage>
            <lpage>5508</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.C200668200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12509441</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A cellular function for the RNA-interference enzyme Dicer in the maturation of the let-7 small temporal RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Hutvagner</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>McLachlan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pasquinelli</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Balint</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zamore</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <issue>5531</issue>
            <fpage>834</fpage>
            <lpage>838</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1062961</pubid>
                  <pubid idtype="pmpid" link="fulltext">11452083</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Dicer functions in RNA interference and in synthesis of small RNA involved in developmental timing in C. elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Ketting</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Bernstein</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sijen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hannon</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Plasterk</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2001</pubdate>
            <volume>15</volume>
            <issue>20</issue>
            <fpage>2654</fpage>
            <lpage>2659</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">312808</pubid>
                  <pubid idtype="pmpid" link="fulltext">11641272</pubid>
                  <pubid idtype="doi">10.1101/gad.927801</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A role for the RNase III enzyme DCR-1 in RNA interference and germ line development in Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Knight</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Bass</snm>
                  <fnm>BL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>293</volume>
            <issue>5538</issue>
            <fpage>2269</fpage>
            <lpage>2271</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1855227</pubid>
                  <pubid idtype="pmpid" link="fulltext">11486053</pubid>
                  <pubid idtype="doi">10.1126/science.1062039</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Control of developmental timing by small temporal RNAs: a paradigm for RNA-mediated regulation of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Banerjee</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Slack</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2002</pubdate>
            <volume>24</volume>
            <issue>2</issue>
            <fpage>119</fpage>
            <lpage>129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.10046</pubid>
                  <pubid idtype="pmpid" link="fulltext">11835276</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Identification of novel genes coding for small expressed RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Lagos-Quintana</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rauhut</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lendeckel</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <issue>5543</issue>
            <fpage>853</fpage>
            <lpage>858</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1064921</pubid>
                  <pubid idtype="pmpid" link="fulltext">11679670</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>An extensive class of small RNAs in Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Ambros</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <issue>5543</issue>
            <fpage>862</fpage>
            <lpage>864</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065329</pubid>
                  <pubid idtype="pmpid" link="fulltext">11679672</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>An abundant class of tiny RNAs with probable regulatory roles in Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Lau</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Weinstein</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <issue>5543</issue>
            <fpage>858</fpage>
            <lpage>862</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065062</pubid>
                  <pubid idtype="pmpid" link="fulltext">11679671</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>MicroRNA identification based on sequence and structure alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Gu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Bioinformatics (Oxford, England)</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>18</issue>
            <fpage>3610</fpage>
            <lpage>3614</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti562</pubid>
                  <pubid idtype="pmpid" link="fulltext">15994192</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Computational identification of Drosophila microRNA genes</p>
            </title>
            <aug>
               <au>
                  <snm>Lai</snm>
                  <fnm>EC</fnm>
               </au>
               <au>
                  <snm>Tomancak</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>7</issue>
            <fpage>R42</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193629</pubid>
                  <pubid idtype="pmpid" link="fulltext">12844358</pubid>
                  <pubid idtype="doi">10.1186/gb-2003-4-7-r42</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes</p>
            </title>
            <aug>
               <au>
                  <snm>Bonnet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wuyts</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <issue>31</issue>
            <fpage>11511</fpage>
            <lpage>11516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">509231</pubid>
                  <pubid idtype="pmpid" link="fulltext">15272084</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404025101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Computational identification of plant microRNAs and their targets, including a stress-induced miRNA</p>
            </title>
            <aug>
               <au>
                  <snm>Jones-Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>6</issue>
            <fpage>787</fpage>
            <lpage>799</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2004.05.027</pubid>
                  <pubid idtype="pmpid" link="fulltext">15200956</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Human microRNA prediction through a probabilistic co-learning model of sequence and structure</p>
            </title>
            <aug>
               <au>
                  <snm>Nam</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Han</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>BT</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>11</issue>
            <fpage>3570</fpage>
            <lpage>3581</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1159118</pubid>
                  <pubid idtype="pmpid" link="fulltext">15987789</pubid>
                  <pubid idtype="doi">10.1093/nar/gki668</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Identification of clustered microRNAs using an ab initio prediction method</p>
            </title>
            <aug>
               <au>
                  <snm>Sewer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Paul</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Landgraf</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Aravin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pfeffer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brownstein</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>van Nimwegen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Zavolan</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>267</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1315341</pubid>
                  <pubid idtype="pmpid" link="fulltext">16274478</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-6-267</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Classification of real and pseudo microRNA precursors using local structure-sequence features and support vector machine</p>
            </title>
            <aug>
               <au>
                  <snm>Xue</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>BMC bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>310</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1360673</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381612</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-6-310</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Identification of microRNAs of the herpesvirus family</p>
            </title>
            <aug>
               <au>
                  <snm>Pfeffer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sewer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lagos-Quintana</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sheridan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sander</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Grasser</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>van Dyk</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>CK</fnm>
               </au>
               <au>
                  <snm>Shuman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chien</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Russo</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Ju</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Randall</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lindenbach</snm>
                  <fnm>BD</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Simon</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Ho</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Zavolan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tuschl</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature methods</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <issue>4</issue>
            <fpage>269</fpage>
            <lpage>276</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nmeth746</pubid>
                  <pubid idtype="pmpid" link="fulltext">15782219</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Hairpins in a Haystack: recognizing microRNA precursors in comparative genomics data</p>
            </title>
            <aug>
               <au>
                  <snm>Hertel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Bioinformatics (Oxford, England)</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>14</issue>
            <fpage>e197</fpage>
            <lpage>202</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl257</pubid>
                  <pubid idtype="pmpid" link="fulltext">16873472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Reliable prediction of Drosha processing sites improves microRNA gene prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Helvik</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Snove</snm>
                  <fnm>O</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Saetrom</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics (Oxford, England)</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>2</issue>
            <fpage>142</fpage>
            <lpage>149</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl570</pubid>
                  <pubid idtype="pmpid" link="fulltext">17105718</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>De Novo SVM Classification of Precursor MicroRNAs from Genomic Pseudo Hairpins Using Global and Intrinsic Folding Measures</p>
            </title>
            <aug>
               <au>
                  <snm>Kwang Loong</snm>
                  <fnm>SN</fnm>
               </au>
               <au>
                  <snm>Mishra</snm>
                  <fnm>SK</fnm>
               </au>
            </aug>
            <source>Bioinformatics (Oxford, England)</source>
            <pubdate>2007</pubdate>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Genomics of microRNA</p>
            </title>
            <aug>
               <au>
                  <snm>Kim</snm>
                  <fnm>VN</fnm>
               </au>
               <au>
                  <snm>Nam</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>3</issue>
            <fpage>165</fpage>
            <lpage>173</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.01.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">16446010</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phylogenetic shadowing and computational identification of human microRNA genes</p>
            </title>
            <aug>
               <au>
                  <snm>Berezikov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Guryev</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>van de Belt</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wienholds</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Plasterk</snm>
                  <fnm>RH</fnm>
               </au>
               <au>
                  <snm>Cuppen</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2005</pubdate>
            <volume>120</volume>
            <issue>1</issue>
            <fpage>21</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2004.12.031</pubid>
                  <pubid idtype="pmpid" link="fulltext">15652478</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Accurate identification of alternatively spliced exons using support vector machine</p>
            </title>
            <aug>
               <au>
                  <snm>Dror</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sorek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Shamir</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics (Oxford, England)</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>7</issue>
            <fpage>897</fpage>
            <lpage>901</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti132</pubid>
                  <pubid idtype="pmpid" link="fulltext">15531599</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>miRBase: microRNA sequences, targets and gene nomenclature</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Grocock</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Enright</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D140</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347474</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381832</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj112</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The UCSC Genome Browser Database: update 2006</p>
            </title>
            <aug>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>TS</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D590</fpage>
            <lpage>598</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347506</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381938</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj144</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Assessing computational tools for the discovery of transcription factor binding sites</p>
            </title>
            <aug>
               <au>
                  <snm>Tompa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bailey</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>De Moor</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Eskin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Favorov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Frith</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature biotechnology</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <issue>1</issue>
            <fpage>137</fpage>
            <lpage>144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1053</pubid>
                  <pubid idtype="pmpid" link="fulltext">15637633</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Identification of common molecular subsequences</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>TF</fnm>
               </au>
               <au>
                  <snm>Waterman</snm>
                  <fnm>MS</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1981</pubdate>
            <volume>147</volume>
            <issue>1</issue>
            <fpage>195</fpage>
            <lpage>197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0022-2836(81)90087-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">7265238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Expanded sequence dependence of thermodynamic parameters improves prediction of RNA secondary structure</p>
            </title>
            <aug>
               <au>
                  <snm>Mathews</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Sabina</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>DH</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>288</volume>
            <issue>5</issue>
            <fpage>911</fpage>
            <lpage>940</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.2700</pubid>
                  <pubid idtype="pmpid" link="fulltext">10329189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>LIBSVM: a library for support vector machines</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>C-C</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>C-J</fnm>
               </au>
            </aug>
            <pubdate>2001</pubdate>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The microRNAs of Caenorhabditis elegans</p>
            </title>
            <aug>
               <au>
                  <snm>Lim</snm>
                  <fnm>LP</fnm>
               </au>
               <au>
                  <snm>Lau</snm>
                  <fnm>NC</fnm>
               </au>
               <au>
                  <snm>Weinstein</snm>
                  <fnm>EG</fnm>
               </au>
               <au>
                  <snm>Abdelhakim</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Yekta</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rhoades</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Burge</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Bartel</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Genes Dev</source>
            <pubdate>2003</pubdate>
            <volume>17</volume>
            <issue>8</issue>
            <fpage>991</fpage>
            <lpage>1008</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">196042</pubid>
                  <pubid idtype="pmpid" link="fulltext">12672692</pubid>
                  <pubid idtype="doi">10.1101/gad.1074403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
