<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art><ui>1471-2164-14-S1-S2</ui><ji>1471-2164</ji><fm>
<dochead>Proceedings</dochead>
<bibl>
<title>
<p>A computational approach for identifying microRNA-target interactions using high-throughput CLIP and PAR-CLIP sequencing</p>
</title>
<aug>
<au ce="yes" id="A1"><snm>Chou</snm><fnm>Chih-Hung</fnm><insr iid="I1"/><email>chchou23@gmail.com</email></au>
<au ce="yes" id="A2"><snm>Lin</snm><fnm>Feng-Mao</fnm><insr iid="I1"/><email>kiralintw@gmail.com</email></au>
<au id="A3"><snm>Chou</snm><fnm>Min-Te</fnm><insr iid="I1"/><email>poi5303@gmail.com</email></au>
<au id="A4"><snm>Hsu</snm><fnm>Sheng-Da</fnm><insr iid="I1"/><email>ken.sd.hsu@gmail.com</email></au>
<au id="A5"><snm>Chang</snm><fnm>Tzu-Hao</fnm><insr iid="I2"/><email>kevinchang@tmu.edu.tw</email></au>
<au id="A6"><snm>Weng</snm><fnm>Shun-Long</fnm><insr iid="I1"/><insr iid="I3"/><insr iid="I4"/><insr iid="I5"/><insr iid="I6"/><email>a4467@ms7.mmh.org.tw</email></au>
<au id="A7"><snm>Shrestha</snm><fnm>Sirjana</fnm><insr iid="I1"/><email>sirju10@yahoo.co.in</email></au>
<au id="A8"><snm>Hsiao</snm><fnm>Chiung-Chih</fnm><insr iid="I1"/><email>chiungchih.hsiao@gmail.com</email></au>
<au ca="yes" id="A9"><snm>Hung</snm><fnm>Jui-Hung</fnm><insr iid="I1"/><insr iid="I3"/><email>juihunghung@gmail.com</email></au>
<au ca="yes" id="A10"><snm>Huang</snm><fnm>Hsien-Da</fnm><insr iid="I1"/><insr iid="I3"/><email>bryan@mail.nctu.edu.tw</email></au>
</aug>
<insg>
<ins id="I1"><p>Institute of Bioinformatics and Systems Biology, National Chiao Tung University, Hsin-Chu 300, Taiwan</p></ins>
<ins id="I2"><p>Graduate Institute of Biomedical Informatics, Taipei Medical University, Taiwan</p></ins>
<ins id="I3"><p>Department of Biological Science and Technology, National Chiao Tung University, Hsin-Chu 300, Taiwan</p></ins>
<ins id="I4"><p>Department of Obstetrics and Gynecology, Hsinchu Mackay Memorial Hospital, Hsinchu, Taiwan</p></ins>
<ins id="I5"><p>Mackay Medicine, Nursing and Management College, Taipei, Taiwan</p></ins>
<ins id="I6"><p>Department of Medicine, Mackay Medical College, New Taipei City, Taiwan</p></ins>
</insg>
<source>BMC Genomics</source>


<supplement><title><p>Selected articles from the Eleventh Asia Pacific Bioinformatics Conference (APBC 2013): Genomics</p></title><editor>Cenk Sahinalp and Steven Jones</editor><sponsor><note>Publication of this supplement was funded by the authors.</note></sponsor><note>Proceedings</note></supplement><conference><title><p>The Eleventh Asia Pacific Bioinformatics Conference (APBC 2013)</p></title><location>Vancouver, Canada</location><date-range>21-24 January 2013</date-range><url>http://apbc2013.org/</url></conference><issn>1471-2164</issn>
<pubdate>2013</pubdate>
<volume>14</volume>
<issue>Suppl 1</issue>
<fpage>S2</fpage>
<url>http://www.biomedcentral.com/1471-2164/14/S1/S2</url>
<xrefbib><pubidlist><pubid idtype="pmpid">23368412</pubid><pubid idtype="doi">10.1186/1471-2164-14-S1-S2</pubid></pubidlist></xrefbib>
</bibl>
<history><pub><date><day>21</day><month>1</month><year>2013</year></date></pub></history>
<cpyrt><year>2013</year><collab>Chou et al.; licensee BioMed Central Ltd.</collab><note>This is an open access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
<abs>
<sec>
<st>
<p>Abstract</p>
</st>
<sec>
<st>
<p>Background</p>
</st>
<p>MicroRNAs (miRNAs) play a critical role in down-regulating gene expression. By coupling with Argonaute family proteins, miRNAs bind to target sites on mRNAs and induce translational repression. A large number of miRNA-target interactions (MTIs) have been identified by crosslinking and immunoprecipitation (CLIP) and photoactivatable-ribonucleoside-enhanced CLIP (PAR-CLIP) combined with next-generation sequencing (NGS). PAR-CLIP achieves a high efficiency of RNA co-immunoprecipitation, but it also leads to T to C conversions in miRNA-RNA-protein crosslinking regions. This artifact reduces the mappability of reads. However, a dedicated tool for analyzing CLIP and PAR-CLIP data that takes T to C conversion into account is still needed.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<p>We herein propose the first CLIP and PAR-CLIP sequencing analysis platform designed specifically for miRNA target analysis, namely miRTarCLIP. Starting from raw reads, it automatically removes adaptor sequences, filters low-quality reads, reverts C to T, aligns reads to 3'UTRs, scans for read clusters, identifies high-confidence miRNA target sites, and provides annotations from external databases. With multi-threading and our novel C to T reversion procedure, miRTarCLIP greatly reduces the running time compared with conventional approaches. In addition, miRTarCLIP comes with a web-based interface that makes it easier to browse and search for targets of miRNAs of interest. To demonstrate the functionality of miRTarCLIP, we applied it to two publicly available CLIP and PAR-CLIP sequencing datasets. miRTarCLIP not only produces results comparable to those of existing tools at a much faster speed, but also reveals interesting features of these putative target sites. Specifically, we used miRTarCLIP to show that the T to C conversion rates within positions 1-7 and within positions 8-14 of miRNA target sites are significantly different (<it>p </it>value = 0.02), and that the difference is even more significant when focusing only on sites targeted by the top 102 highly expressed miRNAs (<it>p </it>value = 0.01). These results are consistent with previous findings and further suggest that combining miRNA expression and PAR-CLIP data can improve the accuracy of miRNA target prediction.</p>
</sec>
<sec>
<st>
<p>Conclusion</p>
</st>
<p>In summary, we devised a systematic approach for mining miRNA target sites from CLIP-seq and PAR-CLIP sequencing data, and integrated the workflow with a graphical web-based browser, which provides a user-friendly interface and detailed annotations of MTIs. We also showed through real-life examples that miRTarCLIP is a powerful tool for understanding miRNAs. Our integrated tool is freely accessible online at <url>http://miRTarCLIP.mbc.nctu.edu.tw</url>.</p>
</sec>
</sec>
</abs>
</fm><bdy>
<sec>
<st>
<p>Background</p>
</st>
<p>MicroRNAs (miRNAs) are endogenous non-coding RNA molecules, about 22 nucleotides long, that suppress target gene expression. Functional miRNAs typically form RNA-induced silencing complexes (RISCs) that hybridize to complementary sequences in the 3'-untranslated regions (3' UTRs) of target genes to either degrade mRNA molecules or suppress protein translation <abbrgrp>
<abbr bid="B1">1</abbr>
</abbrgrp>. In animals and plants, miRNAs regulate many cellular processes including cell proliferation, differentiation, apoptosis and development <abbrgrp>
<abbr bid="B2">2</abbr>
</abbrgrp>. miRNA regulation could be an etiological factor in many diseases, including cancer and neurological and cardiovascular disorders <abbrgrp>
<abbr bid="B3">3</abbr>
</abbrgrp>. Biologists have discovered that, in each miRNA, the second to seventh nucleotides (positions 2-7), called the "seed region", are indispensable for miRNA-target interactions (MTIs) <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. The seed region of a miRNA should be complementary to the 3' UTR sequence. So far, conventional approaches to verifying MTIs, such as the reporter assay, remain time-consuming and unsuited to large-scale screening.</p>
<p>Recent work has demonstrated that novel miRNAs, miRNA expression, and MTIs can be uncovered on a large scale using next-generation sequencing (NGS) technology. For example, miRDeep <abbrgrp>
<abbr bid="B5">5</abbr>
</abbrgrp> predicts novel miRNAs from NGS data according to a probabilistic model of miRNA biogenesis. Its newest version, miRDeep2 <abbrgrp>
<abbr bid="B6">6</abbr>
</abbrgrp>, reaches an accuracy of around 98.6%-99.9%. Additionally, several tools and web servers have been used to identify novel miRNAs or detect miRNA expression levels from NGS data, such as deepBase <abbrgrp>
<abbr bid="B7">7</abbr>
</abbrgrp>, Geoseq <abbrgrp>
<abbr bid="B8">8</abbr>
</abbrgrp>, miRanalyzer <abbrgrp>
<abbr bid="B9">9</abbr>
</abbrgrp>, SeqBuster <abbrgrp>
<abbr bid="B10">10</abbr>
</abbrgrp>, mirTools <abbrgrp>
<abbr bid="B11">11</abbr>
</abbrgrp>, DSAP <abbrgrp>
<abbr bid="B12">12</abbr>
</abbrgrp>, miRNAkey <abbrgrp>
<abbr bid="B13">13</abbr>
</abbrgrp> and miRExpress <abbrgrp>
<abbr bid="B14">14</abbr>
</abbrgrp>.</p>
<p>Ultraviolet (UV) crosslinking and immunoprecipitation (CLIP) is used to identify specific protein-RNA interactions. A functional miRNA is loaded into an Argonaute protein and then binds its target mRNA to silence gene expression. Hence, Argonaute-mRNA-miRNA complexes can be captured and verified through CLIP technology. Just as ChIP-seq studies protein-DNA interactions by high-throughput sequencing, CLIP-seq was developed to identify protein-RNA interactions by high-throughput sequencing. In 2009, Chi <it>et al.</it>
<abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp> pioneered the combination of CLIP with NGS to discover MTIs by capturing Argonaute proteins bound to mRNA molecules (i.e., targets) in mouse brain. Furthermore, Hafner <it>et al. </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> developed a modified CLIP method, namely Photoactivatable-Ribonucleoside-Enhanced Crosslinking and Immunoprecipitation (PAR-CLIP), to enhance the resolution of the original CLIP method. PAR-CLIP enhances protein-RNA crosslinking by incorporating a photoactivatable ribonucleoside (4-thiouridine, 4SU) into RNAs before UV irradiation. The resulting tighter binding yields a higher efficiency of RNA co-immunoprecipitation. However, it also leads to T to C conversion in the miRNA-RNA-protein crosslinking regions, because a crosslinked 4SU, incorporated in place of uridine (read as thymine in sequencing), can be misidentified as cytosine.</p>
<p>Recently, more and more research groups have investigated MTIs on a large scale using CLIP-seq <abbrgrp>
<abbr bid="B17">17</abbr>
<abbr bid="B18">18</abbr>
<abbr bid="B19">19</abbr>
<abbr bid="B20">20</abbr>
</abbrgrp>, and there are several databases, such as CLIPZ <abbrgrp>
<abbr bid="B21">21</abbr>
</abbrgrp>, starBase <abbrgrp>
<abbr bid="B22">22</abbr>
</abbrgrp>, doRiNA <abbrgrp>
<abbr bid="B23">23</abbr>
</abbrgrp>, and TarBase 6.0 <abbrgrp>
<abbr bid="B24">24</abbr>
</abbrgrp>, which compile publicly available CLIP and PAR-CLIP sequencing datasets and use their in-house software toolkits to analyze the raw data. Among them, only CLIPZ provides a free web-based analysis environment to the public, and users have to upload their data to the server, which is impractical given the huge size of the raw sequence files and limited internet bandwidth. As for standalone tools, PARalyzer <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> is so far the only one that focuses on PAR-CLIP dataset analysis, and its execution time is not satisfactory. In other words, only two publicly available tools are capable of analyzing CLIP and PAR-CLIP sequencing data, and neither was designed specifically for MTIs.</p>
<p>We herein propose the first CLIP and PAR-CLIP sequencing analysis platform designed specifically for miRNA target analysis, namely miRTarCLIP. We devised a unique C to T reversion step in its workflow to significantly reduce its running time, and included other novel features (see below) that extend miRTarCLIP's functionality. In addition, miRTarCLIP comes with a web-based interface that makes it easier to browse and search for targets of miRNAs of interest.</p>
</sec>
<sec>
<st>
<p>Results</p>
</st>
<sec>
<st>
<p>An overview of the miRTarCLIP system</p>
</st>
<p>miRTarCLIP consists of six steps (see Methods for details). It automatically removes adaptor sequences from raw reads, filters low-quality reads, reverts C to T, aligns reads to 3'UTRs, scans for read clusters, identifies high-confidence miRNA target sites, and provides annotations from external databases (Figure <figr fid="F1">1</figr> and Figure <figr fid="F2">2</figr>). All clusters, miRNA target sites, and annotations from external databases are automatically presented in a web-based browser generated from a template. The browser also provides a summary table of putative miRNA target sites with scores from TargetScan, target site locations, target gene annotations, and seed region types. In addition, the system uses multi-threading to improve performance.</p>
<fig id="F1"><title><p>Figure 1</p></title><caption><p>The system flow of miRTarCLIP</p></caption><text>
   <p><b>The system flow of miRTarCLIP</b>. The miRTarCLIP system flow consists of three parts: (A) preparation of the CLIP/PAR-CLIP sequencing data; (B) loading the raw data into the miRTarCLIP's core algorithms; and (C) presenting the analysis in a web-based browser.</p>
</text><graphic file="1471-2164-14-S1-S2-1"/></fig>
<fig id="F2"><title><p>Figure 2</p></title><caption><p>The miRTarCLIP's core algorithms</p></caption><text>
   <p><b>The miRTarCLIP's core algorithms</b>. miRTarCLIP automatically removes adaptor sequences from raw reads, filters low quality reads, reverts C to T, aligns reads to 3'UTRs, scans for read clusters, identifies high confidence miRNA target sites, and provides annotations from external databases.</p>
</text><graphic file="1471-2164-14-S1-S2-2"/></fig>
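<p>As a minimal sketch of the idea behind the C to T reversion step (an illustration only, not miRTarCLIP's actual implementation, which is described in Methods), the hypothetical Python functions below enumerate candidate reads in which a single C is reverted to T, so that a PAR-CLIP read carrying one T to C conversion can be matched exactly against a reference. The function names, the one-site-at-a-time strategy, and the toy sequences are all assumptions.</p>
<p><![CDATA[
```python
def revert_c_to_t(read):
    """Yield the read itself, then every variant with exactly one C reverted to T.

    Hypothetical helper: reverting one site at a time is an assumption made
    for illustration, not the published procedure.
    """
    yield read
    for i, base in enumerate(read):
        if base == "C":
            yield read[:i] + "T" + read[i + 1:]


def find_exact_hits(read, reference):
    """Return sorted (offset, candidate) pairs where a candidate occurs in the reference."""
    hits = set()
    for candidate in revert_c_to_t(read):
        start = reference.find(candidate)
        while start != -1:
            hits.add((start, candidate))
            start = reference.find(candidate, start + 1)
    return sorted(hits)


# Toy example (made-up sequences): the reference has T where the read has C.
reference = "GGATTGCATTAGCA"
read = "TGCACTAG"  # the C at index 4 stands in for a T to C conversion
hits = find_exact_hits(read, reference)
```
]]></p>
<p>In this toy case the unmodified read has no exact match, whereas the candidate with the C at index 4 reverted to T matches the reference at offset 4.</p>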
</sec>
<sec>
<st>
<p>The comparison with other CLIP-seq/PAR-CLIP databases and tools</p>
</st>
<p>As mentioned above, several databases and tools, including CLIPZ, doRiNA, starBase, and PARalyzer, analyze CLIP/PAR-CLIP sequencing datasets. Table <tblr tid="T1">1</tblr> lists the major differences among these resources for CLIP/PAR-CLIP data analysis. CLIPZ provides a web service environment for online analysis. PARalyzer <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> was the only stand-alone tool before this work, but it handles only PAR-CLIP data and does not provide a graphical interface. miRTarCLIP is implemented as a stand-alone tool that can analyze new CLIP-seq/PAR-CLIP data on users' local desktops. It provides high-confidence miRNA target sites with detailed information and presents them in a web-based interface.</p>
<tbl id="T1"><title><p>Table 1</p></title><caption><p>The comparison of miRTarCLIP with other related CLIP/PAR-CLIP sequencing resources</p></caption><tblbdy cols="6">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>CLIPZ</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>doRiNA</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>starBase</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>PARalyzer</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>miRTarCLIP</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="6">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Resource type</b>
            </p>
         </c>
         <c ca="center">
            <p>Database/Web tool</p>
         </c>
         <c ca="center">
            <p>Database</p>
         </c>
         <c ca="center">
            <p>Database</p>
         </c>
         <c ca="center">
            <p>Standalone tool</p>
         </c>
         <c ca="center">
            <p>Standalone tool</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Data type</b>
            </p>
         </c>
         <c ca="center">
            <p>Both</p>
         </c>
         <c ca="center">
            <p>Both</p>
         </c>
         <c ca="center">
            <p>Both</p>
         </c>
         <c ca="center">
            <p>PAR-CLIP</p>
         </c>
         <c ca="center">
            <p>Both</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Mapping reference</b>
            </p>
         </c>
         <c ca="center">
            <p>Genome/transcript</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>-</p>
         </c>
         <c ca="center">
            <p>Genome</p>
         </c>
         <c ca="center">
            <p>23way transcript 3'UTR</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>miRNA-target</b>
            </p>
            <p>
               <b>interaction</b>
            </p>
         </c>
         <c ca="center">
            <p>ElMMo <abbrgrp><abbr bid="B35">35</abbr></abbrgrp></p>
         </c>
         <c ca="center">
            <p>PicTar <abbrgrp><abbr bid="B36">36</abbr></abbrgrp></p>
         </c>
         <c ca="center">
            <p>Seed-rule <abbrgrp><abbr bid="B29">29</abbr></abbrgrp></p>
         </c>
         <c ca="center">
            <p>Seed-rule <abbrgrp><abbr bid="B29">29</abbr></abbrgrp></p>
         </c>
         <c ca="center">
            <p>TargetScan, <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp></p>
            <p>miRTarBase <abbrgrp><abbr bid="B31">31</abbr></abbrgrp></p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Visualization Browser</b>
            </p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
         <c ca="center">
            <p>No</p>
         </c>
         <c ca="center">
            <p>Yes</p>
         </c>
      </r>
   </tblbdy></tbl>
<p>Most distinctively, miRTarCLIP performs a C to T reversion in its workflow for PAR-CLIP datasets, which works together with multithreading to significantly reduce the running time. After mapping reverted reads to 3' UTRs (see Methods), miRTarCLIP clusters reads to search for possible miRNA target sites and uses TargetScan to identify the miRNAs that target them. If a candidate miRNA and its target sites have been experimentally verified according to miRTarBase, the system ranks these MTIs at the top of the list in the web-based browser.</p>
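<p>The read clustering idea can be sketched as follows. This is a hedged illustration rather than miRTarCLIP's own procedure: it assumes that reads aligned to one 3' UTR are given as half-open (start, end) intervals, that overlapping reads belong to the same cluster, and that a hypothetical <it>min_reads</it> threshold discards weakly supported clusters. The coordinates are made up.</p>
<p><![CDATA[
```python
def cluster_reads(alignments, min_reads=2):
    """Merge overlapping read alignments into clusters.

    alignments: iterable of (start, end) positions on one 3' UTR, end exclusive.
    Returns a list of (cluster_start, cluster_end, read_count) tuples for
    clusters supported by at least min_reads reads (hypothetical threshold).
    """
    clusters = []
    for start, end in sorted(alignments):
        if clusters and start < clusters[-1][1]:
            # The read overlaps the current cluster: extend it and count the read.
            prev_start, prev_end, count = clusters[-1]
            clusters[-1] = (prev_start, max(prev_end, end), count + 1)
        else:
            # No overlap: open a new cluster.
            clusters.append((start, end, 1))
    return [c for c in clusters if c[2] >= min_reads]


# Toy alignments with made-up coordinates.
reads = [(100, 130), (105, 135), (110, 132), (400, 430), (900, 930), (910, 940)]
clusters = cluster_reads(reads)
```
]]></p>
<p>With these toy reads, the isolated read at 400-430 is dropped and two clusters remain, which would then be scanned for miRNA seed matches.</p>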
</sec>
<sec>
<st>
<p>Applying miRTarCLIP to a CLIP-seq dataset</p>
</st>
<p>To demonstrate how our system works on CLIP-seq data, we applied it to a published dataset. Additional file <supplr sid="S1">1</supplr> shows the web interface of miRTarCLIP analysing a CLIP-seq dataset from Chi et al. <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp> (BrainA_130_50_fastq). In Additional file <supplr sid="S1">1</supplr>, Lamc1 and mmu-miR-124 were input in the "Gene Symbol" box and the "miRNA name" box respectively. Lamc1 and miR-124 were chosen because this MTI (miR-124::Lamc1) was experimentally verified by Chi et al. <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp>. Figure <figr fid="F3">3</figr> summarizes the complete annotations and visualization results. In Figure <figr fid="F3">3A</figr>, all possible miRNA target sites in a read cluster are shown with the miRNA seeds on top. In this case, the read cluster in the Lamc1 3' UTR (position 2418-2449) suggests a candidate AGO-Lamc1-miRNA ternary complex. According to miRNA expression and the context scores given by TargetScan, miRTarCLIP ranks mmu-miR-124a as the miRNA most likely to be involved in this MTI, as anticipated. Figure <figr fid="F3">3B</figr> gives the locations of miRNA target sites (i.e., miRNA_start and miRNA_end) and the context score from TargetScan.</p>
<suppl id="S1">
<title>
<p>Additional file 1</p>
</title>
<text>
<p>
<b>The web-based browser interface of the miRTarCLIP system</b>.</p>
</text>
<file name="1471-2164-14-S1-S2-S1.doc">
   <p>Click here for file</p>
</file>
</suppl>
<fig id="F3"><title><p>Figure 3</p></title><caption><p>mmu-miR-124a targets Lamc1 in the Chi et al. CLIP dataset</p></caption><text>
   <p><b>mmu-miR-124a targets Lamc1 in the Chi et al. CLIP dataset</b>. (A) A read cluster in the Lamc1 3' UTR (position 2418-2449) indicates a candidate AGO-Lamc1-miRNA ternary complex (shown as the red sequence within the green rectangle). Above it, a stack of miRNA seed sequences is shown according to TargetScan. The seed of miR-124 is highlighted in a red box. All the reads of this cluster are aligned underneath the 3' UTR sequence of Lamc1. Red letters in reads are mismatches. (B) Detailed positions and TargetScan context scores of MTIs. According to TargetScan, "seed match" 1 indicates 7mer-A1, which means a perfect match at positions 2-7 of the mature miRNA and an A at position 1 of the mRNA target site (as defined by TargetScan). Other TargetScan scores, such as local AU, position, TA, SPS, context+ score, and score percentile, are also defined by TargetScan 6.2.</p>
</text><graphic file="1471-2164-14-S1-S2-3"/></fig>
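<p>The seed match types used in these annotations can be illustrated with a short sketch. The hypothetical function below classifies an 8-nucleotide window of a 3' UTR (written 5' to 3' in RNA letters) against a miRNA following the TargetScan site definitions: 7mer-A1 (match at miRNA positions 2-7 plus an A opposite position 1), 7mer-m8 (match at positions 2-8), and 8mer (both). It is not TargetScan's or miRTarCLIP's code; the function names and test windows are assumptions, and the miR-124 sequence used as toy input should be checked against miRBase before any real use.</p>
<p><![CDATA[
```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Reverse complement of an RNA string written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def classify_seed_match(mirna, window):
    """Classify an 8-nt UTR window (5' to 3') against a miRNA (5' to 3').

    window[0] pairs with miRNA position 8, window[1:7] with positions 7..2,
    and window[7] lies opposite miRNA position 1.
    Returns "8mer", "7mer-m8", "7mer-A1", or None (6mer sites are not reported).
    """
    if len(window) != 8:
        raise ValueError("window must be 8 nucleotides long")
    if window[1:7] != reverse_complement(mirna[1:7]):
        return None
    match_8 = window[0] == COMPLEMENT[mirna[7]]
    a1 = window[7] == "A"
    if match_8 and a1:
        return "8mer"
    if match_8:
        return "7mer-m8"
    if a1:
        return "7mer-A1"
    return None


# Toy input: commonly reported miR-124 sequence (verify against miRBase).
MIR_124 = "UAAGGCACGCGGUGAAUGCC"
site_types = [classify_seed_match(MIR_124, w)
              for w in ("GUGCCUUA", "GUGCCUUG", "CUGCCUUA", "GUGACUUA")]
```
]]></p>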
</sec>
<sec>
<st>
<p>Applying miRTarCLIP to a PAR-CLIP sequencing dataset</p>
</st>
<p>We took the AGO1 PAR-CLIP sequencing dataset (SRR048973) from Hafner et al. <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> as an example. According to Hafner <it>et al</it>. <abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>, miR-103a is a highly expressed miRNA and it targets PAG1. Hafner <it>et al. </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> reported a high T to C conversion rate in the region between the 8<sup>th</sup> and 13<sup>th</sup> nucleotides of miRNA target sites. miRTarCLIP identified the position with the most T to C conversions within the same region (position 9 in this case) (Figure <figr fid="F4">4</figr>). The system also provides multiple sequence alignments for visualizing conserved target sites among 23 species (Additional file <supplr sid="S2">2</supplr>). In this case, the miR-103a target sites in PAG1 are clearly conserved, but they are less likely to be targeted in rat because this region is not shown for rat in the alignment (see Additional file <supplr sid="S2">2</supplr>; 10116 is the taxonomy ID of rat). Figures <figr fid="F3">3</figr> and <figr fid="F4">4</figr> indicate that miRTarCLIP can produce results similar to those of the original studies and provide novel insights into MTIs.</p>
<fig id="F4"><title><p>Figure 4</p></title><caption><p>hsa-miR-103a targets PAG1 in the Hafner et al. PAR-CLIP dataset</p></caption><text>
   <p><b>hsa-miR-103a targets PAG1 in the Hafner et al. PAR-CLIP dataset</b>. The layout is the same as in Figure 3. (A) Specific to PAR-CLIP datasets, green letters in reads denote T to C conversion sites. The site with the highest conversion ratio is marked with the purple box. (B) Seed match 2 means a perfect match at positions 2-8 of the mature miRNA.</p>
</text><graphic file="1471-2164-14-S1-S2-4"/></fig>
<suppl id="S2">
<title>
<p>Additional file 2</p>
</title>
<text>
<p>
<b>The multiple species sequence alignment viewer</b>.</p>
</text>
<file name="1471-2164-14-S1-S2-S2.doc">
   <p>Click here for file</p>
</file>
</suppl>
</sec>
<sec>
<st>
<p>Statistics of T to C conversion sites in the Hafner <it>et al. </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> PAR-CLIP sequencing dataset</p>
</st>
<p>PAR-CLIP achieves a higher efficiency of RNA co-immunoprecipitation than regular CLIP. PAR-CLIP incorporates 4-thiouridine (4SU) into transcripts and applies UV irradiation to enhance the crosslinking between proteins and RNAs, but it also produces artificial T to C conversions. Reads carrying these errors are difficult to map. Therefore, an existing tool, PARalyzer <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>, allows a read to have two mismatches against the reference. However, this dramatically increases the search space and the time needed to find a good match, and in some cases it could lead to mistaken mappings (see Conclusions and discussion).</p>
<p>Hafner <it>et al. </it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> and PARalyzer's authors <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> indicated in their works that the ratio of T to C conversion is high at positions 8 to 13 of the target sites. This high ratio is considered a clear sign of real miRNA target sites in PAR-CLIP data. To confirm this, we compared the T to C conversion rate within positions 1-7 with that within positions 8-14 of miRNA target sites. The results indicate that the T to C conversion rate differs significantly between these two regions (<it>p </it>value = 0.02, one-tailed Student's t-test; see Figure <figr fid="F5">5A</figr>, Additional file <supplr sid="S3">3</supplr>, Table <tblr tid="T2">2</tblr>). To further understand the association between T to C conversion levels and high-confidence MTIs, we then considered only highly expressed miRNAs and their target sites. The results show that the T to C conversion rates differ even more strongly between these regions (<it>p </it>value = 0.01; see Figure <figr fid="F5">5B</figr>, Table <tblr tid="T2">2</tblr> and Additional file <supplr sid="S3">3</supplr>). These two results suggest that incorporating miRNA expression can reduce false positives when searching for miRNA targets. The rules of miRNA target prediction usually put constraints on sequence conservation and miRNA seed regions <abbrgrp>
<abbr bid="B4">4</abbr>
</abbrgrp>. Therefore, we also tested whether conservation and seed regions play a role here. The analysis indicates that none of the nonconserved sets (i.e., N78, N8, N7; see Figure <figr fid="F5">5</figr>, Table <tblr tid="T2">2</tblr>), nor the CN7 set for total miRNAs, exhibits a significant difference (<it>p </it>value &gt; 0.05) (Figure <figr fid="F5">5A</figr>, Table <tblr tid="T2">2</tblr> and Additional file <supplr sid="S3">3</supplr>). These results suggest the importance of seeds and conservation. The above results are consistent with the finding that T to C conversion is located in non-complementary regions of the ternary AGO complex <abbrgrp>
<abbr bid="B16">16</abbr>
<abbr bid="B26">26</abbr>
</abbrgrp>.</p>
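<p>To make the comparison concrete, the sketch below computes a two-sample Student's t statistic (pooled variance) between conversion ratios in positions 8-14 and positions 1-7. It is only an illustration under stated assumptions: the ratios are made-up toy values rather than data from this study, the choice of per-position ratios as the unit of observation is an assumption, and the one-tailed <it>p </it>value, which requires the t distribution and is not part of the Python standard library, is left out.</p>
<p><![CDATA[
```python
from statistics import mean, variance


def student_t(sample_a, sample_b):
    """Two-sample Student's t statistic (pooled variance) and degrees of freedom."""
    n_a, n_b = len(sample_a), len(sample_b)
    df = n_a + n_b - 2
    pooled = ((n_a - 1) * variance(sample_a) + (n_b - 1) * variance(sample_b)) / df
    t = (mean(sample_a) - mean(sample_b)) / (pooled * (1 / n_a + 1 / n_b)) ** 0.5
    return t, df


# Made-up T to C conversion ratios, for illustration only.
ratios_8_14 = [0.30, 0.42, 0.38, 0.35, 0.29, 0.33, 0.31]
ratios_1_7 = [0.12, 0.18, 0.15, 0.20, 0.16, 0.14, 0.17]
t_stat, df = student_t(ratios_8_14, ratios_1_7)
```
]]></p>
<p>A positive t statistic supports the one-tailed hypothesis that conversion is higher in positions 8-14. Converting it to a <it>p </it>value needs the t distribution with <it>df</it> degrees of freedom, for instance from a statistics package.</p>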
<fig id="F5"><title><p>Figure 5</p></title><caption><p>Comparison of T to C conversion ratio between 8-14 mer and 1-7mer target sites in the Hafner et al. PAR-CLIP sequencing data</p></caption><text>
   <p><b>Comparison of T to C conversion ratio between 8-14 mer and 1-7mer target sites in the Hafner et al. PAR-CLIP sequencing data</b>. <b>C</b>: <b>C</b>onserved, <b>N</b>: <b>N</b>onconserved, <b>7</b>: <b>7</b>mer seed matching, <b>8</b>: <b>8</b>mer seed matching. For example, the CN78 group consists of miRNA target sites within conserved and nonconserved UTRs with both 7mer and 8mer matching. In panels J to R, we used only the top 102 expressed miRNAs (from Hafner et al.) to calculate the ratios. (A) All miRNAs. (B) Top 102 expressed miRNAs. Asterisks indicate a significant difference between positions 8-14 and 1-7 (<it>p </it>value &lt; 0.05).</p>
</text><graphic file="1471-2164-14-S1-S2-5"/></fig>
<suppl id="S3">
<title>
<p>Additional file 3</p>
</title>
<text>
<p>
<b>The distribution of T to C conversion ratio around target sites in the Hafner et al. PAR-CLIP sequencing data</b>.</p>
</text>
<file name="1471-2164-14-S1-S2-S3.doc">
   <p>Click here for file</p>
</file>
</suppl>
<tbl id="T2"><title><p>Table 2</p></title><caption><p>Comparison of the T to C conversion ratio in different MTI sets</p></caption><tblbdy cols="3">
      <r>
         <c>
            <p/>
         </c>
         <c ca="center">
            <p>
               <b>Total miRNA</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>Highly expressed miRNA</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="3">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>CN78</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.020</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.011</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>CN8</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.008</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.010</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>CN7</b>
            </p>
         </c>
         <c ca="center">
            <p>0.137</p>
         </c>
         <c ca="center">
            <p>
               <b>0.047</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>C78</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.004</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.013</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>C8</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.014</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.016</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>C7</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.014</b>
            </p>
         </c>
         <c ca="center">
            <p>
               <b>0.030</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>N78</b>
            </p>
         </c>
         <c ca="center">
            <p>0.210</p>
         </c>
         <c ca="center">
            <p>0.055</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>N8</b>
            </p>
         </c>
         <c ca="center">
            <p>0.050</p>
         </c>
         <c ca="center">
            <p>0.055</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>N7</b>
            </p>
         </c>
         <c ca="center">
            <p>0.357</p>
         </c>
         <c ca="center">
            <p>0.096</p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p>MTI sets marked in bold have higher T to C conversion ratios in positions 8-14 (<it>p </it>value &lt; 0.05. See Figure 5)</p>
   </tblfn></tbl>
</sec>
</sec>
<sec>
<st>
<p>Conclusions and discussion</p>
</st>
<p>This work develops an integrated approach to analyzing CLIP/PAR-CLIP sequencing data in order to identify miRNA target sites. Users can study miRNAs or genes/transcripts of interest via a web-based interface. Moreover, the entire source code of miRTarCLIP is freely available on the internet for bioinformatics experts to improve and extend our system.</p>
<p>Compared with other strategies that allow 2 mismatches in mapping (e.g., PARalyzer <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>), this study introduces a C to T reversion step that tolerates 1 mismatch, which reduces the computational cost and the number of mistaken mappings. This approach (see Methods) is not free from wrong alignments, but because it introduces only one type of variant (T/C), the chance of a wrong alignment is only a fraction of that of PARalyzer <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp> (allowing one more mismatch actually introduces all pairwise combinations of the four nucleotides).</p>
<p>Compared with the original study (Hafner <it>et al.</it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp>), this study obtains similar results regarding the statistics of the T to C conversion ratio between specific regions. Our analysis further indicates that the regions with high conversion frequency are outside the seed regions in the conserved targets (Figure <figr fid="F6">6C</figr>). The association between T to C conversion levels and high-confidence MTIs was also investigated using miRTarCLIP. More experimental evidence is needed to clarify the underlying biology.</p>
<fig id="F6"><title><p>Figure 6</p></title><caption><p>The distribution of mismatch ratio in the Hafner et al. PAR-CLIP sequencing dataset</p></caption><text>
   <p><b>The distribution of mismatch ratio in the Hafner et al. PAR-CLIP sequencing dataset</b>. The red lines indicate the miRNA seed regions.</p>
</text><graphic file="1471-2164-14-S1-S2-6"/></fig>
<p>More than 2,000 miRNAs have been discovered in humans (according to miRBase version 19), but fewer than 300 of them have documented MTIs (according to miRTarBase version 2.5). Large-scale technologies for discovering MTIs, such as CLIP-seq and PAR-CLIP-seq, will play a key role in miRNA-related studies. We believe miRTarCLIP will be an important resource for the community to reveal more mechanisms of miRNA post-transcriptional regulation.</p>
</sec>
<sec>
<st>
<p>Materials and methods</p>
</st>
<sec>
<st>
<p>CLIP-seq and PAR-CLIP datasets</p>
</st>
<p>Chi <it>et al</it>. <abbrgrp>
<abbr bid="B15">15</abbr>
</abbrgrp> analyzed MTIs in mouse brain tissue by combining CLIP with high-throughput sequencing. Hafner <it>et al.</it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> modified the original CLIP method by incorporating 4-thiouridine (4SU) into transcripts to increase the efficiency of crosslinking and to provide high resolution of protein-RNA binding sites. The raw data for AGO1-AGO4 can be obtained from the Gene Expression Omnibus (GEO: GSM545212, GSM545213, GSM545214, GSM545215). The raw sequencing data of these two studies are used in the miRTarCLIP system.</p>
</sec>
<sec>
<st>
<p>Information of miRNA and miRNA targets</p>
</st>
<p>The miRNA-related information, including accessions and miRNA sequences, was obtained from miRBase release 18 <abbrgrp>
<abbr bid="B27">27</abbr>
<abbr bid="B28">28</abbr>
</abbrgrp>. Because miRNA names are inconsistent among different versions of miRBase, miRNA indexes were created to replace the names. The miRNA target predictions and 3' UTR data were obtained from TargetScan release 6.2 <abbrgrp>
<abbr bid="B29">29</abbr>
<abbr bid="B30">30</abbr>
</abbrgrp>. The experimentally confirmed MTIs were collected from miRTarBase release 2.5 <abbrgrp>
<abbr bid="B31">31</abbr>
</abbrgrp>, which was developed previously by our group.</p>
</sec>
<sec>
<st>
<p>miRTarCLIP analysis pipeline</p>
</st>
<p>Figure <figr fid="F2">2</figr> illustrates the analysis flow of the miRTarCLIP pipeline. The FASTX-Toolkit <abbrgrp>
<abbr bid="B32">32</abbr>
</abbrgrp>, SRA-Toolkit <abbrgrp>
<abbr bid="B33">33</abbr>
</abbrgrp>, and bowtie <abbrgrp>
<abbr bid="B34">34</abbr>
</abbrgrp> were incorporated into the pipeline. The pipeline has six steps: (1) adapter trimming, (2) quality control, (3) C to T reversion, (4) read alignment, (5) cluster analysis, and (6) MTI identification analysis. We also take advantage of multi-threading to improve performance.</p>
</sec>
<sec>
<st>
<p>Step 1: adapter trimming for sequencing reads</p>
</st>
<p>This step removes the adapter sequence, if any, at the 3' end of each read. A trimmed read is discarded if it is shorter than 15 nucleotides or contains any ambiguous nucleotides.</p>
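<p>The rule above can be sketched as follows. This is a minimal illustration, not the FASTX-Toolkit implementation used by the pipeline: it handles exact adapter matches only, and the <it>min_overlap</it> parameter for partial adapters at the 3' end is an assumption that does not appear in the text.</p>

```python
from typing import Optional

MIN_LENGTH = 15  # minimum read length stated in the text


def trim_adapter(read: str, adapter: str, min_overlap: int = 5) -> Optional[str]:
    """Remove a 3' adapter, then apply the length and ambiguity filters.

    Returns the trimmed read, or None when the read is discarded.
    """
    read = read.upper()
    adapter = adapter.upper()
    cut = read.find(adapter)  # complete adapter anywhere in the read
    if cut == -1:
        # Otherwise look for an adapter prefix hanging off the 3' end,
        # longest prefix first, down to min_overlap nucleotides.
        longest = min(len(adapter), len(read)) - 1
        for k in range(longest, min_overlap - 1, -1):
            if read.endswith(adapter[:k]):
                cut = len(read) - k
                break
    trimmed = read if cut == -1 else read[:cut]
    too_short = MIN_LENGTH > len(trimmed)
    ambiguous = bool(set(trimmed) - set("ACGT"))  # e.g. N
    return None if too_short or ambiguous else trimmed
```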
</sec>
<sec>
<st>
<p>Step 2: quality control of sequencing reads</p>
</st>
<p>Following the adapter trimming step, we scan the quality at the tail of each read. The elimination rules are based on the phred quality score: nucleotides at the 3' end are removed when their phred scores are lower than 20. As in Step 1, a read is discarded if it is shorter than 15 nucleotides after the tail trimming. Reads with identical sequences are collapsed into one so that duplicates are not mapped repeatedly.</p>
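<p>A minimal sketch of the tail trimming and collapsing rules is given below. It assumes phred+33 quality encoding, which the text does not specify, and the function names are hypothetical.</p>

```python
from collections import Counter
from typing import Dict, Iterable, Optional, Tuple

MIN_PHRED = 20   # quality threshold stated in the text
MIN_LENGTH = 15  # minimum read length stated in the text


def trim_tail(seq: str, quals: str, offset: int = 33) -> Optional[str]:
    """Strip 3'-end bases whose phred score is below MIN_PHRED.

    Only the tail is scanned; low-quality bases inside the read are kept.
    Returns None when the remaining read is shorter than MIN_LENGTH.
    """
    end = len(seq)
    while end > 0 and MIN_PHRED > ord(quals[end - 1]) - offset:
        end -= 1
    return seq[:end] if end >= MIN_LENGTH else None


def collapse(reads: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Trim every (sequence, quality) pair and count identical sequences."""
    counts: Counter = Counter()
    for seq, quals in reads:
        kept = trim_tail(seq, quals)
        if kept is not None:
            counts[kept] += 1
    return dict(counts)
```

<p>Keeping the copy number of each collapsed sequence, as this sketch does, allows read counts to be restored after mapping.</p>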
</sec>
<sec>
<st>
<p>Step 3: cytosine to thymine reversion for PAR-CLIP data</p>
</st>
<p>PAR-CLIP incorporates 4-thiouridine (4SU) into transcripts, which causes thymidine to cytidine transitions at the RNA-binding protein sites. For each cytidine in a read, this step creates a new read with that C converted to T. For instance, a read sequence, AATG<b>
<ul>C</ul>
</b>T<b>
<ul>C</ul>
</b>AATGG<b>
<ul>C</ul>
</b>GA, will be converted to AATG<b>
<ul>T</ul>
</b>T<b>
<ul>C</ul>
</b>AATGG<b>
<ul>C</ul>
</b>GA, AATG<b>
<ul>C</ul>
</b>T<b>
<ul>T</ul>
</b>AATGG<b>
<ul>C</ul>
</b>GA, and AATG<b>
<ul>C</ul>
</b>T<b>
<ul>C</ul>
</b>AATGG<b>
<ul>T</ul>
</b>GA. All four sequences (i.e., one original read and three converted reads) are aligned against the references.</p>
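<p>The reversion rule can be written in a few lines. The sketch below is an illustration of the rule as described, not the pipeline's own program; the function name is hypothetical.</p>

```python
from typing import List


def c_to_t_variants(read: str) -> List[str]:
    """Return the original read followed by one variant per C,
    each variant having exactly that C replaced by T."""
    variants = [read]
    for i, base in enumerate(read):
        if base == "C":
            variants.append(read[:i] + "T" + read[i + 1:])
    return variants


example = c_to_t_variants("AATGCTCAATGGCGA")
```

<p>For the example read above, <it>example</it> holds the original sequence followed by AATGTTCAATGGCGA, AATGCTTAATGGCGA, and AATGCTCAATGGTGA.</p>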
</sec>
<sec>
<st>
<p>Step 4: aligning sequencing reads against reference sequences</p>
</st>
<p>miRNAs target mRNAs at 3' UTRs, so instead of aligning reads to the entire genome, we use exclusively the 3' UTR sequences from TargetScan. The reads are mapped with at most one mismatch. Other tools, such as PARalyzer <abbrgrp>
<abbr bid="B25">25</abbr>
</abbrgrp>, allow two mismatches to address the T to C conversions in PAR-CLIP datasets. However, allowing two mismatches in mapping (e.g., using <b>bowtie</b>) is time consuming and error-prone. A better strategy is to revert C back to T in the reads (as described in Step 3) and align them to the references with at most one mismatch, which reduces the computational cost. We tested this strategy on published PAR-CLIP data from Hafner <it>et al.</it>
<abbrgrp>
<abbr bid="B16">16</abbr>
</abbrgrp> (<ul>SRX020783</ul>) and confirmed that C to T reversion combined with a one-mismatch tolerance in bowtie is more efficient than mapping directly with two mismatches. Computation time was roughly halved (146.7 s versus 301.7 s) and the number of output reports was 0.64 times that of two-mismatch mapping, even though the C to T reversion increased the number of input reads about 7-fold (Table <tblr tid="T3">3</tblr>).</p>
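<p>To illustrate what "at most one mismatch" means for a reverted read, the sketch below scans a reference by brute force. It is not how bowtie works (bowtie uses an index), and the reference sequence and function names are made up for the example.</p>

```python
from typing import List, Tuple


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_hits(read: str, utr: str, max_mismatches: int = 1) -> List[Tuple[int, int]]:
    """Return (start, mismatches) for every window of the reference that
    matches the read with at most max_mismatches substitutions."""
    hits = []
    for start in range(len(utr) - len(read) + 1):
        distance = hamming(read, utr[start:start + len(read)])
        if max_mismatches >= distance:
            hits.append((start, distance))
    return hits


# Hypothetical 3' UTR fragment; the read carries two T to C conversions.
utr = "GGGAATGTTTAATGGCGACCC"
direct = find_hits("AATGCTCAATGGCGA", utr)    # observed read, two conversions
reverted = find_hits("AATGTTCAATGGCGA", utr)  # one C reverted to T
```

<p>In this example the observed read differs from the reference at two positions and is not reported with a one-mismatch tolerance, whereas the variant with one C reverted to T is reported at position 3 with a single mismatch.</p>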
<tbl id="T3"><title><p>Table 3</p></title><caption><p>Comparison of computational time and bowtie mapping</p></caption><tblbdy cols="4">
      <r>
         <c>
            <p/>
         </c>
         <c ca="left">
            <p>
               <b>miRTarCLIP</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>PARalyzer</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>Comparison</b>
            </p>
         </c>
      </r>
      <r>
         <c cspan="4">
            <hr/>
         </c>
      </r>
      <r>
         <c ca="left" cspan="4">
            <p><b>Times </b>(sec)</p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>C to T program</p>
         </c>
         <c ca="left">
            <p>11.7</p>
         </c>
         <c ca="left">
            <p>-</p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Bowtie</p>
         </c>
         <c ca="left">
            <p>135</p>
         </c>
         <c ca="left">
            <p>301.7</p>
         </c>
         <c>
            <p/>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>
               <b>Total</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>146.7</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>301.7</b>
            </p>
         </c>
         <c ca="left">
            <p>
               <b>2-fold</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left" cspan="4">
            <p>
               <b>Bowtie input and output</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Input reads</p>
         </c>
         <c ca="left">
            <p>6429483</p>
         </c>
         <c ca="left">
            <p>919698</p>
         </c>
         <c ca="left">
            <p>
               <b>7-fold</b>
            </p>
         </c>
      </r>
      <r>
         <c ca="left">
            <p>Output reports</p>
         </c>
         <c ca="left">
            <p>10785713</p>
         </c>
         <c ca="left">
            <p>16895608</p>
         </c>
         <c ca="left">
            <p>
               <b>0.64-fold</b>
            </p>
         </c>
      </r>
   </tblbdy><tblfn>
      <p># Dataset: PAR-CLIP data from Hafner et al. (<ul>SRX020783</ul>).</p>
      <p># Bowtie parameter: one mismatch in miRTarCLIP and two mismatches in PARalyzer.</p>
      <p># System: Linux x64, Intel(R) Xeon(R) CPU E5620 @ 2.40GHz, 16G RAM.</p>
   </tblfn></tbl>
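<p>The values in the Comparison column of Table 3 can be recomputed from the other two columns. The short check below uses only the numbers reported in the table; the variable names are ours.</p>

```python
# Values copied from Table 3.
mirtarclip_time = 11.7 + 135      # C to T program + bowtie, in seconds
paralyzer_time = 301.7            # bowtie with two mismatches, in seconds
mirtarclip_input, paralyzer_input = 6429483, 919698
mirtarclip_output, paralyzer_output = 10785713, 16895608

speedup = paralyzer_time / mirtarclip_time           # reported as 2-fold
input_ratio = mirtarclip_input / paralyzer_input     # reported as 7-fold
output_ratio = mirtarclip_output / paralyzer_output  # reported as 0.64-fold

print(f"{speedup:.2f} {input_ratio:.2f} {output_ratio:.2f}")
```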
</sec>
<sec>
<st>
<p>Step 5: cluster searching and analysis</p>
</st>
<p>The mapped reads are clustered based on a minimum overlap between them, and a cluster must contain at least five reads. For PAR-CLIP datasets, at least 20% of the reads in a cluster should carry a T to C conversion. Whether the cluster sequence is a possible target site is then checked using the miRNA seed region sequences extracted from miRBase.</p>
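<p>The clustering criteria can be sketched as follows. Several details are assumptions made for illustration: the text does not give the overlap threshold (here <it>min_overlap</it>, in nucleotides), reads are reduced to alignment intervals on one 3' UTR, and the seed is taken as miRNA nucleotides 2 to 8. All names are hypothetical.</p>

```python
from typing import List, Tuple

# (start, end, has_t_to_c): interval on one 3' UTR, end exclusive
Read = Tuple[int, int, bool]


def cluster_reads(reads: List[Read], min_overlap: int = 1, min_reads: int = 5,
                  min_tc_fraction: float = 0.2) -> List[List[Read]]:
    """Group overlapping reads, then keep clusters with at least min_reads
    reads of which at least min_tc_fraction carry a T to C conversion."""
    clusters: List[List[Read]] = []
    current: List[Read] = []
    cluster_end = 0
    for read in sorted(reads):
        start, end, _ = read
        if current and cluster_end - start >= min_overlap:
            current.append(read)
            cluster_end = max(cluster_end, end)
        else:
            if current:
                clusters.append(current)
            current = [read]
            cluster_end = end
    if current:
        clusters.append(current)
    kept = []
    for cluster in clusters:
        converted = sum(1 for r in cluster if r[2])
        if len(cluster) >= min_reads and converted >= min_tc_fraction * len(cluster):
            kept.append(cluster)
    return kept


def seed_match_site(mirna: str) -> str:
    """DNA site complementary to miRNA nucleotides 2-8 (assumed seed)."""
    seed = mirna.upper().replace("U", "T")[1:8]
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seed))


def has_seed_match(cluster_sequence: str, mirna: str) -> bool:
    """True when the cluster sequence contains the seed-match site."""
    return seed_match_site(mirna) in cluster_sequence.upper()
```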
</sec>
<sec>
<st>
<p>Step 6: miRNA-target interaction (MTI) analysis</p>
</st>
<p>The clustering results are used to search for possible miRNA target sites using TargetScan. If a candidate target site is experimentally validated according to miRTarBase, the system displays it at the top. The other candidates are ranked according to the context scores assigned by TargetScan.</p>
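<p>The ordering rule can be expressed as a two-part sort key. The sketch below assumes that a more negative TargetScan context score indicates a stronger predicted site and should therefore rank higher; the text does not state the direction. The record type and field names are hypothetical.</p>

```python
from typing import List, NamedTuple


class Candidate(NamedTuple):
    mirna: str
    gene: str
    context_score: float  # TargetScan context score
    validated: bool       # True when the MTI is listed in miRTarBase


def rank_candidates(candidates: List[Candidate]) -> List[Candidate]:
    """Validated candidates first, then ascending context score
    (assumption: more negative scores rank higher)."""
    return sorted(candidates, key=lambda c: (not c.validated, c.context_score))


# Made-up candidates for illustration only.
ranked = rank_candidates([
    Candidate("miR-x", "GENE1", -0.10, False),
    Candidate("miR-x", "GENE2", -0.45, False),
    Candidate("miR-x", "GENE3", -0.05, True),
])
```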
</sec>
</sec>
<sec>
<st>
<p>Availability and requirements</p>
</st>
<p>miRTarCLIP was implemented in PHP and integrates the FASTX-Toolkit, the SRA-Toolkit, and the bowtie program written in C++. The software runs on 32-bit or 64-bit Linux machines. The software and case study results can be accessed online at <url>http://miRTarCLIP.mbc.nctu.edu.tw</url>.</p>
</sec>
<sec>
<st>
<p>Competing interests</p>
</st>
<p>The authors declare that they have no competing interests.</p>
</sec>
<sec>
<st>
<p>Authors' contributions</p>
</st>
<p>CHC developed all experimental concepts and wrote part of the program and the manuscript. JHH organized the study and wrote the manuscript. FML contributed to some experimental concepts and assisted in the design of the study. MTC assisted in the design of the study and in programming. SDH, THC, SLW, SS, and CCH assisted in the design of the study. HDH managed the study in the initial model, and assisted in writing and revising the manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec>
<st>
<p>Declaration</p>
</st>
<p>The authors approve the submission of this paper to <it>BMC Genomics</it> for publication. The payment of a publishing charge to BioMed Central for this article was supported by National Science Council of the Republic of China, No. NSC 101-2311-B-009-003-MY3 and NSC 100-2627-B-009-002. This publishing charge was supported in part by the UST-UCSD International Center of Excellence in Advanced Bio-engineering sponsored by the Taiwan National Science Council I-RiCE Program under Grant Number: NSC 101-2911-I-009-101, and Veterans General Hospitals and University System of Taiwan (VGHUST) Joint Research Program under Grant Number: VGHUST101-G5-1-1. This publishing charge was also partially supported by MOE ATU.</p>
<p>This article has been published as part of <it>BMC Genomics </it>Volume 14 Supplement 1, 2013: Selected articles from the Eleventh Asia Pacific Bioinformatics Conference (APBC 2013): Genomics. The full contents of the supplement are available online at <url>http://www.biomedcentral.com/bmcgenomics/supplements/14/S1</url>.</p>
</sec>
</bdy><bm>
<ack>
<sec>
<st>
<p>Acknowledgements</p>
</st>
<p>The authors would like to thank the National Science Council of the Republic of China for financially supporting this research under Contract No. NSC 101-2311-B-009-003-MY3 and NSC 100-2627-B-009-002. This work was supported in part by the UST-UCSD International Center of Excellence in Advanced Bio-engineering sponsored by the Taiwan National Science Council I-RiCE Program under Grant Number: NSC 100-2911-I-009-101, and Veterans General Hospitals and University System of Taiwan (VGHUST) Joint Research Program under Grant Number: VGHUST101-G5-1-1. This work was also partially supported by MOE ATU.</p>
</sec>
</ack>
<refgrp><bibl id="B1"><title><p>MicroRNAs: genomics, biogenesis, mechanism, and function</p></title><aug><au><snm>Bartel</snm><fnm>DP</fnm></au></aug><source>Cell</source><pubdate>2004</pubdate><volume>116</volume><issue>2</issue><fpage>281</fpage><lpage>297</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/S0092-8674(04)00045-5</pubid><pubid idtype="pmpid" link="fulltext">14744438</pubid></pubidlist></xrefbib></bibl><bibl id="B2"><title><p>The widespread regulation of microRNA biogenesis, function and decay</p></title><aug><au><snm>Krol</snm><fnm>J</fnm></au><au><snm>Loedige</snm><fnm>I</fnm></au><au><snm>Filipowicz</snm><fnm>W</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2010</pubdate><volume>11</volume><issue>9</issue><fpage>597</fpage><lpage>610</lpage><xrefbib><pubid idtype="pmpid" link="fulltext">20661255</pubid></xrefbib></bibl><bibl id="B3"><title><p>Non-coding RNAs in human disease</p></title><aug><au><snm>Esteller</snm><fnm>M</fnm></au></aug><source>Nat Rev Genet</source><pubdate>2011</pubdate><volume>12</volume><issue>12</issue><fpage>861</fpage><lpage>874</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nrg3074</pubid><pubid idtype="pmpid" link="fulltext">22094949</pubid></pubidlist></xrefbib></bibl><bibl id="B4"><title><p>MicroRNAs: target recognition and regulatory functions</p></title><aug><au><snm>Bartel</snm><fnm>DP</fnm></au></aug><source>Cell</source><pubdate>2009</pubdate><volume>136</volume><issue>2</issue><fpage>215</fpage><lpage>233</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2009.01.002</pubid><pubid idtype="pmpid" link="fulltext">19167326</pubid></pubidlist></xrefbib></bibl><bibl id="B5"><title><p>Discovering microRNAs from deep sequencing data using 
miRDeep</p></title><aug><au><snm>Friedlander</snm><fnm>MR</fnm></au><au><snm>Chen</snm><fnm>W</fnm></au><au><snm>Adamidi</snm><fnm>C</fnm></au><au><snm>Maaskola</snm><fnm>J</fnm></au><au><snm>Einspanier</snm><fnm>R</fnm></au><au><snm>Knespel</snm><fnm>S</fnm></au><au><snm>Rajewsky</snm><fnm>N</fnm></au></aug><source>Nat Biotechnol</source><pubdate>2008</pubdate><volume>26</volume><issue>4</issue><fpage>407</fpage><lpage>415</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nbt1394</pubid><pubid idtype="pmpid" link="fulltext">18392026</pubid></pubidlist></xrefbib></bibl><bibl id="B6"><title><p>miRDeep2 accurately identifies known and hundreds of novel microRNA genes in seven animal clades</p></title><aug><au><snm>Friedlander</snm><fnm>MR</fnm></au><au><snm>Mackowiak</snm><fnm>SD</fnm></au><au><snm>Li</snm><fnm>N</fnm></au><au><snm>Chen</snm><fnm>W</fnm></au><au><snm>Rajewsky</snm><fnm>N</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2012</pubdate><volume>40</volume><issue>1</issue><fpage>37</fpage><lpage>52</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkr688</pubid><pubid idtype="pmcid">3245920</pubid><pubid idtype="pmpid" link="fulltext">21911355</pubid></pubidlist></xrefbib></bibl><bibl id="B7"><title><p>deepBase: a database for deeply annotating and mining deep sequencing data</p></title><aug><au><snm>Yang</snm><fnm>JH</fnm></au><au><snm>Shao</snm><fnm>P</fnm></au><au><snm>Zhou</snm><fnm>H</fnm></au><au><snm>Chen</snm><fnm>YQ</fnm></au><au><snm>Qu</snm><fnm>LH</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>Database</issue><fpage>D123</fpage><lpage>130</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp943</pubid><pubid idtype="pmcid">2808990</pubid><pubid idtype="pmpid" link="fulltext">19966272</pubid></pubidlist></xrefbib></bibl><bibl id="B8"><title><p>Geoseq: a tool for dissecting deep-sequencing 
datasets</p></title><aug><au><snm>Gurtowski</snm><fnm>J</fnm></au><au><snm>Cancio</snm><fnm>A</fnm></au><au><snm>Shah</snm><fnm>H</fnm></au><au><snm>Levovitz</snm><fnm>C</fnm></au><au><snm>George</snm><fnm>A</fnm></au><au><snm>Homann</snm><fnm>R</fnm></au><au><snm>Sachidanandam</snm><fnm>R</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2010</pubdate><volume>11</volume><fpage>506</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-11-506</pubid><pubid idtype="pmcid">2972303</pubid><pubid idtype="pmpid" link="fulltext">20939882</pubid></pubidlist></xrefbib></bibl><bibl id="B9"><title><p>miRanalyzer: a microRNA detection and analysis tool for next-generation sequencing experiments</p></title><aug><au><snm>Hackenberg</snm><fnm>M</fnm></au><au><snm>Sturm</snm><fnm>M</fnm></au><au><snm>Langenberger</snm><fnm>D</fnm></au><au><snm>Falcon-Perez</snm><fnm>JM</fnm></au><au><snm>Aransay</snm><fnm>AM</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><issue>Web Server</issue><fpage>W68</fpage><lpage>76</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp347</pubid><pubid idtype="pmcid">2703919</pubid><pubid idtype="pmpid" link="fulltext">19433510</pubid></pubidlist></xrefbib></bibl><bibl id="B10"><title><p>SeqBuster, a bioinformatic tool for the processing and analysis of small RNAs datasets, reveals ubiquitous miRNA modifications in human embryonic cells</p></title><aug><au><snm>Pantano</snm><fnm>L</fnm></au><au><snm>Estivill</snm><fnm>X</fnm></au><au><snm>Marti</snm><fnm>E</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>5</issue><fpage>e34</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkp1127</pubid><pubid idtype="pmcid">2836562</pubid><pubid idtype="pmpid" link="fulltext">20008100</pubid></pubidlist></xrefbib></bibl><bibl id="B11"><title><p>mirTools: microRNA profiling and discovery based on high-throughput 
sequencing</p></title><aug><au><snm>Zhu</snm><fnm>E</fnm></au><au><snm>Zhao</snm><fnm>F</fnm></au><au><snm>Xu</snm><fnm>G</fnm></au><au><snm>Hou</snm><fnm>H</fnm></au><au><snm>Zhou</snm><fnm>L</fnm></au><au><snm>Li</snm><fnm>X</fnm></au><au><snm>Sun</snm><fnm>Z</fnm></au><au><snm>Wu</snm><fnm>J</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>Web Server</issue><fpage>W392</fpage><lpage>397</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq393</pubid><pubid idtype="pmcid">2896132</pubid><pubid idtype="pmpid" link="fulltext">20478827</pubid></pubidlist></xrefbib></bibl><bibl id="B12"><title><p>DSAP: deep-sequencing small RNA analysis pipeline</p></title><aug><au><snm>Huang</snm><fnm>PJ</fnm></au><au><snm>Liu</snm><fnm>YC</fnm></au><au><snm>Lee</snm><fnm>CC</fnm></au><au><snm>Lin</snm><fnm>WC</fnm></au><au><snm>Gan</snm><fnm>RR</fnm></au><au><snm>Lyu</snm><fnm>PC</fnm></au><au><snm>Tang</snm><fnm>P</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>Web Server</issue><fpage>W385</fpage><lpage>391</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq392</pubid><pubid idtype="pmcid">2896168</pubid><pubid idtype="pmpid" link="fulltext">20478825</pubid></pubidlist></xrefbib></bibl><bibl id="B13"><title><p>miRNAkey: a software for microRNA deep sequencing analysis</p></title><aug><au><snm>Ronen</snm><fnm>R</fnm></au><au><snm>Gan</snm><fnm>I</fnm></au><au><snm>Modai</snm><fnm>S</fnm></au><au><snm>Sukacheov</snm><fnm>A</fnm></au><au><snm>Dror</snm><fnm>G</fnm></au><au><snm>Halperin</snm><fnm>E</fnm></au><au><snm>Shomron</snm><fnm>N</fnm></au></aug><source>Bioinformatics</source><pubdate>2010</pubdate><volume>26</volume><issue>20</issue><fpage>2615</fpage><lpage>2616</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/bioinformatics/btq493</pubid><pubid idtype="pmpid" link="fulltext">20801911</pubid></pubidlist></xrefbib></bibl><bibl 
id="B14"><title><p>miRExpress: analyzing high-throughput sequencing data for profiling microRNA expression</p></title><aug><au><snm>Wang</snm><fnm>WC</fnm></au><au><snm>Lin</snm><fnm>FM</fnm></au><au><snm>Chang</snm><fnm>WC</fnm></au><au><snm>Lin</snm><fnm>KY</fnm></au><au><snm>Huang</snm><fnm>HD</fnm></au><au><snm>Lin</snm><fnm>NS</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2009</pubdate><volume>10</volume><fpage>328</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-10-328</pubid><pubid idtype="pmcid">2767369</pubid><pubid idtype="pmpid" link="fulltext">19821977</pubid></pubidlist></xrefbib></bibl><bibl id="B15"><title><p>Argonaute HITS-CLIP decodes microRNA-mRNA interaction maps</p></title><aug><au><snm>Chi</snm><fnm>SW</fnm></au><au><snm>Zang</snm><fnm>JB</fnm></au><au><snm>Mele</snm><fnm>A</fnm></au><au><snm>Darnell</snm><fnm>RB</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>460</volume><issue>7254</issue><fpage>479</fpage><lpage>486</lpage><xrefbib><pubidlist><pubid idtype="pmcid">2733940</pubid><pubid idtype="pmpid" link="fulltext">19536157</pubid></pubidlist></xrefbib></bibl><bibl id="B16"><title><p>Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP</p></title><aug><au><snm>Hafner</snm><fnm>M</fnm></au><au><snm>Landthaler</snm><fnm>M</fnm></au><au><snm>Burger</snm><fnm>L</fnm></au><au><snm>Khorshid</snm><fnm>M</fnm></au><au><snm>Hausser</snm><fnm>J</fnm></au><au><snm>Berninger</snm><fnm>P</fnm></au><au><snm>Rothballer</snm><fnm>A</fnm></au><au><snm>Ascano</snm><fnm>M</fnm><suf>Jr</suf></au><au><snm>Jungkamp</snm><fnm>AC</fnm></au><au><snm>Munschauer</snm><fnm>M</fnm></au><etal/></aug><source>Cell</source><pubdate>2010</pubdate><volume>141</volume><issue>1</issue><fpage>129</fpage><lpage>141</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2010.03.009</pubid><pubid idtype="pmcid">2861495</pubid><pubid idtype="pmpid" 
link="fulltext">20371350</pubid></pubidlist></xrefbib></bibl><bibl id="B17"><title><p>Comprehensive discovery of endogenous Argonaute binding sites in Caenorhabditis elegans</p></title><aug><au><snm>Zisoulis</snm><fnm>DG</fnm></au><au><snm>Lovci</snm><fnm>MT</fnm></au><au><snm>Wilbert</snm><fnm>ML</fnm></au><au><snm>Hutt</snm><fnm>KR</fnm></au><au><snm>Liang</snm><fnm>TY</fnm></au><au><snm>Pasquinelli</snm><fnm>AE</fnm></au><au><snm>Yeo</snm><fnm>GW</fnm></au></aug><source>Nature structural &amp; molecular biology</source><pubdate>2010</pubdate><volume>17</volume><issue>2</issue><fpage>173</fpage><lpage>179</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1745</pubid><pubid idtype="pmpid" link="fulltext">23235501</pubid></pubidlist></xrefbib></bibl><bibl id="B18"><title><p>Genome-wide identification of Ago2 binding sites from mouse embryonic stem cells with and without mature microRNAs</p></title><aug><au><snm>Leung</snm><fnm>AK</fnm></au><au><snm>Young</snm><fnm>AG</fnm></au><au><snm>Bhutkar</snm><fnm>A</fnm></au><au><snm>Zheng</snm><fnm>GX</fnm></au><au><snm>Bosson</snm><fnm>AD</fnm></au><au><snm>Nielsen</snm><fnm>CB</fnm></au><au><snm>Sharp</snm><fnm>PA</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2011</pubdate><volume>18</volume><issue>2</issue><fpage>237</fpage><lpage>244</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.1991</pubid><pubid idtype="pmcid">3078052</pubid><pubid idtype="pmpid" link="fulltext">21258322</pubid></pubidlist></xrefbib></bibl><bibl id="B19"><title><p>In vivo and transcriptome-wide identification of RNA binding protein target sites</p></title><aug><au><snm>Jungkamp</snm><fnm>AC</fnm></au><au><snm>Stoeckius</snm><fnm>M</fnm></au><au><snm>Mecenas</snm><fnm>D</fnm></au><au><snm>Grun</snm><fnm>D</fnm></au><au><snm>Mastrobuoni</snm><fnm>G</fnm></au><au><snm>Kempa</snm><fnm>S</fnm></au><au><snm>Rajewsky</snm><fnm>N</fnm></au></aug><source>Mol 
Cell</source><pubdate>2011</pubdate><volume>44</volume><issue>5</issue><fpage>828</fpage><lpage>840</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.molcel.2011.11.009</pubid><pubid idtype="pmcid">3253457</pubid><pubid idtype="pmpid" link="fulltext">22152485</pubid></pubidlist></xrefbib></bibl><bibl id="B20"><title><p>Viral microRNA targetome of KSHV-infected primary effusion lymphoma cell lines</p></title><aug><au><snm>Gottwein</snm><fnm>E</fnm></au><au><snm>Corcoran</snm><fnm>DL</fnm></au><au><snm>Mukherjee</snm><fnm>N</fnm></au><au><snm>Skalsky</snm><fnm>RL</fnm></au><au><snm>Hafner</snm><fnm>M</fnm></au><au><snm>Nusbaum</snm><fnm>JD</fnm></au><au><snm>Shamulailatpam</snm><fnm>P</fnm></au><au><snm>Love</snm><fnm>CL</fnm></au><au><snm>Dave</snm><fnm>SS</fnm></au><au><snm>Tuschl</snm><fnm>T</fnm></au><etal/></aug><source>Cell Host Microbe</source><pubdate>2011</pubdate><volume>10</volume><issue>5</issue><fpage>515</fpage><lpage>526</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.chom.2011.09.012</pubid><pubid idtype="pmcid">3222872</pubid><pubid idtype="pmpid" link="fulltext">22100165</pubid></pubidlist></xrefbib></bibl><bibl id="B21"><title><p>CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins</p></title><aug><au><snm>Khorshid</snm><fnm>M</fnm></au><au><snm>Rodak</snm><fnm>C</fnm></au><au><snm>Zavolan</snm><fnm>M</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><issue>Database</issue><fpage>D245</fpage><lpage>252</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq940</pubid><pubid idtype="pmcid">3013791</pubid><pubid idtype="pmpid" link="fulltext">21087992</pubid></pubidlist></xrefbib></bibl><bibl id="B22"><title><p>starBase: a database for exploring microRNA-mRNA interaction maps from Argonaute CLIP-Seq and Degradome-Seq 
data</p></title><aug><au><snm>Yang</snm><fnm>JH</fnm></au><au><snm>Li</snm><fnm>JH</fnm></au><au><snm>Shao</snm><fnm>P</fnm></au><au><snm>Zhou</snm><fnm>H</fnm></au><au><snm>Chen</snm><fnm>YQ</fnm></au><au><snm>Qu</snm><fnm>LH</fnm></au></aug><source>Nucleic acids research</source><pubdate>2011</pubdate><volume>39</volume><issue>Database</issue><fpage>D202</fpage><lpage>209</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq1056</pubid><pubid idtype="pmcid">3013664</pubid><pubid idtype="pmpid" link="fulltext">21037263</pubid></pubidlist></xrefbib></bibl><bibl id="B23"><title><p>doRiNA: a database of RNA interactions in post-transcriptional regulation</p></title><aug><au><snm>Anders</snm><fnm>G</fnm></au><au><snm>Mackowiak</snm><fnm>SD</fnm></au><au><snm>Jens</snm><fnm>M</fnm></au><au><snm>Maaskola</snm><fnm>J</fnm></au><au><snm>Kuntzagk</snm><fnm>A</fnm></au><au><snm>Rajewsky</snm><fnm>N</fnm></au><au><snm>Landthaler</snm><fnm>M</fnm></au><au><snm>Dieterich</snm><fnm>C</fnm></au></aug><source>Nucleic acids research</source><pubdate>2012</pubdate><volume>40</volume><issue>Database</issue><fpage>D180</fpage><lpage>186</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3245013</pubid><pubid idtype="pmpid" link="fulltext">22086949</pubid></pubidlist></xrefbib></bibl><bibl id="B24"><title><p>TarBase 6.0: capturing the exponential growth of miRNA targets with experimental support</p></title><aug><au><snm>Vergoulis</snm><fnm>T</fnm></au><au><snm>Vlachos</snm><fnm>IS</fnm></au><au><snm>Alexiou</snm><fnm>P</fnm></au><au><snm>Georgakilas</snm><fnm>G</fnm></au><au><snm>Maragkakis</snm><fnm>M</fnm></au><au><snm>Reczko</snm><fnm>M</fnm></au><au><snm>Gerangelos</snm><fnm>S</fnm></au><au><snm>Koziris</snm><fnm>N</fnm></au><au><snm>Dalamagas</snm><fnm>T</fnm></au><au><snm>Hatzigeorgiou</snm><fnm>AG</fnm></au></aug><source>Nucleic acids 
research</source><pubdate>2012</pubdate><volume>40</volume><issue>Database</issue><fpage>D222</fpage><lpage>229</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3245116</pubid><pubid idtype="pmpid" link="fulltext">22135297</pubid></pubidlist></xrefbib></bibl><bibl id="B25"><title><p>PARalyzer: definition of RNA binding sites from PAR-CLIP short-read sequence data</p></title><aug><au><snm>Corcoran</snm><fnm>DL</fnm></au><au><snm>Georgiev</snm><fnm>S</fnm></au><au><snm>Mukherjee</snm><fnm>N</fnm></au><au><snm>Gottwein</snm><fnm>E</fnm></au><au><snm>Skalsky</snm><fnm>RL</fnm></au><au><snm>Keene</snm><fnm>JD</fnm></au><au><snm>Ohler</snm><fnm>U</fnm></au></aug><source>Genome Biol</source><pubdate>2011</pubdate><volume>12</volume><issue>8</issue><fpage>R79</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2011-12-8-r79</pubid><pubid idtype="pmcid">3302668</pubid><pubid idtype="pmpid" link="fulltext">21851591</pubid></pubidlist></xrefbib></bibl><bibl id="B26"><title><p>Structure of an argonaute silencing complex with a seed-containing guide DNA and target RNA duplex</p></title><aug><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Juranek</snm><fnm>S</fnm></au><au><snm>Li</snm><fnm>H</fnm></au><au><snm>Sheng</snm><fnm>G</fnm></au><au><snm>Tuschl</snm><fnm>T</fnm></au><au><snm>Patel</snm><fnm>DJ</fnm></au></aug><source>Nature</source><pubdate>2008</pubdate><volume>456</volume><issue>7224</issue><fpage>921</fpage><lpage>926</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nature07666</pubid><pubid idtype="pmcid">2765400</pubid><pubid idtype="pmpid" link="fulltext">19092929</pubid></pubidlist></xrefbib></bibl><bibl id="B27"><title><p>The microRNA Registry</p></title><aug><au><snm>Griffiths-Jones</snm><fnm>S</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2004</pubdate><volume>32</volume><issue>Database</issue><fpage>D109</fpage><lpage>111</lpage><xrefbib><pubidlist><pubid idtype="pmcid">308757</pubid><pubid idtype="pmpid" 
link="fulltext">14681370</pubid></pubidlist></xrefbib></bibl><bibl id="B28"><title><p>miRBase: integrating microRNA annotation and deep-sequencing data</p></title><aug><au><snm>Kozomara</snm><fnm>A</fnm></au><au><snm>Griffiths-Jones</snm><fnm>S</fnm></au></aug><source>Nucleic acids research</source><pubdate>2011</pubdate><volume>39</volume><issue>Database</issue><fpage>D152</fpage><lpage>157</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq1027</pubid><pubid idtype="pmcid">3013655</pubid><pubid idtype="pmpid" link="fulltext">21037258</pubid></pubidlist></xrefbib></bibl><bibl id="B29"><title><p>Conserved seed pairing, often flanked by adenosines, indicates that thousands of human genes are microRNA targets</p></title><aug><au><snm>Lewis</snm><fnm>BP</fnm></au><au><snm>Burge</snm><fnm>CB</fnm></au><au><snm>Bartel</snm><fnm>DP</fnm></au></aug><source>Cell</source><pubdate>2005</pubdate><volume>120</volume><issue>1</issue><fpage>15</fpage><lpage>20</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1016/j.cell.2004.12.035</pubid><pubid idtype="pmpid" link="fulltext">15652477</pubid></pubidlist></xrefbib></bibl><bibl id="B30"><title><p>Weak seed-pairing stability and high target-site abundance decrease the proficiency of lsy-6 and other microRNAs</p></title><aug><au><snm>Garcia</snm><fnm>DM</fnm></au><au><snm>Baek</snm><fnm>D</fnm></au><au><snm>Shin</snm><fnm>C</fnm></au><au><snm>Bell</snm><fnm>GW</fnm></au><au><snm>Grimson</snm><fnm>A</fnm></au><au><snm>Bartel</snm><fnm>DP</fnm></au></aug><source>Nat Struct Mol Biol</source><pubdate>2011</pubdate><volume>18</volume><issue>10</issue><fpage>1139</fpage><lpage>1146</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/nsmb.2115</pubid><pubid idtype="pmcid">3190056</pubid><pubid idtype="pmpid" link="fulltext">21909094</pubid></pubidlist></xrefbib></bibl><bibl id="B31"><title><p>miRTarBase: a database curates experimentally validated microRNA-target 
interactions</p></title><aug><au><snm>Hsu</snm><fnm>SD</fnm></au><au><snm>Lin</snm><fnm>FM</fnm></au><au><snm>Wu</snm><fnm>WY</fnm></au><au><snm>Liang</snm><fnm>C</fnm></au><au><snm>Huang</snm><fnm>WC</fnm></au><au><snm>Chan</snm><fnm>WL</fnm></au><au><snm>Tsai</snm><fnm>WT</fnm></au><au><snm>Chen</snm><fnm>GZ</fnm></au><au><snm>Lee</snm><fnm>CJ</fnm></au><au><snm>Chiu</snm><fnm>CM</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><issue>Database</issue><fpage>D163</fpage><lpage>169</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1093/nar/gkq1107</pubid><pubid idtype="pmcid">3013699</pubid><pubid idtype="pmpid" link="fulltext">21071411</pubid></pubidlist></xrefbib></bibl><bibl id="B32"><title><p>FASTQ/A short-reads pre-processing tools</p></title><aug><au><cnm>FASTX-Toolkit</cnm></au></aug><url>http://hannonlab.cshl.edu/fastx_toolkit/</url></bibl><bibl id="B33"><title><p>The Sequence Read Archive: explosive growth of sequencing data</p></title><aug><au><snm>Kodama</snm><fnm>Y</fnm></au><au><snm>Shumway</snm><fnm>M</fnm></au><au><snm>Leinonen</snm><fnm>R</fnm></au><au><cnm>International Nucleotide Sequence Database C</cnm></au></aug><source>Nucleic Acids Res</source><pubdate>2012</pubdate><volume>40</volume><issue>Database</issue><fpage>D54</fpage><lpage>56</lpage><xrefbib><pubidlist><pubid idtype="pmcid">3245110</pubid><pubid idtype="pmpid" link="fulltext">22009675</pubid></pubidlist></xrefbib></bibl><bibl id="B34"><title><p>Ultrafast and memory-efficient alignment of short DNA sequences to the human genome</p></title><aug><au><snm>Langmead</snm><fnm>B</fnm></au><au><snm>Trapnell</snm><fnm>C</fnm></au><au><snm>Pop</snm><fnm>M</fnm></au><au><snm>Salzberg</snm><fnm>SL</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><issue>3</issue><fpage>R25</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/gb-2009-10-3-r25</pubid><pubid idtype="pmcid">2690996</pubid><pubid idtype="pmpid" 
link="fulltext">19261174</pubid></pubidlist></xrefbib></bibl><bibl id="B35"><title><p>Inference of miRNA targets using evolutionary conservation and pathway analysis</p></title><aug><au><snm>Gaidatzis</snm><fnm>D</fnm></au><au><snm>van Nimwegen</snm><fnm>E</fnm></au><au><snm>Hausser</snm><fnm>J</fnm></au><au><snm>Zavolan</snm><fnm>M</fnm></au></aug><source>BMC Bioinformatics</source><pubdate>2007</pubdate><volume>8</volume><fpage>69</fpage><xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2105-8-69</pubid><pubid idtype="pmcid">1838429</pubid><pubid idtype="pmpid" link="fulltext">17331257</pubid></pubidlist></xrefbib></bibl><bibl id="B36"><title><p>Combinatorial microRNA target predictions</p></title><aug><au><snm>Krek</snm><fnm>A</fnm></au><au><snm>Grun</snm><fnm>D</fnm></au><au><snm>Poy</snm><fnm>MN</fnm></au><au><snm>Wolf</snm><fnm>R</fnm></au><au><snm>Rosenberg</snm><fnm>L</fnm></au><au><snm>Epstein</snm><fnm>EJ</fnm></au><au><snm>MacMenamin</snm><fnm>P</fnm></au><au><snm>da Piedade</snm><fnm>I</fnm></au><au><snm>Gunsalus</snm><fnm>KC</fnm></au><au><snm>Stoffel</snm><fnm>M</fnm></au><etal/></aug><source>Nat Genet</source><pubdate>2005</pubdate><volume>37</volume><issue>5</issue><fpage>495</fpage><lpage>500</lpage><xrefbib><pubidlist><pubid idtype="doi">10.1038/ng1536</pubid><pubid idtype="pmpid" link="fulltext">15806104</pubid></pubidlist></xrefbib></bibl></refgrp>
</bm></art>