<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-406</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Computational RNomics of Drosophilids</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Rose</snm>
               <fnm>Dominic</fnm>
               <insr iid="I1"/>
               <email>dominic@bioinf.uni-leipzig.de</email>
            </au>
            <au id="A2">
               <snm>Hackerm&#252;ller</snm>
               <fnm>J&#246;rg</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>joerg.hackermueller@izi.fraunhofer.de</email>
            </au>
            <au id="A3">
               <snm>Washietl</snm>
               <fnm>Stefan</fnm>
               <insr iid="I4"/>
               <email>wash@tbi.univie.ac.at</email>
            </au>
            <au id="A4">
               <snm>Reiche</snm>
               <fnm>Kristin</fnm>
               <insr iid="I1"/>
               <email>kristin@bioinf.uni-leipzig.de</email>
            </au>
            <au id="A5">
               <snm>Hertel</snm>
               <fnm>Jana</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>jana@bioinf.uni-leipzig.de</email>
            </au>
            <au id="A6">
               <snm>Findei&#223;</snm>
               <fnm>Sven</fnm>
               <insr iid="I1"/>
               <email>sven@bioinf.uni-leipzig.de</email>
            </au>
            <au id="A7">
               <snm>Stadler</snm>
               <mi>F</mi>
               <fnm>Peter</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <insr iid="I4"/>
               <insr iid="I6"/>
               <email>studla@bioinf.uni-leipzig.de</email>
            </au>
            <au id="A8" ca="yes">
               <snm>Prohaska</snm>
               <mi>J</mi>
               <fnm>Sonja</fnm>
               <insr iid="I5"/>
               <email>sonja.prohaska@asu.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Bioinformatics Group, Department of Computer Science, University of Leipzig, H&#228;rtelstra&#223;e 16-18, Leipzig, Germany, D-04107</p>
            </ins>
            <ins id="I2">
               <p>Fraunhofer Institute for Cell Therapy and Immunology, Deutscher Platz 5e, Leipzig, Germany, D-04103</p>
            </ins>
            <ins id="I3">
               <p>Interdisciplinary Center for Bioinformatics, University of Leipzig, H&#228;rtelstra&#223;e 16-18, Leipzig, Germany, D-04107 </p>
            </ins>
            <ins id="I4">
               <p>Department of Theoretical Chemistry, University of Vienna, W&#228;hringerstra&#223;e 17, Wien, Austria, A-1090</p>
            </ins>
            <ins id="I5">
               <p>Biomedical Informatics, Arizona State University, Tempe, PO-Box 878809, USA, AZ 85287</p>
            </ins>
            <ins id="I6">
               <p>Santa Fe Institute, 1399 Hyde Park Rd., Santa Fe, USA, NM 87501</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>406</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/406</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17996037</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-406</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>15</day>
               <month>1</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>08</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>08</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Rose et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Recent experimental and computational studies have provided overwhelming evidence for a plethora of diverse transcripts that are unrelated to protein-coding genes. One subclass consists of those RNAs that require distinctive secondary structure motifs to exert their biological function and hence exhibit distinctive patterns of sequence conservation characteristic of positive selection on RNA secondary structure.</p>
               <p>The deep sequencing of 12 drosophilid species coordinated by the NHGRI provides an ideal data set for comparative computational approaches to determine those genomic loci that code for evolutionarily conserved RNA motifs. This class of loci includes the majority of the known small ncRNAs as well as structured RNA motifs in mRNAs. We report here on a genome-wide survey using RNAz.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We obtain 16 000 high-quality predictions, among which we recover the majority of the known ncRNAs. Taking a pessimistically estimated false discovery rate of 40% into account, this implies that at least some ten thousand loci in the <it>Drosophila </it>genome show the hallmarks of stabilizing selection acting on RNA structure, and hence are most likely functional at the RNA level. A subset of RNAz predictions overlapping with TRF1 and BRF binding sites [Isogai <it>et al</it>., <it>EMBO J</it>. 26: 79&#8211;89 (2007)], which are plausible candidates for Pol III transcripts, has been studied in more detail. Among these sequences we identify several "clusters" of ncRNA candidates with striking structural similarities.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The statistical evaluation of the RNAz predictions in comparison with a similar analysis of vertebrate genomes [Washietl <it>et al., Nat. Biotech</it>. <b>23</b>: 1383&#8211;1390 (2005)] shows that qualitatively similar fractions of structured RNAs are found in introns, UTRs, and intergenic regions. The intergenic RNA structures, however, are concentrated much more closely around known protein-coding loci, suggesting that flies have a significantly smaller complement of independent structured ncRNAs than mammals.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>High-throughput transcriptome data obtained in particular using tiling arrays <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp> and cDNA sequencing <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp> in conjunction with detailed functional studies of individual genes have profoundly changed our picture of eukaryotic gene regulation by emphasizing multiple regulatory layers, many of which involve non-protein-coding RNAs (ncRNAs). In contrast to protein-coding genes, however, ncRNAs do not form a homogeneous group of transcripts but rather belong to a diverse array of classes with vastly different structures, functions, and evolutionary patterns <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
         <p>Efficient computational methods <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp> have recently been developed to determine the genomic inventory of a large subgroup of ncRNAs, namely those that exhibit evolutionarily conserved secondary structures. As stabilizing selection acts to preserve structure in the presence of sequence variation, these transcripts are very likely to have a discernible biological function &#8211; as opposed to being a mere byproduct of transcriptional noise <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> or gene regulation by transcriptional interference <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. The group of structured RNAs that we are considering here consequently includes the classical families of small ncRNAs (tRNAs, rRNAs, miRNAs, snRNAs, snoRNAs, RNAse P RNA, etc.) as well as structured, usually regulatory, motifs associated with larger coding or non-coding transcripts, such as internal ribosomal entry sites, IRE, and SECIS signals; see e.g. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The RNAz approach <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> has proven to produce rather high-quality predictions. In particular, as part of the detailed analysis of the ENCODE regions <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, the verification of many unannotated RNAz predictions by means of RT-PCR has been reported, and corroborating evidence from high-throughput experiments has been obtained for a substantial fraction of RNAz predictions.</p>
         <p>Computational screens for structured RNAs have been reported so far for mammalian <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, urochordate <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, nematode <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>, and yeast <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> genomes. However, no comprehensive analysis of structured ncRNAs in insect genomes has been published, even though there is statistical evidence for an enrichment of structured RNAs within highly conserved non-coding elements of drosophilids <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Drosophilids, which have been deeply sequenced by a consortium coordinated by the NHGRI <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>, provide an ideal model system for this task, since their evolutionary divergence is comparable to that of mammals. As a consequence, large portions of their genomes are alignable, while at the same time there is substantial sequence variation. Both are necessary prerequisites for currently available ncRNA detection tools.</p>
         <p>In addition to the statistical evidence for widespread structured RNAs in insects, two recent genome-wide experimental studies provide evidence of a large reservoir of novel ncRNAs in <it>Drosophila melanogaster</it>: Isogai <it>et al</it>. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> mapped TRF1 and BRF binding sites in the <it>D. melanogaster </it>genome and showed that, unlike in most other eukaryotes, TRF1/BRF binding appears responsible for the initiation of all classes of Polymerase-III (Pol III) transcription. As the known Pol III transcripts are small ncRNAs, their data suggest that drosophilids are likely to have a large set of previously unannotated small ncRNAs. A large-scale tiling array study of transcription in the early development of <it>D. melanogaster </it><abbrgrp><abbr bid="B5">5</abbr></abbrgrp> found that about 20% of the observed transcripts come from stand-alone intergenic or intronic sources and may constitute new types of RNAs, including a substantial fraction of ncRNAs.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>We report here on a computational screen for structured RNA motifs in drosophilids based on the 12-species Pecan alignments provided by the Consortium. The detected RNAz hits are either (parts of) independently transcribed non-coding RNAs with evolutionarily conserved secondary structures, or they are structured elements that are parts of coding transcripts such as SECIS or IRE elements.</p>
         <sec>
            <st>
               <p>Sensitivity and Specificity</p>
            </st>
            <p>Overall, 42 482 RNAz hits corresponding to roughly 5 Mb in the <it>D. melanogaster </it>genome show evidence of evolutionarily conserved RNA secondary structure. About 20% of these overlap existing annotation. The 16 377 loci of the high-confidence set cover approximately 2.1 Mb of DNA, see Tab. <tblr tid="T1">1</tblr>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Overall statistics of the RNAz screen. Initial filtering of the Pecan alignments leaves roughly 50% of the aligned DNA as input for RNAz, i.e. for the ncRNA prediction. The distribution of RNAz hits does not show a chromosomal bias. We counted the number of predicted loci and their overall length at two probability thresholds (<it>p </it>> 0.5, <it>p </it>> 0.9) for both the original and the randomized alignments. The resulting relative frequencies (given as percentages) can be interpreted as false discovery rates (FDR). As expected, the FDR decreases at the higher RNAz probability threshold.</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>overall</p>
                     </c>
                     <c cspan="6" ca="center">
                        <p>chromosomes</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>2L</p>
                     </c>
                     <c ca="left">
                        <p>2R</p>
                     </c>
                     <c ca="left">
                        <p>3L</p>
                     </c>
                     <c ca="left">
                        <p>3R</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>X</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>alignments</p>
                     </c>
                     <c ca="left">
                        <p>4077</p>
                     </c>
                     <c ca="left">
                        <p>659</p>
                     </c>
                     <c ca="left">
                        <p>804</p>
                     </c>
                     <c ca="left">
                        <p>676</p>
                     </c>
                     <c ca="left">
                        <p>861</p>
                     </c>
                     <c ca="left">
                        <p>65</p>
                     </c>
                     <c ca="left">
                        <p>1012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>aligned DNA [Mb]</p>
                     </c>
                     <c ca="left">
                        <p>117</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>22</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>screened by RNAz [Mb]</p>
                     </c>
                     <c ca="left">
                        <p>57.4</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>0.4</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>percentage</p>
                     </c>
                     <c ca="left">
                        <p>49</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                     <c ca="left">
                        <p>48</p>
                     </c>
                     <c ca="left">
                        <p>52</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>46</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RNAz <it>p </it>> 0.5</p>
                     </c>
                     <c ca="left">
                        <p>42 482</p>
                     </c>
                     <c ca="left">
                        <p>7 824</p>
                     </c>
                     <c ca="left">
                        <p>6 646</p>
                     </c>
                     <c ca="left">
                        <p>8 765</p>
                     </c>
                     <c ca="left">
                        <p>10 351</p>
                     </c>
                     <c ca="left">
                        <p>196</p>
                     </c>
                     <c ca="left">
                        <p>8 700</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[Kb]</p>
                     </c>
                     <c ca="left">
                        <p>5 079</p>
                     </c>
                     <c ca="left">
                        <p>927</p>
                     </c>
                     <c ca="left">
                        <p>783</p>
                     </c>
                     <c ca="left">
                        <p>1 060</p>
                     </c>
                     <c ca="left">
                        <p>1 229</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>1 055</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RNAz <it>p </it>> 0.9</p>
                     </c>
                     <c ca="left">
                        <p>16 377</p>
                     </c>
                     <c ca="left">
                        <p>2 940</p>
                     </c>
                     <c ca="left">
                        <p>2 473</p>
                     </c>
                     <c ca="left">
                        <p>3 413</p>
                     </c>
                     <c ca="left">
                        <p>3 862</p>
                     </c>
                     <c ca="left">
                        <p>80</p>
                     </c>
                     <c ca="left">
                        <p>3 609</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[Kb]</p>
                     </c>
                     <c ca="left">
                        <p>2 167</p>
                     </c>
                     <c ca="left">
                        <p>385</p>
                     </c>
                     <c ca="left">
                        <p>321</p>
                     </c>
                     <c ca="left">
                        <p>461</p>
                     </c>
                     <c ca="left">
                        <p>511</p>
                     </c>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>478</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>FDR <it>p </it>> 0.5 hits</p>
                     </c>
                     <c ca="left">
                        <p>56.5</p>
                     </c>
                     <c ca="left">
                        <p>54.5</p>
                     </c>
                     <c ca="left">
                        <p>57.2</p>
                     </c>
                     <c ca="left">
                        <p>57.5</p>
                     </c>
                     <c ca="left">
                        <p>55.9</p>
                     </c>
                     <c ca="left">
                        <p>68.4</p>
                     </c>
                     <c ca="left">
                        <p>57.3</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>sequence</p>
                     </c>
                     <c ca="left">
                        <p>52.8</p>
                     </c>
                     <c ca="left">
                        <p>50.7</p>
                     </c>
                     <c ca="left">
                        <p>53.6</p>
                     </c>
                     <c ca="left">
                        <p>53.9</p>
                     </c>
                     <c ca="left">
                        <p>52.4</p>
                     </c>
                     <c ca="left">
                        <p>64.0</p>
                     </c>
                     <c ca="left">
                        <p>53.0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>FDR <it>p </it>> 0.9</p>
                     </c>
                     <c ca="left">
                        <p>45.3</p>
                     </c>
                     <c ca="left">
                        <p>43.6</p>
                     </c>
                     <c ca="left">
                        <p>45.1</p>
                     </c>
                     <c ca="left">
                        <p>47.8</p>
                     </c>
                     <c ca="left">
                        <p>46.2</p>
                     </c>
                     <c ca="left">
                        <p>43.7</p>
                     </c>
                     <c ca="left">
                        <p>43.8</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>sequence</p>
                     </c>
                     <c ca="left">
                        <p>40.2</p>
                     </c>
                     <c ca="left">
                        <p>38.2</p>
                     </c>
                     <c ca="left">
                        <p>40.2</p>
                     </c>
                     <c ca="left">
                        <p>42.5</p>
                     </c>
                     <c ca="left">
                        <p>41.1</p>
                     </c>
                     <c ca="left">
                        <p>36.4</p>
                     </c>
                     <c ca="left">
                        <p>38.7</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>In total, 336 hits correspond to known non-coding RNAs according to at least one source of annotation (FlyBase: 316; BLAST against miRBase: 79; BLAST against Noncode: 44; BLAST against Rfam: 222; tRNAscan: 159). Tab. <tblr tid="T2">2</tblr> summarizes the recall of the screen on several "classical" ncRNA families. Note that some classes of ncRNAs, notably the 5S rRNA sequences, were already deliberately removed from the Pecan alignments. We recovered 96% of the known <it>D. melanogaster </it>miRNAs.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Sensitivity of the RNAz screen on known ncRNAs.</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Class</p>
                     </c>
                     <c ca="right">
                        <p>RNAz</p>
                     </c>
                     <c ca="right">
                        <p>input</p>
                     </c>
                     <c ca="right">
                        <p>annotated</p>
                     </c>
                     <c ca="right">
                        <p>sensitivity (%)</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>tRNA</p>
                     </c>
                     <c ca="right">
                        <p>171</p>
                     </c>
                     <c ca="right">
                        <p>250</p>
                     </c>
                     <c ca="right">
                        <p>297</p>
                     </c>
                     <c ca="right">
                        <p>69</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5S rRNA</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>99</p>
                     </c>
                     <c ca="right">
                        <p>--</p>
                     </c>
                     <c ca="left">
                        <p>not in input</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SRP RNA</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>0</p>
                     </c>
                     <c ca="right">
                        <p>2</p>
                     </c>
                     <c ca="right">
                        <p>--</p>
                     </c>
                     <c ca="left">
                        <p>not in input</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RNAse P</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>1</p>
                     </c>
                     <c ca="right">
                        <p>100</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>snRNA</p>
                     </c>
                     <c ca="right">
                        <p>18</p>
                     </c>
                     <c ca="right">
                        <p>22</p>
                     </c>
                     <c ca="right">
                        <p>22</p>
                     </c>
                     <c ca="right">
                        <p>81</p>
                     </c>
                     <c ca="left">
                        <p>U6 not detected</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>snoRNA</p>
                     </c>
                     <c ca="right">
                        <p>96</p>
                     </c>
                     <c ca="right">
                        <p>202</p>
                     </c>
                     <c ca="right">
                        <p>250</p>
                     </c>
                     <c ca="right">
                        <p>48</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>miRNA</p>
                     </c>
                     <c ca="right">
                        <p>75</p>
                     </c>
                     <c ca="right">
                        <p>78</p>
                     </c>
                     <c ca="right">
                        <p>85</p>
                     </c>
                     <c ca="right">
                        <p>96</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>A BLAST search of the drosophilid RNAz hits against the results of prior RNAz surveys of mammals <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, urochordates <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>, and nematodes <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> yielded the following pattern of conservation: 167 tRNA hits and 11 snRNAs associated with the major spliceosome. Furthermore, we recover the U6atac snRNA (which was previously unannotated) and 5 microRNAs.</p>
            <p>In order to estimate the false discovery rate (FDR), we repeated the screen with shuffled alignments as described in <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Alignments are shuffled such that two alignment columns are swapped only if both their gap pattern and their sequence conservation pattern are the same. This amounts to a very "gentle" shuffling that in particular preserves pairwise sequence divergence within any given window. This gentle shuffling procedure may fail to remove the secondary structure signal in some cases because too few pairs of alignment columns satisfy the stringent conditions for shuffling. This is at least one reason why we observe 3 239 (7.6%) hits in which the true and shuffled screens intersect, including 53 of the 756 annotated <it>D. melanogaster </it>ncRNAs, almost exclusively tRNAs. The estimated FDR of roughly 50% for <it>p </it>> 0.5 and 40% for the high quality set in Tab. <tblr tid="T1">1</tblr> should therefore be regarded as pessimistic estimates.</p>
            <p>The results of a second, more "vigorous" shuffling approach lead to much more optimistic estimates: shuffling of the columns without considering their sequence conservation or gap pattern reduces the estimated FDR by a factor of 35 to only 1&#8211;2%. One may argue, of course, that shuffling columns independently will change the gap pattern of the alignment (even though it still conserves pairwise sequence identities). Hence, this procedure may well underestimate the FDR. The dramatic difference in the result highlights a general problem that so far has not been solved in a satisfactory way, namely how to systematically construct randomized <it>alignments </it>that preserve all correlation features of the genomic background except the one under consideration. One important feature which must be mentioned at this point is dinucleotide content. Due to the stacking energy contributions in the folding model, dinucleotide content can affect folding energies and thus FDR estimates considerably. Since there is still no way of randomizing alignments while preserving dinucleotide content, we cannot control for this effect. However, we found that, in contrast to mammalian genomes <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, there is no strong dinucleotide bias in the genomic background of <it>D. melanogaster </it>that affects folding energies. Therefore, our estimates from mononucleotide shuffled alignments will not differ dramatically from estimates one would obtain from controls with the same dinucleotide content.</p>
            <p>In contrast to most previous RNAz screens, we have not removed coding sequences from the input alignments. Notably, 8 021 hits for <it>p </it>> 0.5 and 2 208 hits for <it>p </it>> 0.9 overlap with annotated coding regions, accounting for 19% and 13% of the RNAz hits, respectively. These fractions are much smaller than the expected FDRs; we therefore expect that most of these signals are indeed false positives. Interestingly, if we base our analysis on the number of nucleotides that are predicted to lie in regions with conserved structures instead of counting the RNAz hits, the estimates are reduced to 15% and 11.5%, respectively (cp. Fig. <figr fid="F1">1</figr>). Conversely, only 12% (8 326) at <it>p </it>> 0.5 and less than 4% (2 522) at <it>p </it>> 0.9 of the annotated coding regions are detected by RNAz. Note that 1 398 RNAz hits overlap more than one annotated coding region. The small percentage of RNAz hits in annotated CDS indicates that even a possibly large number of unannotated coding sequences will not have a significant impact on the interpretation of the RNAz results, in the sense that only a small fraction of the RNAz hits may be previously unannotated CDS. To further corroborate this point we have computed the overlap of the RNAz predictions with various gene prediction tracks available in the UCSC Table Browser, yielding no significant increase in the number of RNAz hits located in putative CDS: in total only 11 172 (<it>p </it>> 0.5) and 3 144 (<it>p </it>> 0.9) RNAz hits lie in regions with any evidence for coding capacity.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Genomic distribution of D. melanogaster RNAz hits</p>
               </caption>
               <text>
                  <p><b>Genomic distribution of D. melanogaster RNAz hits</b>. We compare genomic locations of the RNAz hits in <it>D. melanogaster </it>for two different classification thresholds with the corresponding distribution of the input alignments (relative to the current FlyBase gene track from the UCSC Table Browser, April 2004). In addition, the corresponding distribution for the human ENCODE regions [22] is shown. The numbers differ slightly from ref. [22] since here we have normalized them to 100%. Percentages for the 5'-UTRs are not given due to the very small bar areas; the values are (from left to right): 1.24%, 1.69%, 1.70% and 0.6%. In general, the distribution of structured RNAs closely follows that of conserved sequence, i.e., there is no strong enrichment of RNAz hits in a particular annotation class. The most striking difference between human and fly is the much larger fraction of intronic RNAz hits in the ENCODE data.</p>
               </text>
               <graphic file="1471-2164-8-406-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Genomic Distribution</p>
            </st>
            <p>The genomic distribution of structured RNA candidates in <it>D. melanogaster </it>is comparable to the observations in previous RNAz-based screens, see Fig. <figr fid="F1">1</figr>. As in the ENCODE data <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, the distribution of RNAz hits largely follows the patterns of sequence conservation. In the fly data, only 5'UTRs show a substantial enrichment relative to the input data. In contrast, the largest enrichment in the human ENCODE data was observed for 3'UTRs <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. The most striking difference between fly and human data is that the relative fraction of both intronic RNAz hits and intronic sequence conservation is twice as large in human.</p>
            <p>In a recent article Manak and colleagues <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> describe widespread transcriptional activity in the <it>D. melanogaster </it>genome during 12 timepoints of early embryonic development detected by genomic tiling arrays. When comparing the RNAz hits to this data, we identify 4 236 (<it>p </it>> 0.5) and 1 713 (<it>p </it>> 0.9) hits that overlap a Transfrag in any of the 12 timepoints. A comparison of the fractions of RNAz hits from normal and control screen which overlap Transfrags in one, several or all timepoints yields, however, no significant enrichments (see Additional file <supplr sid="S1">1</supplr> for details).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Supplemental figures and tables. Figure 1: The <it>D. melanogaster </it>antennapedia complex. Figure 2: The <it>D. melanogaster </it>bithorax complex. Figure 3: Exemplary consensus secondary structures of two RNAz predictions. Figure 4: Comparison of obtained p-values. Figure 5: Complete WPGMA cluster tree of RNA candidates overlapping TRF and BRF binding regions. Table 1: Comparison of RNAz predicted ncRNAs using normal and randomized alignments. Table 2: Summary of RNAz predicted ncRNAs. Table 3: Number of predicted ncRNAs which overlap with Transfrags from <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Table 4: Number of predicted ncRNAs which overlap with Transfrags from <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> in one, several or all timepoints. Table 5: Intersection (> 80%) of RNAz predictions and UCSC Table Browser tracks. Table 6: Most prominent structural clusters of novel RNA candidates that overlap TRF or BRF binding regions.</p>
               </text>
               <file name="1471-2164-8-406-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The distance distribution of intergenic RNAz hits reveals a striking difference between the situation in the human and the fly genome, Fig. <figr fid="F2">2</figr>. Since the <it>D. melanogaster </it>genome is much more compact than the human one, we need to compare the distribution of the distances between RNAz hits and the nearest coding sequence relative to the length distribution of the intergenic regions (IGR). In Fig. <figr fid="F2">2</figr> we plot the relative frequency of IGR with a length exceeding a given distance <it>D</it>, and the relative frequency of RNAz hits with a distance larger than <it>D </it>from the nearest coding region. If intergenic RNAz hits are uniformly distributed within the IGR, the distribution of RNAz-CDS distances looks like the distribution of IGR distances, just shifted to the left by a factor of 4. Indeed, this is observed in the human data, albeit the shift is a factor between 3 and 4, indicating that the placement of intergenic RNAz hits in human is nearly uniform, with a small tendency of avoiding the proximity of coding genes.</p>
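            <p>The factor of 4 quoted for uniform placement can be made plausible with a small simulation. The following is a minimal sketch, not the procedure used to produce Fig. 2: the function names and the IGR length of 8 000 nt are hypothetical, and only a single IGR is simulated. A hit placed uniformly in an IGR of length L has a nearest-CDS distance that is uniform between 0 and L/2, so its mean is L/4.</p>

```python
import random


def nearest_cds_distance(igr_length, position):
    """Distance from a position inside an IGR to the closer flanking CDS."""
    return min(position, igr_length - position)


def mean_distance_uniform(igr_length, n_samples=100000, seed=0):
    """Average nearest-CDS distance for hits placed uniformly in one IGR."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        position = rng.uniform(0, igr_length)
        total += nearest_cds_distance(igr_length, position)
    return total / n_samples


if __name__ == "__main__":
    length = 8000.0  # hypothetical IGR length in nt (assumption)
    # Under uniform placement the ratio is expected to be near 0.25.
    print(mean_distance_uniform(length) / length)
```

            <p>A ratio clearly below one quarter would point to hits clustering near coding sequences, a ratio above it to hits avoiding them.</p>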
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Distributions of IGR length and distances of RNAz hits to their nearest annotated CDS element in fly and human</p>
               </caption>
               <text>
                  <p><b>Distributions of IGR length and distances of RNAz hits to their nearest annotated CDS element in fly and human</b>. The two curves with shaded backgrounds show the distribution of IGRs that exceed a given length <it>D </it>for <it>Homo sapiens </it>and <it>Drosophila melanogaster</it>, respectively. The shape of these curves is very similar. Note that, although distances <it>D </it>&lt; 50 nt are omitted in the plot, all cumulative distributions of course reach 1 at <it>D </it>= 1. The main difference is that the IGRs in fly are on average two orders of magnitude shorter. Thick lines indicate the distribution of distances of RNAz hits that have a distance of more than <it>D </it>from the nearest coding sequence. In humans, this distribution is similar to the IGR distribution, shifted to the left by a factor of 3 to 4. In contrast, we observe a completely different shape in flies: A fraction of about 40% of the RNAz hits is located adjacent to the annotated genes. On the other hand, a small fraction of the RNAz hits is located further away from coding genes than expected. RNAz hits refer to the comprehensive set <it>p </it>> 0.5.</p>
               </text>
               <graphic file="1471-2164-8-406-2"/>
            </fig>
            <p>In contrast, about 40% of the <it>D. melanogaster </it>RNAz hits in intergenic regions are located adjacent to coding sequences. This may indicate that current annotation of the fly genome lists boundaries of protein coding genes that systematically truncate the UTRs. If this is the case, however, then we would have to interpret more than 15% of the total RNAz hits as located in UTRs. Our data could be explained if a situation similar to the <it>minifly </it>gene is prevalent in the fly: For this gene a recent study <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> described several alternative poly-A sites and multiple small ncRNAs that are processed from the alternative 3'UTRs. At least one of these ncRNAs is structured: the snoRNA H1 was also detected in our screen. In any case, the structured RNAs predicted by RNAz are on average much more closely linked to protein coding genes in flies than in human.</p>
            <p>On the other hand, a small fraction (&#8776; 10%) of the intergenic RNAz hits, i.e., the tail in Fig. <figr fid="F2">2</figr>, is located much further away from CDS than expected for random placement. This suggests the existence of a distinct class of RNAz hits with a propensity for large IGRs. Most likely, these signals correspond to independently transcribed ncRNAs.</p>
            <p>About 20% of the unannotated transcripts observed in <it>D. melanogaster </it>early development arise from stand-alone intergenic or intronic sources (relative to FlyBase annotation) <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Only a relatively small fraction of the novel independent transcripts (5.1% of the total transcriptional output) had intergenic origin. In comparison, more than 13% [21.9% of 60%] of the transcriptional output recorded by comparable methods from the ENCODE regions has a distal intergenic source (in relation to annotated exons) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. This difference is in agreement with closer association of most RNAz hits with protein coding genes in the fly.</p>
         </sec>
         <sec>
            <st>
               <p>Further Annotation of RNAz Predictions</p>
            </st>
            <p>In a recent study, Isogai <it>et al</it>. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> identified TRF1 and BRF binding sites using high-resolution genome tiling microarrays and provided evidence that in <it>Drosophila </it>the alternative TRF1/BRF complex appears responsible for the initiation of all known classes of Pol III transcription. At the <it>p </it>> 0.9 significance level RNAz hits are about three-fold enriched in these regions. We have therefore analyzed the distribution of RNAz hits within the experimentally determined TRF1 and BRF binding regions. As reported in <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>, most of the sites correspond to tRNAs, 7SL RNAs, and a subset of snoRNAs. In addition to these known ncRNAs, the loci contain 197 unannotated RNAz hits, which are prime candidates for novel Pol III transcripts.</p>
            <p>In order to identify putative microRNAs, we screened all RNAz hits with RNAmicro <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. This results in 607 candidates, of which 541 are unannotated so far. 176 of these signals are located in annotated CDS and are therefore most likely false positives, leaving 365 plausible microRNA candidates. The recent discovery of hundreds of new human microRNAs that are not conserved beyond primates strongly suggests that "evolution of miRNAs is an ongoing process and that along with ancient, highly conserved miRNAs, there are a number of emerging miRNAs" <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. In the light of these data, a large number of drosophilid-specific microRNAs is not unexpected.</p>
            <p>Using SnoReport <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>,  RNAz hits are classified as putative box H/ACA snoRNAs, of which 4 intersect with previously annotated snoRNAs. Taking into account that for only 22 of the 250 annotated snoRNAs the annotation distinguishes between box H/ACA (3), box C/D (18), and scaRNAs (1), the small overlap with the existing annotation is not surprising. Again, recent experimental surveys in other species, including nematodes <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> and mammals <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> have discovered a substantial number of previously unannotated snoRNAs in these species, suggesting that the current annotation of snoRNAs in <it>D. melanogaster </it>is also far from complete.</p>
            <p>Finally, 1 700 RNAz hits have direct evidence for expression through ESTs that are not related to protein coding genes, i.e., through ESTs that do not intersect with the FlyBase, RefSeq, N-SCAN, Genscan, Human Proteins gene prediction and mRNA tracks of the UCSC Table Browser.</p>
         </sec>
         <sec>
            <st>
               <p>Structure-Based Clustering</p>
            </st>
            <p>Since the 197 unannotated RNAz hits that overlap TRF1 or BRF binding regions <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> are good candidates for <it>bona fide </it>ncRNAs, we applied structure-based clustering to this small subset of our predictions to identify common secondary structures and, hence, putative novel functional RNAs. The complete clustering tree as well as a table of the most prominent clusters is given in Additional file <supplr sid="S1">1</supplr>. Since all clusters have a mean pairwise identity of less than 45%, structurally related candidates are typically highly diverged at the sequence level.</p>
            <p>In Fig. <figr fid="F3">3</figr> an example cluster of complex structures is given. Clusters 22, 25 and 28 have a structure with two stem loops in common. All consensus structures show compensatory mutations.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Cluster of complex structures</p>
               </caption>
               <text>
                  <p><b>Cluster of complex structures</b>. Structure-based clustering of RNAz hits with evidence for transcription by Pol III identifies a group of <it>Y </it>-shaped, potentially related putative ncRNAs. Abbreviations: N...number of sequences in cluster. MPI...mean pairwise identity of multiple alignment. SCI...structure conservation index.</p>
               </text>
               <graphic file="1471-2164-8-406-3"/>
            </fig>
            <p>Fig. <figr fid="F4">4</figr> depicts a large cluster of simple hairpin structures. They show a relatively high structural conservation (high structure conservation index, SCI) whereas the sequence similarity (expressed as mean pairwise identity, MPI) is small.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Cluster of simple hairpin structures</p>
               </caption>
               <text>
                  <p><b>Cluster of simple hairpin structures</b>. A fraction of the RNAz hits with evidence for transcription by Pol III exhibits hairpin structure. However, they lack any other annotation. This is in line with the finding that miRNAs are not transcribed by Pol III. Abbreviations: N...number of sequences in cluster. MPI...mean pairwise identity of multiple alignment. SCI...structure conservation index.</p>
               </text>
               <graphic file="1471-2164-8-406-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic Distribution</p>
            </st>
            <p>In order to study the phylogenetic distribution of the RNAz predictions, we determine for each RNAz hit the last common ancestor of the species whose sequences are contained in the corresponding input alignment. Fig. <figr fid="F5">5</figr> summarizes these results for both the true data and the "gentle" control screen.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Phylogenetic distribution of ncRNA candidates</p>
               </caption>
               <text>
                  <p><b>Phylogenetic distribution of ncRNA candidates</b>. The tree only represents the topology and is not drawn to scale. Branch lengths are indicated below by large numbers in sans serif font, measured in terms of substitutions per site for 4-fold degenerate sites. For each branch we mark the number of RNAz hits for <it>p </it>> 0.5 and <it>p </it>> 0.9, respectively, above the branch leading to the last common ancestor (LCA) of the sequences in the corresponding input alignment (full boxes). Below the branches we indicate the corresponding numbers for the "gentle" control screen. Below the tree the ratio of the fraction of newly appearing RNAz hits and the branch length is given, indicating little variation in the "innovation rate". Since the original tree is unrooted without an outgroup, no data are available for the branch separating <it>Sophophora </it>from the rest. The number of RNAz hits listed in the tree is smaller than the total number of RNAz hits because we only considered sequences present in all single windows of an RNAz hit here.</p>
               </text>
               <graphic file="1471-2164-8-406-5"/>
            </fig>
            <p>More than 50% of the RNAz hits are found only within the <it>melanogaster </it>subgroup. To interpret this result we compute the ratio of newly appearing RNAz hits and the branch length for each branch in the tree leading to <it>D. melanogaster</it>. We observe little variation in the data, with the exception of a reduced rate of innovation along the most recent branch. This reduction is, however, most likely a methodological artefact, since the pairwise mutation distances between <it>D. melanogaster</it>, <it>D. simulans</it>, and <it>D. sechellia </it>are only about 0.1 and RNAz is known to be less sensitive for highly similar sequences.</p>
            <p>Approximately 12% of the RNAz hits are conserved throughout all drosophilids. In comparison, a screen of vertebrate genomes <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> found about 3% of mammalian RNAz candidates (1 000 out of 36 000) to be conserved throughout vertebrates.</p>
            <p>A comparison of true and shuffled screens furthermore indicates a small but significant decrease of the FDR with phylogenetic age of the RNAz hit.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The present computational survey of drosophilid genomes yields about 16 000 high quality predictions. Taking into account the (very pessimistically estimated) false discovery rate of about 40%, this implies that at least some ten thousand loci in the <it>Drosophila </it>genome show the hallmarks of stabilizing selection acting on RNA structure, and hence are most likely functional at the RNA level. The elucidation of these functions, however, remains elusive in many cases. Here, we have studied a small subset in more detail. Almost 200 RNAz hits overlap with loci that are likely to be transcribed by Pol III, strongly suggesting that these are <it>bona fide </it>ncRNAs. Using structural clustering, we discovered several groups of structurally similar ncRNA candidates in these regions.</p>
         <p>This number of putative ncRNAs and regulatory RNA elements is not unexpected given that about 36 000 high quality RNAz hits have been found by a similar procedure in a screen of mammalian genomes <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, which was based on an input set of comparable size comprising about 103 Mb of the human genome. A similar number of putative ncRNAs was reported using the SCFG-based evofold approach <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
         <p>A comparison with the results from a similar RNAz screen of the human genome <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> and with an analysis of ENCODE regions <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> shows many similarities and several striking differences. We observe a smaller fraction of intronic and a larger fraction of protein coding hits (cp. Fig. <figr fid="F1">1</figr>) in flies. A comparison of the distances between RNAz hits and their nearest annotated protein coding sequence shows that structured RNAs are concentrated much more strongly around known genes in flies than in human, even when accounting for the much more compact <it>D. melanogaster </it>genome. This observation agrees with recent tiling array data <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> which showed that a much smaller fraction of intergenic transcription is truly independent from surrounding protein coding genes in flies compared to human <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>The inventory of structurally conserved RNAs is only a very small subset of the total non-coding transcriptional output, which covers most of the non-repetitive genome <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The current computational approach relies on substantial sequence conservation. Indeed many of the known ncRNAs that were missed in our survey were not in the input set. In fact, RNAz explicitly requires two independent signals for stabilizing selection: (1) sequence conservation so that a good alignment can be computed as input, and (2) stabilizing selection on RNA secondary structure in the presence of sequence variation. RNAz hits are therefore subject to specific selection pressures that make it highly likely that RNAz predictions have a distinctive biological function. In contrast, it has recently been shown that in some cases, such as the bithoraxoid ncRNAs of the <it>Drosophila </it>bithorax complex, ncRNA transcription itself, acting in cis, represses a target gene (in this case Ubx) <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. In such a scenario, however, we do not expect to observe high levels of sequence conservation of the non-coding transcripts or the tell-tale substitution patterns of conserved secondary structures.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data Sources</p>
            </st>
            <p>For our analysis we used the Pecan <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> alignment of the 12 drosophilid genomes <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp> of the Comparative Analysis Freeze 1 (CAF1, Feb. 2006). The alignments were downloaded from <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B41">41</abbr></abbrgrp>. We favored the Pecan alignments over two other sets of drosophilid alignments that are available at <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B42">42</abbr></abbrgrp>. Visual inspection strongly suggested that Mavid alignments <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> are more biased towards protein coding regions. We did not use the Multiz alignments, because they contain three additional genomes (insects), and removing those sequences would effectively require a complete realignment in order to obtain a fair comparison between screens performed on different input alignments. The Pecan alignments comprise the <it>D. melanogaster </it>chromosomes 2L (22.4 Mb), 2R (20.8 Mb), 3L (23.8 Mb), 3R (27.9 Mb), 4 (1.3 Mb), and X (22.2 Mb).</p>
         </sec>
         <sec>
            <st>
               <p>Preprocessing of Input Alignments</p>
            </st>
            <p>The current implementation of RNAz is restricted to input alignments containing at most 6 sequences and a maximum length of 400 nt due to the training of the underlying SVM <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>. In addition, certain restrictions apply to the fraction of gaps and to the overall base composition as a consequence of the data sets that were used to train the SVM model. The original genomic alignments thus need to be preprocessed. The protocol used in this contribution closely follows that of previous RNAz-based studies:</p>
            <p>Alignments longer than 120 nt are cut into 120 nt slices in 40 nt steps, so that subsequent slices overlap by 80 nt. This default length is motivated by the fact that many structured RNAs are less than 100 nt long. Such short signals would "drown" in the noise of longer alignments that are then mostly unstructured. On the other hand, alignments that are too short do not yield reliable signals for secondary structure conservation. In a series of filtering steps, sequences were removed from the individual alignments or alignment slices if they (a) are shorter than 50 nt, (b) contain more than 25% gap characters, or (c) have a base composition outside the definition range of RNAz (e.g. GC content > 0.75 or &lt; 0.25).</p>
            <p>Alignments were discarded completely if fewer than 3 sequences were left after the filtering steps, or they did not contain a <it>D. melanogaster </it>sequence, since this species serves as a reference and as the basis for subsequent annotation. All preprocessing steps were performed using the script rnazWindows.pl of the current release of the RNAz package <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
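            <p>The slicing and filtering steps described above can be sketched as follows. This is an illustrative reimplementation under stated assumptions, not the code of rnazWindows.pl: all function names and the reference label "dm" are hypothetical, the thresholds are those quoted in the text, and the treatment of a trailing remainder shorter than one step is an assumption.</p>

```python
def slice_starts(aln_length, window=120, step=40):
    """Start offsets of the slices cut from one alignment block."""
    if window >= aln_length:
        return [0]  # short alignments are kept as a single slice
    # Assumption: a trailing remainder shorter than one step is dropped.
    return list(range(0, aln_length - window + 1, step))


def keep_sequence(row, min_len=50, max_gap=0.25, gc_range=(0.25, 0.75)):
    """Apply filters (a) to (c) to one aligned row (a string with '-' gaps)."""
    bases = row.replace("-", "").upper()
    if min_len > len(bases):
        return False  # (a) too short
    if row.count("-") / len(row) > max_gap:
        return False  # (b) too many gap characters
    gc = (bases.count("G") + bases.count("C")) / len(bases)
    return gc_range[1] >= gc >= gc_range[0]  # (c) base composition


def filter_slice(rows, reference="dm"):
    """rows maps a species label to its aligned row.

    Returns the rows that pass the filters, or None if the slice is
    discarded (fewer than 3 rows left, or the reference row is missing).
    """
    kept = {name: seq for name, seq in rows.items() if keep_sequence(seq)}
    if 3 > len(kept) or reference not in kept:
        return None
    return kept
```

            <p>For a 200 nt alignment, slice_starts(200) yields the offsets 0, 40 and 80, so that consecutive 120 nt slices overlap by 80 nt.</p>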
            <p>For alignment slices with more than 6 sequences, rnazWindows.pl selects a representative subset consisting of the <it>D. melanogaster </it>sequence and five additional sequences in such a way that the 6 sequences are as evenly distributed in the dataset as possible and approach an average pairwise sequence identity of 80%, the optimal working range of the RNAz program. In practice that means that only a single representative from nearly identical sequences is chosen, and highly divergent sequences are excluded provided there is sufficient sequence variation in the remaining alignment. For the technical details of the procedure we refer to the documentation of the RNAz package <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>.</p>
            <p>Tab. <tblr tid="T1">1</tblr> summarizes the initial filtering steps. Roughly 50% of the nucleotides in the Pecan alignments are still contained in the RNAz input data.</p>
         </sec>
         <sec>
            <st>
               <p>RNAz Classification and Annotation</p>
            </st>
            <p>RNAz was applied to the filtered input alignment slices in both reading directions. Overlapping slices with a positive ncRNA classification probability of <it>p </it>> 0.5 were combined using rnazCluster.pl to a single annotation element, which we will refer to as <it>"RNAz hit"</it>. From these data, we extract a subset of high confidence RNAz hits that contain at least one slice with a prediction confidence of <it>p </it>> 0.9.</p>
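            <p>The combination of overlapping positive slices into RNAz hits amounts to merging intervals. The sketch below illustrates the idea only; it does not reproduce rnazCluster.pl. The function name, the tuple layout and the use of half-open coordinates on a single chromosome are assumptions.</p>

```python
def merge_slices(slices, p_min=0.5, p_high=0.9):
    """Merge overlapping positively classified slices into hits.

    slices: iterable of (start, end, p) with half-open coordinates.
    Returns a list of (start, end, max_p, is_high_confidence) tuples.
    """
    positive = sorted(s for s in slices if s[2] > p_min)
    hits = []
    for start, end, p in positive:
        if hits and hits[-1][1] > start:  # overlaps the previous hit
            prev = hits[-1]
            hits[-1] = [prev[0], max(prev[1], end), max(prev[2], p)]
        else:
            hits.append([start, end, p])
    # A hit is high confidence if at least one of its slices has p above p_high.
    return [(s, e, p, p > p_high) for s, e, p in hits]
```

            <p>Keeping the maximum slice probability per hit is what allows the high confidence subset to be extracted afterwards without revisiting the individual slices.</p>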
            <p>In order to estimate the false discovery rate (FDR) of the screen we repeated the entire procedure with shuffled input alignments as described in <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. The alignments were (1) shuffled using the rnazRandomizeAln.pl script (part of the RNAz package). It wraps a conservative shuffling procedure that maintains local characteristics of an alignment, e.g. columns with the same gap and conservation pattern. All remaining RNAz hits of this control screen are then shuffled once again (2) using a more stringent shuffling method that explicitly shuffles all columns of a given alignment randomly (cp. Tab. <tblr tid="T3">3</tblr>).</p>
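            <p>The difference between the two shuffling modes can be illustrated with a short sketch. It is not the implementation of rnazRandomizeAln.pl: the function names are hypothetical, and the conservation class used here (the number of distinct residues in a column) is a simplifying assumption that stands in for the conservation pattern used by the actual script.</p>

```python
import random
from collections import defaultdict


def column_signature(column):
    """Gap pattern plus a coarse conservation class for one alignment column."""
    gap_pattern = tuple(ch == "-" for ch in column)
    residues = {ch for ch in column if ch != "-"}
    return gap_pattern, len(residues)  # assumption: crude conservation class


def shuffle_columns(rows, conservative=True, seed=None):
    """Permute alignment columns; rows is a list of equal-length strings.

    conservative=True only exchanges columns that share a signature,
    conservative=False permutes all columns freely.
    """
    rng = random.Random(seed)
    columns = list(zip(*rows))
    groups = defaultdict(list)
    for idx, col in enumerate(columns):
        key = column_signature(col) if conservative else None
        groups[key].append(idx)
    shuffled = list(columns)
    for indices in groups.values():
        permuted = indices[:]
        rng.shuffle(permuted)
        for target, source in zip(indices, permuted):
            shuffled[target] = columns[source]
    return ["".join(col[i] for col in shuffled) for i in range(len(rows))]
```

            <p>In conservative mode a column whose signature is unique in the alignment cannot move at all, which is why this mode may leave a structural signal intact when few columns are interchangeable.</p>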
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Numbers of positively scored RNAz windows of the control screen.</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c ca="left">
                        <p>shuffling method</p>
                     </c>
                     <c cspan="7" ca="center">
                        <p>chromosomes</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>all</p>
                     </c>
                     <c ca="left">
                        <p>2L</p>
                     </c>
                     <c ca="left">
                        <p>2R</p>
                     </c>
                     <c ca="left">
                        <p>3L</p>
                     </c>
                     <c ca="left">
                        <p>3R</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>X</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>conservative</p>
                     </c>
                     <c ca="left">
                        <p>29 938</p>
                     </c>
                     <c ca="left">
                        <p>5 220</p>
                     </c>
                     <c ca="left">
                        <p>631</p>
                     </c>
                     <c ca="left">
                        <p>6 402</p>
                     </c>
                     <c ca="left">
                        <p>7 254</p>
                     </c>
                     <c ca="left">
                        <p>160</p>
                     </c>
                     <c ca="left">
                        <p>6 271</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>complete</p>
                     </c>
                     <c ca="left">
                        <p>662</p>
                     </c>
                     <c ca="left">
                        <p>123</p>
                     </c>
                     <c ca="left">
                        <p>99</p>
                     </c>
                     <c ca="left">
                        <p>132</p>
                     </c>
                     <c ca="left">
                        <p>155</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>152</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
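            <p>The second, complete shuffling step of the control screen amounts to a random permutation of alignment columns. The following minimal sketch assumes that an alignment is given as a list of equal-length strings; the function name and the <it>seed</it> parameter are illustrative and are not part of the RNAz package.</p>

```python
import random


def shuffle_columns(alignment, seed=None):
    """Return a copy of the alignment with all columns in random order."""
    if not alignment:
        return []
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    # Draw one permutation and apply it to every row, so columns stay intact.
    order = list(range(length))
    random.Random(seed).shuffle(order)
    return ["".join(row[i] for i in order) for row in alignment]
```

            <p>Because whole columns are moved, the base composition and the gap content of each sequence are unchanged, while the order of the columns, and with it any positional signal, is randomized.</p>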
            <p>The RNAz hits were annotated using the <it>D. melanogaster </it>sequence as reference. We performed the following annotation steps:</p>
            <p>&#8226; <b>Overlap with known <it>D. melanogaster </it>annotation</b></p>
            <p>We used the coordinates of a set of <it>D. melanogaster </it>non-coding RNAs, publicly available as gff files at <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> and <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B47">47</abbr></abbrgrp> to identify already known <it>D. melanogaster </it>ncRNAs among our predictions. Furthermore, we computed the overlap of the RNAz hits with the CDS annotations from <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B48">48</abbr></abbrgrp> (file = dmel-all-r4.3.filtered.gff) and <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> (file = all_caf1_DGIL_TEX.gff).</p>
            <p>&#8226; <b>Overlap with public non-coding RNA databases</b></p>
            <p>We also performed BLAST <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> searches using rnazBlast.pl against Rfam (version 7.0) <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>, Noncode (version 1.0) <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>, ncRNAdb <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>, FlyBase (version 2006 00.2 Beta) <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>, and miRBase (version 9.0) <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>.</p>
            <p>&#8226; <b>Tools for annotation of specific RNA families</b></p>
            <p>We used tRNAscan <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> to annotate tRNAs and RNAmicro <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> to classify putative microRNAs. RNAmicro is an SVM-based classification method that evaluates both thermodynamic stability and evolutionary conservation patterns.</p>
            <p>We used SnoReport to recognize putative snoRNAs (for technical details see <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>). Similar to RNAmicro, SnoReport is composed of a pre-filter, a secondary structure prediction step, and a subsequent SVM-based classifier. In brief, the pre-filter searches for consecutive H (pattern: ANANNA) and ACA boxes <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. In the second step, the constrained folding option of RNAfold <abbrgrp><abbr bid="B60">60</abbr></abbrgrp> is used to compute the secondary structure subject to the constraint that both boxes remain unpaired. If this results in a snoRNA-like secondary structure, several sequence and structure features are computed and passed to an SVM for classification. The model was trained on the set of snoRNAs that can be downloaded from the snoRNABase <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. C/D box snoRNAs and scaRNAs served as negative samples and H/ACA box snoRNA sequences as positive samples. Estimated positive and negative predictive values for the model used here are 80% and 99.9%, respectively.</p>
            <p>Structure-based clustering was performed as described in <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>: The modified Sankoff algorithm implemented in the LocARNA program is used to compute local structural alignments and their consensus structure. The clustering tree is obtained by agglomerative clustering using LocARNA alignment scores as distance measures. To prevent very large scores from dominating the distance transformation, we define distances by <it>d</it>(<it>i</it>, <it>j</it>) = <it>max</it>(0, <it>q </it>- <it>score</it>(<it>i</it>, <it>j</it>)), where <it>q</it> is the 99% quantile of all pairwise scores. Since the procedure is computationally very demanding, we have restricted this type of analysis to a small subset of RNAz hits that are likely Pol III transcripts.</p>
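            <p>The score-to-distance transformation can be written in a few lines. The sketch below assumes that the pairwise scores are held in a dictionary keyed by pairs of identifiers and uses the inclusive quantile definition of the Python standard library; the quantile estimator actually used in the analysis is not specified here, so this choice is an assumption.</p>

```python
from statistics import quantiles


def score_to_distance(scores, percent=99):
    """Map pairwise scores {(i, j): score} to distances max(0, q - score)."""
    # q is the chosen percentile of all pairwise scores (99% by default).
    q = quantiles(scores.values(), n=100, method="inclusive")[percent - 1]
    # Scores at or above q collapse to distance 0; lower scores give larger distances.
    return {pair: max(0.0, q - s) for pair, s in scores.items()}
```

            <p>All pairs scoring at or above the quantile receive distance 0, so a few exceptionally high scores cannot stretch the distance scale used for the agglomerative clustering.</p>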
            <p>The phylogenetic relationships within drosophilids are taken from the AAA (Alignment/Analysis/Annotation of 12 related <it>Drosophila</it> species) web site <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Branch lengths are genomic mutation distances computed from 4-fold degenerate sites in all coding regions, corrected for base composition as in <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B64">64</abbr></abbrgrp>. To determine the branch in the phylogenetic tree at which an RNAz hit first appears, we determine the last common ancestor (LCA) of the sequences in the corresponding input alignment and assign the RNAz hit to the branch in the tree leading to this internal node. Because RNAz hits are a combination of single windows, and each window represents a specific selection of sequences out of an n-way alignment, we considered for the LCA analysis only those sequences that are present in all windows of a hit.</p>
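            <p>The LCA assignment can be illustrated with a small example. The sketch below is an assumption-laden illustration rather than the code used in this study: the tree is stored as a child-to-parent map, and the toy tree, its species abbreviations and its internal node names (n0 to n3) are hypothetical and cover only a few species.</p>

```python
# Toy child-to-parent map; topology and node names are for illustration only.
TOY_PARENT = {
    "dsim": "n0", "dsec": "n0",
    "dmel": "n1", "n0": "n1",
    "dyak": "n2", "dere": "n2",
    "n1": "n3", "n2": "n3",
}


def path_to_root(node, parent):
    """List of nodes from the given node up to the root."""
    path = [node]
    while node in parent:
        node = parent[node]
        path.append(node)
    return path


def last_common_ancestor(species, parent):
    """Deepest node whose subtree contains all given species."""
    species = list(species)
    if not species:
        raise ValueError("need at least one species")
    common = set(path_to_root(species[0], parent))
    for s in species[1:]:
        common.intersection_update(path_to_root(s, parent))
    # Walking up from one leaf, the first shared node is the LCA.
    for node in path_to_root(species[0], parent):
        if node in common:
            return node


def hit_lca(windows, parent):
    """LCA of the species that are present in every window of one hit."""
    shared = set.intersection(*(set(w) for w in windows))
    return last_common_ancestor(sorted(shared), parent)
```

            <p>For a hit whose two windows contain {dmel, dsim, dyak} and {dmel, dsim, dsec}, only dmel and dsim are shared, so the hit would be assigned to the branch leading to node n1 of the toy tree.</p>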
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>DR performed the RNAz screens and coordinated the analysis; SJP initiated this study. All authors closely collaborated in annotation, statistical analysis, and interpretation of the data, contributed to writing and approved the final manuscript.</p>
      </sec>
      <sec>
         <st>
            <p>Additional Files</p>
         </st>
         <p>Machine-readable annotation files and annotation tables for all RNAz predictions can be found at <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Supplemental figures and tables are appended as a separate PDF file (Additional file <supplr sid="S1">1</supplr>).</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the "Drosophila 12 Genomes Consortium" for providing sequence, annotation and alignment data prior to publication. Special thanks go to Venky Iyer for assistance with data retrieval. This work was supported in part by the Bioinformatics Initiative of the DFG in Germany and the Austrian GEN-AU project "non coding RNA".</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Global identification of human transcribed sequences with genome tiling arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Bertone</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Stolc</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Royce</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Rozowsky</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Urban</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rinn</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Tongprasit</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Samanta</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Weissman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <issue>5705</issue>
            <fpage>2242</fpage>
            <lpage>2246</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1103388</pubid>
                  <pubid idtype="pmpid" link="fulltext">15539566</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Applications of DNA tiling arrays to experimental genome annotation and regulatory pathway discovery</p>
            </title>
            <aug>
               <au>
                  <snm>Bertone</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Chromosome Res</source>
            <pubdate>2005</pubdate>
            <volume>13</volume>
            <issue>3</issue>
            <fpage>259</fpage>
            <lpage>274</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s10577-005-2165-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15868420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22</p>
            </title>
            <aug>
               <au>
                  <snm>Kampa</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Yamanaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brubaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cawley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Piccolboni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Helt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tammana</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>331</fpage>
            <lpage>342</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">353210</pubid>
                  <pubid idtype="pmpid" link="fulltext">14993201</pubid>
                  <pubid idtype="doi">10.1101/gr.2094104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Dark matter in the genome: evidence of widespread transcription detected by microarray tiling experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Johnson</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Edwards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schadt</snm>
                  <fnm>EE</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>2</issue>
            <fpage>93</fpage>
            <lpage>102</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2004.12.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">15661355</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Biological function of unannotated transcription during the early development of Drosophila melanogaster</p>
            </title>
            <aug>
               <au>
                  <snm>Manak</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Dike</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sementchenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Biemar</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Long</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bell</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ghosh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Piccolboni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <issue>10</issue>
            <fpage>1151</fpage>
            <lpage>1158</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1875</pubid>
                  <pubid idtype="pmpid" link="fulltext">16951679</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project</p>
            </title>
            <aug>
               <au>
                  <cnm>The ENCODE Project Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>447</volume>
            <issue>7146</issue>
            <fpage>799</fpage>
            <lpage>816</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05874</pubid>
                  <pubid idtype="pmpid">17571346</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Okazaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Furuno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kasukawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Adachi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bono</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kondo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nikaido</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Osato</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>420</volume>
            <issue>6915</issue>
            <fpage>563</fpage>
            <lpage>573</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01266</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Integrative annotation of 21,037 human genes validated by full-length cDNA clones</p>
            </title>
            <aug>
               <au>
                  <snm>Imanishi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>O'Donovan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fukuchi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Koyanagi</snm>
                  <fnm>KO</fnm>
               </au>
               <au>
                  <snm>Barrero</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yamaguchi-Kabata</snm>
                  <fnm>Y</fnm>
               </au>
               <etal/>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2004</pubdate>
            <volume>2</volume>
            <issue>6</issue>
            <fpage>e162</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">393292</pubid>
                  <pubid idtype="pmpid" link="fulltext">15103394</pubid>
                  <pubid idtype="doi">10.1371/journal.pbio.0020162</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Experimental validation of the regulated expression of large numbers of non-coding RNAs from the mouse genome</p>
            </title>
            <aug>
               <au>
                  <snm>Ravasi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Pang</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Furuno</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Okunishi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fukuda</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ru</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Frith</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Gongora</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Grimmond</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Hume</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>11</fpage>
            <lpage>19</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1356124</pubid>
                  <pubid idtype="pmpid" link="fulltext">16344565</pubid>
                  <pubid idtype="doi">10.1101/gr.4200206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Non-coding RNA genes and the modern RNA world</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <issue>12</issue>
            <fpage>919</fpage>
            <lpage>929</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35103511</pubid>
                  <pubid idtype="pmpid" link="fulltext">11733745</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>An expanding universe of noncoding RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Storz</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <issue>5571</issue>
            <fpage>1260</fpage>
            <lpage>1263</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1072249</pubid>
                  <pubid idtype="pmpid" link="fulltext">12016301</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Challenging the dogma: the hidden layer of non-protein-coding RNAs in complex organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Mattick</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2003</pubdate>
            <volume>25</volume>
            <issue>10</issue>
            <fpage>930</fpage>
            <lpage>939</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.10332</pubid>
                  <pubid idtype="pmpid" link="fulltext">14505360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Non-coding RNAs: hope or hype?</p>
            </title>
            <aug>
               <au>
                  <snm>H&#252;ttenhofer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schattner</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Polacek</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>5</issue>
            <fpage>289</fpage>
            <lpage>297</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.03.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">15851066</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Non-coding RNAs: new players in eukaryotic biology</p>
            </title>
            <aug>
               <au>
                  <snm>Costa</snm>
                  <fnm>FF</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2005</pubdate>
            <volume>357</volume>
            <issue>2</issue>
            <fpage>83</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2005.06.019</pubid>
                  <pubid idtype="pmpid" link="fulltext">16111837</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Non-coding RNAs: Lost in translation?</p>
            </title>
            <aug>
               <au>
                  <snm>Costa</snm>
                  <fnm>FF</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2006</pubdate>
            <volume>386</volume>
            <issue>1-2</issue>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">17113247</pubid>
                  <pubid idtype="doi">10.1016/j.gene.2006.09.028</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>RNAs everywhere: Genome-wide annotation of structured RNAs</p>
            </title>
            <aug>
               <au>
                  <cnm>The Athanasius F Bompf&#252;newerer RNA Consortium</cnm>
               </au>
               <au>
                  <snm>Backofen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Flamm</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fried</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fritzsch</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hackerm&#252;ller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hertel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Missal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Prohaska</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Mosig</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Tanzer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Will</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Exp Zool B Mol Dev Evol</source>
            <pubdate>2007</pubdate>
            <volume>308</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>25</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jez.b.21130</pubid>
                  <pubid idtype="pmpid" link="fulltext">17171697</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Fast and reliable prediction of noncoding RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>2454</fpage>
            <lpage>2459</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548974</pubid>
                  <pubid idtype="pmpid" link="fulltext">15665081</pubid>
                  <pubid idtype="doi">10.1073/pnas.0409169102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Identification and classification of conserved RNA secondary structures in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lindblad-Toh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <issue>4</issue>
            <fpage>e33</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1440920</pubid>
                  <pubid idtype="pmpid" link="fulltext">16628248</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0020033</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Phenotypic consequences of promoter-mediated transcriptional noise</p>
            </title>
            <aug>
               <au>
                  <snm>Blake</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Balazsi</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kohanski</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Isaacs</snm>
                  <fnm>FJ</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Kuang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Cantor</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Walt</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2006</pubdate>
            <volume>24</volume>
            <fpage>853</fpage>
            <lpage>865</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molcel.2006.11.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">17189188</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Transcription of <it>bxd </it>noncoding RNAs promoted by trithorax represses <it>Ubx </it>in cis by transcriptional interference</p>
            </title>
            <aug>
               <au>
                  <snm>Petruk</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sedkov</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Riley</snm>
                  <fnm>KM</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schweisguth</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hirose</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jaynes</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Brock</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Mazo</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2006</pubdate>
            <volume>127</volume>
            <fpage>1209</fpage>
            <lpage>1221</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1866366</pubid>
                  <pubid idtype="pmpid" link="fulltext">17174895</pubid>
                  <pubid idtype="doi">10.1016/j.cell.2006.10.039</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>UTRdb and UTRsite: a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Mignone</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Grillo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Licciulli</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Iacono</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Liuni</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kersey</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Duarte</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Saccone</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pesole</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Database</issue>
            <fpage>D141</fpage>
            <lpage>D146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539975</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608165</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Structured RNAs in the ENCODE selected regions of the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Korbel</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Stocsits</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gruber</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Hackerm&#252;ller</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hertel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lindemeyer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Reiche</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tanzer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ucla</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wyss</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Antonarakis</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Denoeud</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lagarde</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Guig&#243;</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Reymond</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2007</pubdate>
            <volume>17</volume>
            <issue>6</issue>
            <fpage>852</fpage>
            <lpage>864</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1891344</pubid>
                  <pubid idtype="pmpid" link="fulltext">17568003</pubid>
                  <pubid idtype="doi">10.1101/gr.5650707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Mapping of conserved RNA secondary structures predicts thousands of functional non-coding RNAs in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Lukasser</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>H&#252;ttenhofer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2005</pubdate>
            <volume>23</volume>
            <fpage>1383</fpage>
            <lpage>1390</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1144</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Non-coding RNAs in Ciona intestinalis</p>
            </title>
            <aug>
               <au>
                  <snm>Missal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>Suppl 2</issue>
            <fpage>ii77</fpage>
            <lpage>ii78</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bti1113</pubid>
                  <pubid idtype="pmpid" link="fulltext">16204130</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Prediction of structured non-coding RNAs in the genomes of the nematodes Caenorhabditis elegans and Caenorhabditis briggsae</p>
            </title>
            <aug>
               <au>
                  <snm>Missal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Skogerb&#248;</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>J Exp Zoolog B Mol Dev Evol</source>
            <pubdate>2006</pubdate>
            <volume>306</volume>
            <issue>4</issue>
            <fpage>379</fpage>
            <lpage>392</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">16425273</pubid>
                  <pubid idtype="doi">10.1002/jez.b.21086</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Computational identification of non-coding RNAs in Saccharomyces cerevisiae by comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>McCutcheon</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>14</issue>
            <fpage>4119</fpage>
            <lpage>4128</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165953</pubid>
                  <pubid idtype="pmpid" link="fulltext">12853629</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg438</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Comparative analysis of structured RNAs in S. cerevisiae indicates a multitude of different functions</p>
            </title>
            <aug>
               <au>
                  <snm>Steigele</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Stocsits</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Nieselt</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>BMC Biol</source>
            <pubdate>2007</pubdate>
            <volume>5</volume>
            <fpage>25</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1914338</pubid>
                  <pubid idtype="pmpid" link="fulltext">17577407</pubid>
                  <pubid idtype="doi">10.1186/1741-7007-5-25</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Hou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Richards</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Weinstock</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Gibbs</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <issue>8</issue>
            <fpage>1034</fpage>
            <lpage>1050</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1182216</pubid>
                  <pubid idtype="pmpid" link="fulltext">16024819</pubid>
                  <pubid idtype="doi">10.1101/gr.3715005</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>NHGRI &#8211; Fruitfly Genome Sequencing</p>
            </title>
            <url>http://www.genome.gov/11008080</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Assembly/Alignment/Annotation of 12 related Drosophila species</p>
            </title>
            <url>http://rana.lbl.gov/drosophila/</url>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Evolution of genes and genomes on the Drosophila phylogeny</p>
            </title>
            <aug>
               <au>
                  <cnm>Drosophila 12 Genomes Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>450</volume>
            <issue>7167</issue>
            <fpage>203</fpage>
            <lpage>218</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06341</pubid>
                  <pubid idtype="pmpid" link="fulltext">17994087</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Novel TRF1/BRF target genes revealed by genome-wide analysis of Drosophila Pol III transcription</p>
            </title>
            <aug>
               <au>
                  <snm>Isogai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Takada</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tjian</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kele&#351;</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>EMBO J</source>
            <pubdate>2007</pubdate>
            <volume>26</volume>
            <fpage>79</fpage>
            <lpage>89</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.emboj.7601448</pubid>
                  <pubid idtype="pmpid" link="fulltext">17170711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Consensus folding of aligned sequences as a new measure for the detection of functional RNAs by comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Washietl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>342</volume>
            <fpage>19</fpage>
            <lpage>39</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.07.018</pubid>
                  <pubid idtype="pmpid" link="fulltext">15313604</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>The coding/non-coding overlapping architecture of the gene encoding the Drosophila pseudouridine synthase</p>
            </title>
            <aug>
               <au>
                  <snm>Riccardo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tortoriello</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Giordano</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Turano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Furia</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>BMC Mol Biol</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>15</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1821038</pubid>
                  <pubid idtype="pmpid" link="fulltext">17328797</pubid>
                  <pubid idtype="doi">10.1186/1471-2199-8-15</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Hairpins in a Haystack: recognizing microRNA precursors in comparative genomics data</p>
            </title>
            <aug>
               <au>
                  <snm>Hertel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <fpage>e197</fpage>
            <lpage>e202</lpage>
            <note>[ISMB 2006 contribution]</note>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl257</pubid>
                  <pubid idtype="pmpid" link="fulltext">16873472</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Diversity of microRNAs in human and chimpanzee brain</p>
            </title>
            <aug>
               <au>
                  <snm>Berezikov</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Thuemmler</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>van Laake</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Kondova</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bontrop</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cuppen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Plasterk</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2006</pubdate>
            <volume>38</volume>
            <fpage>1375</fpage>
            <lpage>1377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/ng1914</pubid>
                  <pubid idtype="pmpid" link="fulltext">17072315</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Evolution of small nucleolar RNAs in nematodes</p>
            </title>
            <aug>
               <au>
                  <snm>Zemann</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>op de Bekke</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kiefmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brosius</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schmitz</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>9</issue>
            <fpage>2676</fpage>
            <lpage>2685</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1464110</pubid>
                  <pubid idtype="pmpid" link="fulltext">16714446</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl359</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Organisation of the Caenorhabditis elegans small noncoding transcriptome: genomic features, biogenesis and expression</p>
            </title>
            <aug>
               <au>
                  <snm>Deng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Skogerb&#248;</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Fu</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>He</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Bai</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cui</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Jia</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2006</pubdate>
            <volume>16</volume>
            <fpage>20</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1356125</pubid>
                  <pubid idtype="pmpid" link="fulltext">16344563</pubid>
                  <pubid idtype="doi">10.1101/gr.4139206</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>snoSeeker: an advanced computational package for screening of guide and orphan snoRNA genes in the human genome</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>XC</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>ZP</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>YQ</fnm>
               </au>
               <au>
                  <snm>Qu</snm>
                  <fnm>LH</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>5112</fpage>
            <lpage>5123</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1636440</pubid>
                  <pubid idtype="pmpid" link="fulltext">16990247</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl672</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Pecan</p>
            </title>
            <url>http://www.ebi.ac.uk/~bjp/pecan/</url>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Pecan alignments of 12 Drosophila</p>
            </title>
            <url>http://www.sanger.ac.uk/Users/td2/pecan-CAF1</url>
         </bibl>
         <bibl id="B42">
            <title>
               <p>AAA-Wiki &#8211; Genome Alignments</p>
            </title>
            <url>http://rana.lbl.gov/drosophila/wiki/index.php/Alignment</url>
         </bibl>
         <bibl id="B43">
            <title>
               <p>MAVID multiple alignment server</p>
            </title>
            <aug>
               <au>
                  <snm>Bray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3525</fpage>
            <lpage>3526</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169029</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824358</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg623</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>MAVID: constrained ancestral alignment of multiple sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Bray</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Pachter</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>4</issue>
            <fpage>693</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">383315</pubid>
                  <pubid idtype="pmpid" link="fulltext">15060012</pubid>
                  <pubid idtype="doi">10.1101/gr.1960404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>RNAz</p>
            </title>
            <url>http://www.tbi.univie.ac.at/~wash/RNAz/</url>
         </bibl>
         <bibl id="B46">
            <title>
               <p><it>D. melanogaster </it>ncRNAs</p>
            </title>
            <url>http://bioinf.man.ac.uk/bergman/data/ncRNA/ncRNAreconciled271106.tgz</url>
         </bibl>
         <bibl id="B47">
            <title>
               <p>AAA-Wiki &#8211; Noncoding RNA</p>
            </title>
            <url>http://rana.lbl.gov/drosophila/wiki/index.php/Noncoding%20RNA</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p><it>D. melanogaster </it>&#8211; CDS annotations</p>
            </title>
            <url>http://rana.lbl.gov/~venky/AAA/freeze_20061030/protein_coding_gene/GLEANR/annotation/</url>
         </bibl>
         <bibl id="B49">
            <title>
               <p>2000 new <it>D. melanogaster </it>genes and coding exons</p>
            </title>
            <url>http://insects.eugenes.org/species/data/dmel-dspp/newgenes/</url>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <issue>3</issue>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">2231712</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Rfam: annotating non-coding RNAs in complete genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Database</issue>
            <fpage>D121</fpage>
            <lpage>D124</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540035</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>NONCODE: an integrated knowledge database of non-coding RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bai</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Skogerbo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cai</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bu</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Database</issue>
            <fpage>D112</fpage>
            <lpage>D115</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539995</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608158</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Noncoding RNA database</p>
            </title>
            <url>http://biobases.ibch.poznan.pl/ncRNA/</url>
         </bibl>
         <bibl id="B54">
            <title>
               <p>FlyBase: genomes by the dozen</p>
            </title>
            <aug>
               <au>
                  <snm>Crosby</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Goodman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Strelets</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Gelbart</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>35</volume>
            <issue>Database</issue>
            <fpage>D486</fpage>
            <lpage>D491</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669768</pubid>
                  <pubid idtype="pmpid" link="fulltext">17099233</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl827</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>FlyBase: anatomical data, images and queries</p>
            </title>
            <aug>
               <au>
                  <snm>Grumbling</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Strelets</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D484</fpage>
            <lpage>D488</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347431</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381917</pubid>
                  <pubid idtype="doi">10.1093/nar/gkj068</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>miRBase: the microRNA sequence database</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>2006</pubdate>
            <volume>342</volume>
            <fpage>129</fpage>
            <lpage>138</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16957372</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>tRNAscan-SE: A program for improved detection of transfer RNA genes in genomic sequence</p>
            </title>
            <aug>
               <au>
                  <snm>Lowe</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>955</fpage>
            <lpage>964</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146525</pubid>
                  <pubid idtype="pmpid" link="fulltext">9023104</pubid>
                  <pubid idtype="doi">10.1093/nar/25.5.955</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>SnoReport: Computational identification of snoRNAs with unknown targets</p>
            </title>
            <aug>
               <au>
                  <snm>Hertel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">17895272</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The RNA world of the nucleolus: two major families of small RNAs defined by different box elements with related functions</p>
            </title>
            <aug>
               <au>
                  <snm>Balakin</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fournier</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>1996</pubdate>
            <volume>86</volume>
            <issue>5</issue>
            <fpage>823</fpage>
            <lpage>834</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(00)80156-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">8797828</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Fast Folding and Comparison of RNA Secondary Structures</p>
            </title>
            <aug>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Fontana</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Bonhoeffer</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Tacker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Monatsh Chem</source>
            <pubdate>1994</pubdate>
            <volume>125</volume>
            <fpage>167</fpage>
            <lpage>188</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00818163</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>snoRNABase</p>
            </title>
            <url>http://www-snorna.biotoul.fr/index.php</url>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Inferring noncoding RNA families and classes by means of genome-scale structure-based clustering</p>
            </title>
            <aug>
               <au>
                  <snm>Will</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reiche</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Backofen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <issue>4</issue>
            <fpage>e65</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1851984</pubid>
                  <pubid idtype="pmpid" link="fulltext">17432929</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0030065</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>AAA-Wiki &#8211; Phylogeny</p>
            </title>
            <url>http://rana.lbl.gov/drosophila/wiki/index.php/Phylogeny</url>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Temporal patterns of fruit fly (<it>Drosophila</it>) evolution revealed by mutation clocks</p>
            </title>
            <aug>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2004</pubdate>
            <volume>21</volume>
            <fpage>36</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg236</pubid>
                  <pubid idtype="pmpid" link="fulltext">12949132</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Computational RNomics of Drosophilids &#8211; Supplement</p>
            </title>
            <url>http://www.bioinf.uni-leipzig.de/Publications/SUPPLEMENTS/07-001/</url>
         </bibl>
      </refgrp>
   </bm>
</art>

