<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-5-140</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A comprehensive comparison of comparative RNA structure prediction approaches</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Gardner</snm>
               <mi>P</mi>
               <fnm>Paul</fnm>
               <insr iid="I1"/>
               <email>PPGardner@bi.ku.dk</email>
            </au>
            <au id="A2">
               <snm>Giegerich</snm>
               <fnm>Robert</fnm>
               <insr iid="I2"/>
               <email>Robert@TechFak.Uni-Bielefeld.DE</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Evolutionary Biology, University of Copenhagen, Universitetsparken 15, 2100 Copenhagen &#216;, Denmark</p>
            </ins>
            <ins id="I2">
               <p>Faculty of Technology, University of Bielefeld, PO Box 10 01 31, 33501 Bielefeld, Germany</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2004</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>140</fpage>
         <url>http://www.biomedcentral.com/1471-2105/5/140</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">15458580</pubid>
               <pubid idtype="doi">10.1186/1471-2105-5-140</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>12</day>
               <month>8</month>
               <year>2004</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>30</day>
               <month>9</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>30</day>
               <month>9</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Gardner and Giegerich; licensee BioMed Central Ltd.</collab>
         <note>This is an open-access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>An increasing number of researchers have released novel RNA structure analysis and prediction algorithms for comparative approaches to structure prediction. Yet, independent benchmarking of these algorithms is rarely performed as is now common practice for protein-folding, gene-finding and multiple-sequence-alignment algorithms.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we evaluate a number of RNA folding algorithms using reliable RNA data-sets and compare their relative performance.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>We conclude that comparative data can enhance structure prediction but structure-prediction-algorithms vary widely in terms of both sensitivity and selectivity across different lengths and homologies. Furthermore, we outline some directions for future research.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <sec>
            <st>
               <p>Motivation</p>
            </st>
            <p>RNA, once considered a passive carrier of genetic information, is now known to play a more active role in nature. Many recently discovered RNAs are catalytic, for example RNase P which is involved in tRNA maturation and the self-splicing introns involved in mRNA maturation <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In addition, there is evidence that RNA based organisms were an essential step in the evolution of modern DNA-protein based organisms <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. The number of non-coding RNAs (ncRNA) in humans remains a mystery, but progress in this direction suggests the number of ncRNAs produced is comparable to the number of proteins <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Surprisingly, the number of protein coding genes does not correlate with our concept of "organism complexity", hence it has been hypothesised that control of gene expression via a combination of alternative splicing and non-coding RNAs are responsible for this, implying that the "Central Dogma" (RNA is transcribed from DNA and translated into protein) at least in higher eukaryotes is woefully inadequate <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>.</p>
            <p>A fundamental tenet of biology is that a stable tertiary structure is essential for biological function. In the case of RNA the secondary structure (the base-pair set for an RNA molecule) provides a scaffold for the tertiary structure <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Yet, the experimental determination of RNA structure remains difficult <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>; Researchers increasingly turn to computational methods. To date the most popular structure prediction algorithm is the Minimum Free Energy (MFE) method for folding a single sequence, this has been implemented by two packages: Mfold <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and RNAfold <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. However, there are several independent reasons why the accuracy of MFE structure prediction is limited in practise (see discussion below). Generally the best accuracy can be achieved by employing comparative methods <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This paper explores the extent to which this statement is true, given the current state of the art, for automated methods. There are currently three approaches to automated comparative RNA sequence analysis where the comparative study is supported by available algorithms (see plans A, B, and C, figure <figr fid="F1">1</figr>). A researcher following plan A may align sequences using standard multiple sequence alignment tools (i.e. ClustalW <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, t-coffee <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, prrn <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>,...), then use signals provided by structure neutral mutations for the inference of a consensus structure. Frequently the mutual-information measure is used for this <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. Recently tools have been developed that use a combination of MFE and a covariation-score <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> or probabilistic models compiled from large reference data-sets <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. However, a multiple-sequence-alignment step assumes a well conserved sequence. This is often not so with swiftly evolving ncRNA sequences, in this case incorrect sequence alignments can destroy any covariation signal.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>RNA analysis</p>
               </caption>
               <text>
                  <p><b>RNA analysis. </b>Current automated approaches to analysing homologous RNA sequences and structures usually follow one of three "plans". Plan A uses aligned sequences (usually produced by a standard multiple sequence alignment algorithm) to infer a consensus secondary structure from the evolutionary and energetic information contained in an alignment. This is a highly successful approach, but is limited to data-sets with sequence homology high enough for the alignment step to work yet divergent enough for detection of structurally consistent mutations. Plan B employs the "Sankoff algorithm" to simultaneously align and infer a consensus structure. This algorithm requires extreme amounts of memory and time. Plan C aligns RNA structures rather than sequences. This approach can be used in the rare situation where reliable structures are known. Representative algorithms which could be used for each plan are indicated within the figure.</p>
               </text>
               <graphic file="1471-2105-5-140-1"/>
            </fig>
            <p>This has motivated plan B, the use of the "Sankoff-Algorithm", an algorithm designed for the simultaneous alignment, folding and inference of a protosequence for a set of homologous structural RNA sequences <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The recurrences combine sequence alignment and Nussinov (maximal pairing) folding <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. The algorithm requires extreme computational resources (<it>O</it>(<it>n</it><sup>3<it>m</it></sup>) in time, and <it>O</it>(<it>n</it><sup>2<it>m</it></sup>) in space, where <it>n </it>is the sequence length and <it>m </it>is the number of sequences). Current implementations, Foldalign <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>, Dynalign <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and PMcomp <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, are restricted implementations of the Sankoff-algorithm which impose pragmatic limits on the size or shape of substructures.</p>
            <p>The final approach (plan C) applies when no helpful level of sequence conservation is observed. We may exclude the sequence alignment step, predict secondary structures for each sequence (or sub-group of sequences) separately, and directly align the structures. Because of the nested branching nature of RNA structures, these are adequately represented as trees. The concept of a similarity measurement via edit operations, a standard procedure for string comparisons, has been generalised to trees <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. Tree comparison and tree alignment models have been proposed <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr></abbrgrp> and implemented <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. The crucial point in plan C is the question whether the initial independent folding produces at least some structures that align well and hence give clues as to the underlying consensus structure &#8211; when one exists. An increasing number of researchers have recently released novel RNA structure analysis and prediction algorithms <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr><abbr bid="B37">37</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>. Yet few algorithms are tested upon standardised example data-sets, and often they are not compared with algorithms of the same pedigree. Algorithm evaluation is a regular event for protein structure prediction groups <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B47">47</abbr></abbrgrp>, gene-prediction <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr><abbr bid="B50">50</abbr></abbrgrp> and multiple sequence alignments <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. Based on reliable data-sets, we evaluate:</p>
            <p>&#8226; the viability of plan A, B, or C given tools available today, and</p>
            <p>&#8226; the relative performance of the tools used within each plan.</p>
            <p>We shall explicitly not evaluate computational efficiency, which (by necessity) differs widely between the tools. We also do not evaluate user friendliness (such as ease of installation and convenience of input or output formats, etc.) except for some remarks in the discussion section. Data-sets, documentation and relevant scripts are freely available from <url>http://www.binf.ku.dk/users/pgardner/bralibase/</url>.</p>
         </sec>
         <sec>
            <st>
               <p>Structural alignments and consensus structures</p>
            </st>
            <p>RNA secondary structure inference is the prediction of base-pairs which form the <it>in vivo </it>structure, given only the sequence of bases. Three general considerations apply: (1) The <it>in vivo </it>structure is not only predetermined by the primary structure, but also by cellular components such as chaperones, base modifications, and even by the transcriptional process itself. There are currently no computational tools available that assess these effects. (2) There are 'ribo-switches', whereby two or more functional structures exist for a given sequence <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp>. Such cases will fool all the tools studied here, because asking for a single consensus structure is simply the wrong question. On the other hand, the potential of conformational switching can be reliably detected <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>. (3) Structures may contain pseudo-knots, which are ignored by most current tools due to reasons of computational complexity and scarcity of these motifs. We do not consider pseudoknots here. However, several comparative approaches that include pseudoknots are currently under development, and certainly merit a comparative study of their own. Note that in an application scenario, we often do not know whether the considerations (1&#8211;3) apply.</p>
            <p>The comparative approach to structure inference is initiated from a set of homologous RNA sequences. Attempts are made to infer the <it>in-vivo </it>structure for each of them, as well as a consensus structure that captures the common, relevant structural aspects. The consensus structure per se does not exist <it>in vivo, </it>and so some mathematical rigour should be applied when working with this notion.</p>
            <p>An RNA sequence is a string over the RNA alphabet {<it>A</it>, <it>C</it>, <it>G</it>, <it>U</it>}. An RNA sequence <it>B </it>= <it>b</it><sub>1</sub>,...,<it>b</it><sub><it>n </it></sub>contains <it>n </it>bases, but no structural information. For comparative analysis, we are given the RNA sequences <it>B</it><sup>1</sup>,...,<it>B</it><sup><it>k</it></sup>. A secondary structure can be associated with each sequence <it>B </it>as a string <it>S </it>over the alphabet {"(",".",")"}, where parentheses in <it>S </it>must be properly nested, and <it>B </it>and <it>S </it>must be <it>compatible</it>: If (<it>s</it><sub><it>i</it></sub>, <it>s</it><sub><it>j</it></sub>) are matching parentheses, then (<it>b</it><sub><it>i</it></sub>, <it>b</it><sub><it>j</it></sub>) must be a legal base-pair. A base-pair is also denoted as <it>b</it><sub><it>i</it></sub>&#183;<it>b</it><sub><it>j</it></sub>, <it>s</it><sub><it>i</it></sub>&#183;<it>s</it><sub><it>j</it></sub>, or simply <it>i</it>&#183;<it>j </it>when the sequence is clear from the context. Both sequences and structures may be padded with a gap symbol "-", in order to align sequences and structures of different lengths. For compatibility of padded sequences and structures, we require that <it>b</it><sub><it>i </it></sub>= "-" iff s<sub><it>i </it></sub>= "-".</p>
            <p>A multiple <it>structural </it>alignment is a multiple sequence alignment of the 2 * <it>k </it>sequences, <it>B</it><sup>1</sup><it>, S</it><sup>1</sup>,..., <it>B</it><sup><it>k</it></sup>, <it>S</it><sup><it>k</it></sup>, such that <it>B</it><sup><it>i </it></sup>is compatible with <it>S</it><sub><it>i</it></sub>, and the following <it>consistency criterion </it>is satisfied: For any <it>S</it><sup><it>i </it></sup>and <it>S</it><sup><it>j </it></sup>and any base-pair <graphic file="1471-2105-5-140-i1.gif"/>, we have <graphic file="1471-2105-5-140-i2.gif"/> &#8800; ")" and <graphic file="1471-2105-5-140-i3.gif"/> &#8800; "(", and if <graphic file="1471-2105-5-140-i2.gif"/> = "(" or <graphic file="1471-2105-5-140-i3.gif"/> = ")", then <graphic file="1471-2105-5-140-i4.gif"/>. This means that if one partner of a base-pair in <it>S</it><sup><it>j </it></sup>is aligned to one partner in <it>S</it><sup><it>i</it></sup>, their partners must also be aligned to each other (see figure <figr fid="F2">2</figr> for an illustration).</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Alignment consistency</p>
               </caption>
               <text>
                  <p><b>Alignment consistency. </b>A violation of RNA structural alignment consistency is shown (left), together with a possible correction (right) &#8211; see text for details. Note that the inconsistent alignment may maximise sequence similarity, showing 3 mismatches versus 1 mismatch and 2 indels, with the concrete outcome depending on the gap scoring used. Inconsistency is the reason why it is dangerous to align two <it>structures </it>in string representation by a standard <it>sequence </it>alignment algorithm. Inconsistency is hard to detect by human eye inspection, and structural alignments in databases are not always free from consistency violations.</p>
               </text>
               <graphic file="1471-2105-5-140-2"/>
            </fig>
            <p>A <it>consensus structure C </it>for a multiple structural alignment can be determined by a majority rule approach using a threshold <it>p </it>with 0.5 &lt;<it>p </it>&#8804; 1. We define <it>c</it><sub><it>k </it></sub>= <it>x </it>if <graphic file="1471-2105-5-140-i5.gif"/> = <it>x </it>for at least <graphic file="1471-2105-5-140-i6.gif"/> sequences S<sup><it>i</it></sup>, and <it>c</it><sub><it>k </it></sub>= ".", otherwise. The latter definition is somewhat arbitrary; when relating the consensus structure to a particular sequence <it>B </it>in the alignment, we quietly turn those dots into gaps that align with gaps in <it>B</it>. For <it>p </it>= 1, we speak of a strict consensus, and the base-pair set in <it>C </it>is the intersection of the base-pairs in all <it>S</it><sup><it>i</it></sup>.</p>
            <p>A consensus structure exhibits base-pairs shared by the majority of structures under consideration, but has no sequence information associated with it. Each individual structure for a concrete sequence typically has additional base-pairs which are properly nested between those that constitute the consensus. Given a consensus structure <it>C </it>and a sequence <it>B </it>compatible with it, we can obtain a structure <it>refold</it>(<it>B</it>, <it>C</it>) which is the best thermodynamic folding for <it>B </it>that exhibits the base-pairs specified by <it>C</it>, plus additional ones that do not conflict with the former. Refolding can be achieved by <it>RNAfold </it>with option -<it>C </it>(this option is used to constrain the minimum free energy prediction with prior knowledge &#8211; such as known base-pairs, unpaired regions, etc). If <it>B </it>and <it>S </it>contain gaps, we remove them before refolding and reintroduce them in the same positions afterwards.</p>
            <p>Given a consistent structural alignment, it is easy to derive a consensus structure, as we can count majorities at individual positions. If the 5' partner of a base-pair passes the majority threshold, consistency implies that its 3' partner also makes it into the consensus.</p>
            <p>Given a consensus structure and a sequence alignment <it>without </it>structural information, we can approximate a structural alignment by computing <it>S</it><sup><it>i </it></sup>= <it>refold</it>(<it>B</it><sup><it>i</it></sup>, <it>C</it>). We call this structural alignment reconstruction. While all <it>S</it><sup><it>i </it></sup>will be consistent with <it>C</it>, and with each other as far as the base-pairs of <it>C </it>are concerned, they may be inconsistent for the base-pairs introduced in refolding. This is tolerable, since if we trust the consensus to capture the relevant common structural features, there is no need to require that all members of a family agree upon extra-consensus features.</p>
            <p>We note in passing that it seems worthwhile to study the conditions under which consensus derivation and structural alignment reconstruction are mutually inverse operations, but such theoretical issues are outside our present scope.</p>
         </sec>
         <sec>
            <st>
               <p>Interpreting database information</p>
            </st>
            <p>While the plans A, B and C we are about to evaluate strive to find a good consensus structure from sequence data, the "truth" available to us comes in a different form. Structural databases only convey a <it>consensus by example</it>: They provide a reference sequence, say <it>B</it><sup>1</sup>, with an experimentally proved structure S<sup>1</sup>, and provide a multiple sequence alignment of B<sup>1</sup>, <it>S</it><sup>1 </sup>and additional sequences B<sup>2</sup>,..., <it>B</it><sup><it>n </it></sup>in the family under consideration. The sequence alignment is chosen to exhibit structural similarities between the reference structure and the other family members, but in general, we do not know the precise model of achieving similarity, nor do we know whether this model has been solved to optimality.</p>
            <p>One consequence of this situation would be to conclude that the reference structure is the only reliable anchor point available to us for evaluation. Comparative analysis tools would then be evaluated by the capacity to predict this particular structure by using family information. This would be a meaningful way to proceed, however, the effect of structural homogeneity within a sequence family would go unmeasured, and so would the difficulty or success of exploiting it. We therefore proceed in a different way which we call <it>consensus reconstruction</it>.</p>
            <p>The reference structure <it>S</it><sup>1 </sup>need not be compatible with any <it>B</it><sup><it>i </it></sup>except for <it>i </it>= 1. However, we can still compute <it>S</it><sup><it>i </it></sup>:= <it>refold</it>(<it>B</it><sup><it>i</it></sup>, <it>S</it><sup>1</sup>) by treating bases as unpaired where they violate compatibility. (This is also achieved with <it>RNAfold</it>, option -C.) What we obtain in this way is a reconstructed structural alignment, which will be consistent to the extent that the reference structure indeed describes the common structural features, and to the extent that the database sequence alignment reflects these. In all our test cases, this alignment was overall consistent, an indicator that the families and their structural features are in fact well defined. From this alignment, we derive a consensus structure as explained above using a threshold <it>p </it>= 0.5, which will serve as the standard of truth in our evaluation.</p>
            <p>One may argue that our approach to reconstruct the truth is somewhat ad-hoc and should be replaced by a more systematic method. However, this is what the tools we evaluate try to achieve, and we should not add one of our own as the standard of truth. Hence, our consensus reconstruction is designed to stay as close as possible to the database information.</p>
         </sec>
         <sec>
            <st>
               <p>Caveats</p>
            </st>
            <p>Results of observations based on the above measures must be interpreted with care. We list a number of caveats that must be kept in mind when proceeding to the subsequent sections.</p>
            <sec>
               <st>
                  <p>Use of defaults</p>
               </st>
               <p>In all tests, one could possibly obtain better predictions by tuning the program's parameters. We felt that it would be inappropriate to do so, since in the evaluation, we know the correct result and could use this knowledge in the tuning, whereas in a true application context, one does not have such guidance. Hence we used the recommended defaults in all cases.</p>
            </sec>
            <sec>
               <st>
                  <p>Tool abuse</p>
               </st>
               <p>In some cases we apply a tool to data where we know that the model structure has features not recognised by the tool. An example is a structure with multiloops or pseudoknots, searched for with a tool that explicitly excludes such structures. We permit such cases, because again, in a true application context one does not know whether the tool is appropriate or not, and it is still of interest to see how close to the correct structure one can get.</p>
            </sec>
            <sec>
               <st>
                  <p>Standard of truth</p>
               </st>
               <p>We take for granted the correctness of structural alignments taken from the literature, and the consensus reconstructed thereof. Should one of the tested algorithms produce a result that is actually better (closer to the functionally important structure), it may be penalised. Also, we do not consider a large number of data-sets here, it is possible that performance of some algorithms improves on a different selection of data-sets.</p>
            </sec>
            <sec>
               <st>
                  <p>Tools improve</p>
               </st>
               <p>Our data reflect the state of the art in 2004. Most of the tools tested are very recent, and their authors are still improving them. Hence, not all observations will remain reproducible. In fact, we hope this study helps to obtain better results in the future.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <p>We have compiled RNA sequence alignments consisting of up to 11 sequences derived from reliable sources (see table <tblr tid="T1">1</tblr>). These have been used to test several RNA analysis packages. Each alignment contains at least one reference sequence <it>B</it><sup>1 </sup>with (preferably) an experimentally verified secondary structure <it>S</it><sup>1</sup>. Experimental verification of a structure may be from a variety of sources: x-ray crystallography, NMR, enzymatic structure probing or phylogenetic inference. A comparison of phylogenetic with x-ray crystallographic structures has shown the phylogenetic predictions of rRNA to be very reliable (sensitivity > 97%) <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. This data specifies a "consensus by example", as explained above, to which our consensus reconstruction was applied to obtain the "true" consensus.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Characteristics and sources of the four test data-sets, columns from left to right show: data-set, lengths, mean pair-wise sequence similarity (mean pair-wise Kimura "2-parameter" distance is shown in parentheses [109]), the number of sequences in each alignment and the alignment and structure sources are given.</p>
            </caption>
            <tblbdy cols="8">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c cspan="6" ca="center">
                     <p>
                        <b>Test data-set characteristics and sources</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="8">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Data-set</p>
                  </c>
                  <c ca="center">
                     <p>length</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>mean pairwise seq. identity</p>
                  </c>
                  <c cspan="2" ca="center">
                     <p>Number of Sequences</p>
                  </c>
                  <c ca="center">
                     <p>Alignment source</p>
                  </c>
                  <c ca="center">
                     <p>Structure source</p>
                  </c>
               </r>
               <r>
                  <c cspan="8">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>High</p>
                  </c>
                  <c ca="center">
                     <p>Med.</p>
                  </c>
                  <c ca="center">
                     <p>High</p>
                  </c>
                  <c ca="center">
                     <p>Med.</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>E. coli </it>LSU rRNA</p>
                  </c>
                  <c ca="center">
                     <p>2904</p>
                  </c>
                  <c ca="center">
                     <p>88.1 (0.12)</p>
                  </c>
                  <c ca="center">
                     <p>72.0 (0.35)</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>Wuyts <it>et al</it>., (2001)</p>
                  </c>
                  <c ca="center">
                     <p>Cannone <it>et al</it>., (2002)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>E. coli </it>SSU rRNA</p>
                  </c>
                  <c ca="center">
                     <p>1542</p>
                  </c>
                  <c ca="center">
                     <p>90.7 (0.08)</p>
                  </c>
                  <c ca="center">
                     <p>80.0 (0.21)</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>Wuyts <it>et al</it>., (2002)</p>
                  </c>
                  <c ca="center">
                     <p>Cannone <it>et al</it>., (2002)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>E. coli </it>RNase P</p>
                  </c>
                  <c ca="center">
                     <p>377</p>
                  </c>
                  <c ca="center">
                     <p>81.5 (0.09)</p>
                  </c>
                  <c ca="center">
                     <p>67.1 (0.41)</p>
                  </c>
                  <c ca="center">
                     <p>9</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>Brown, (1999)</p>
                  </c>
                  <c ca="center">
                     <p>Brown, (1999)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p><it>S. cerevisiae </it>tRNA-PHE</p>
                  </c>
                  <c ca="center">
                     <p>73</p>
                  </c>
                  <c ca="center">
                     <p>84.4 (0.19)</p>
                  </c>
                  <c ca="center">
                     <p>60.0 (0.71)</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>11</p>
                  </c>
                  <c ca="center">
                     <p>Griffiths-Jones <it>et al</it>., (2003)</p>
                  </c>
                  <c ca="center">
                     <p>Sundaralingham &amp; Rao, (1975)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>To avoid results bias, we constructed test alignments, with corresponding phylogenies that, wherever possible, were free of highly similar clades. In addition, we endeavoured to ensure that the reference sequence was central to the phylogeny, or more specifically, not an out group. To meet these requirements, sequences from large data-sets were sorted into high-similarity and medium-similarity groups (with respect to the model sequence), from which maximum-likelihood phylogenies <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> were constructed. These were pruned until the desired size and topology was achieved. For each data-set two sequence alignments were constructed, one of high sequence identity (approximately 90&#8211;99%) and the other more diverse data-set of medium sequence identity (approximately 70&#8211;90%).</p>
         <p>Our data-sets are quite diverse and must for the purposes of this study be considered difficult to analyse in structural terms. The shape of ribosomal RNA is believed to be influenced by interaction with ribosomal proteins. The shape of RNase P shows relatively little sequence and structure conservation, and furthermore, it contains pseudoknots which are generally excluded by prediction algorithms. Transfer RNAs are known to be a hard case for thermodynamic folding, primarily due to the propensity of modified bases which influence structure formation. All tools tested may perform better upon less complex data-sets, but the purpose of this study is not to show how good the algorithms are but to compare relative performance when prediction is difficult.</p>
         <sec>
            <st>
               <p>Performance Measures</p>
            </st>
            <p><it>Sensitivity </it>(<it>X</it>) and <it>selectivity </it>(<it>Y</it>) are common measures for determining the accuracy of prediction methods <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Selectivity is also known as the "specificity" <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> and "positive predictive value" <abbrgrp><abbr bid="B64">64</abbr><abbr bid="B65">65</abbr></abbrgrp>. We use slightly modify versions of the standard definitions of <it>X </it>and <it>Y </it>for examining RNA secondary structure prediction:</p>
            <p>
               <graphic file="1471-2105-5-140-i7.gif"/>
            </p>
            <p>where <it>TP </it>is the number of "true positives" (correctly predicted base-pairs), <it>FN </it>is the number of "false negatives" (base-pairs in the reference structure that were not predicted) and <it>FP </it>is the number of "false positives" (in-correctly predicted base-pairs). However, not all <it>FP </it>base-pairs are equally false! We classify <it>FP </it>base-pairs as either <it>inconsistent</it>, <it>contradicting </it>or <it>compatible. </it>Predicted base-pairs which conflict with a base-pair in the reference structure are labelled <it>inconsistent </it>(i.e. <it>i</it>&#183;<it>j </it>is predicted where either <it>i</it>&#183;<it>k </it>and/or <it>h</it>&#183;<it>j </it>are paired in the reference structure (<it>h </it>&#8800; <it>i </it>and <it>j </it>&#8800; <it>k</it>)). Predicted base-pairs (<it>i</it>&#183;<it>j</it>) which are non-nested with respect to the reference structure are labelled <it>contradicting </it>(i.e. there exists base-pairs <it>k</it>&#183;<it>l </it>in the reference satisfying <it>k </it>&lt;<it>i </it>&lt;<it>l </it>&lt;<it>j</it>). Note that some base-pairs may both contradict and be inconsistent with the reference structure. Predicted base-pairs which are neither true positive, contradicting or inconsistent are labelled <it>compatible </it>and can be considered neutral with respect to algorithm accuracy. Hence these are subtracted in the selectivity evaluation, their number is <it>&#958; </it>in the above equation. It is of interest to note that the base-pair metric <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr></abbrgrp> between the reference and predicted structures <it>d</it><sub><it>BP</it></sub>(<it>S</it><sub><it>ref</it></sub>, <it>S</it><sub><it>pred</it></sub>) is the sum of <it>FN </it>and <it>FP</it>, and hence is different from the measure used here.</p>
            <p>A measure combining both selectivity and sensitivity is useful for ranking algorithms. For this we employ the <it>Matthews correlation coefficient </it><abbrgrp><abbr bid="B63">63</abbr></abbrgrp> defined below:</p>
            <p>
               <graphic file="1471-2105-5-140-i8.gif"/>
            </p>
            <p><it>MCC </it>ranges from -1 for extremely inaccurate (<it>TP </it>= <it>TN </it>= 0) to 1 for very accurate predictions (<it>FP </it>- <it>&#958; </it>= <it>FN </it>= 0). When comparing RNA structures <it>TN </it>= 0 occurs only in extreme examples, hence <it>MCC </it>generally ranges from 0 to 1. Furthermore, for the specific case of RNA structure comparisons, <it>MCC </it>can be approximated by the arithmetic-mean or geometric-mean of <it>X </it>and <it>Y </it><abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Single sequence methods</p>
            </st>
            <p>The accuracy of the MFE single sequence method has been evaluated elsewhere and was found to have an accuracy of 73% when averaged over many different RNAs and "base-pair slippage" was tolerated in the evaluation <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. A recent and more stringent work found MFE predictions had a sensitivity of 56% and selectivity of 46% for RNase P, SRP and tmRNA structures <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Similar values are also reported by the "Gutell Lab" for tRNA and rRNA structures <abbrgrp><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>. We need to clarify the accuracy of this method on the particular data-sets we employ here for comparison with the multi-sequence methods. After all, if MFE folding worked perfectly for our given data-sets, there would be no need to resort to comparative methods.</p>
         </sec>
         <sec>
            <st>
               <p>Mfold &amp; RNAfold</p>
            </st>
            <p>Mfold <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B72">72</abbr></abbrgrp> and RNAfold <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B73">73</abbr></abbrgrp> both implement the Zuker-Stiegler algorithm for computing minimal free energy (MFE) structures assuming a "nearest neighbour model" and using empirical estimates of thermodynamic parameters for neighbouring interactions and loop entropies to score structures. The algorithm is <it>O</it>(<it>n</it><sup>3</sup>) in time and <it>O</it>(<it>n</it><sup>2</sup>) in memory where <it>n </it>is the sequence length. Both employ the same thermodynamic parameters <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. Hence, differences in the predictions are generally minor and are the result of slightly different implementations. There appears to be no significant differences in terms of algorithm accuracy.</p>
            <p>The sensitivity, selectivity and correlation of MFE methods (for the four data-sets considered here) ranged from 22&#8211;63%, 20&#8211;60% and 0.18&#8211;0.61 respectively (See figures <figr fid="F3">3</figr> &amp;<figr fid="F4">4</figr>). The low accuracies (22%, 20% &amp; 0.18) are due to an alternative long-stem conformation of <it>S. cerevisiae </it>tRNA-PHE which the free energy methods favour. Mfold infers 'suboptimal' structures by calculating minimum free energy structures with the restriction that every possible base-pair is forced in a one-by-one fashion. Unique structures are then ranked by energy. Investigating the top two suboptimal structures from Mfold resulted in an overall increase in the range of sensitivity, selectivity and correlation, 22&#8211;69%, 20&#8211;67% and 0.18&#8211;0.68 respectively. The predictions shown here are used to illustrate the potential advantages of using comparative analyses over single sequence methods.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Prediction correlation with reality</p>
               </caption>
               <text>
                  <p><b>Prediction correlation with reality. </b>Matthews correlation coefficient versus the logarithm of the sequence length for a range of different ncRNAs and structure prediction algorithms. Inset <b>A </b>shows accuracies of thermodynamic single sequence prediction algorithms. Insets <b>B </b>and <b>C </b>shows accuracies of comparative methods on the high and medium similarity data-sets respectively.</p>
               </text>
               <graphic file="1471-2105-5-140-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>ROC plots</p>
               </caption>
               <text>
                  <p><b>ROC plots. </b>We use ROC (receiver operating characteristic) plots to simultaneously display both sensitivity and selectivity for plans A, B and C respectively. Accuracies of the MFE methods (MFold, RNAFold and SFold) are shown in each plot to provide a base-line. Points on the line <it>X </it>= <it>Y </it>are as sensitive as they are selective, points below this line indicates a greater selectivity, points above indicate greater sensitivity. Points below the line <it>X </it>= 100 - <it>Y </it>are worse than "random" assignments; Assuming base-pairs are independent of each other (this is false for base-pairing). Points in the top right corner are "perfect" predictions. Interestingly many algorithms form characteristic clusters in these plots. Where the variance is sufficiently small these have been indicated with a closed curve.</p>
               </text>
               <graphic file="1471-2105-5-140-4"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Sfold</p>
            </st>
            <p>Sfold <abbrgrp><abbr bid="B41">41</abbr><abbr bid="B74">74</abbr></abbrgrp> represents another energy-based single-sequence folding algorithm. For a given RNA, Sfold stochastically samples all possible structures in the Boltzmann ensemble of secondary structures using conditional probabilities which are computed with the partition function <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Clustering techniques could then be used to obtain representative ' likely ' structures. Instead, the current implementation samples 1000 structures, sorts these by energy, the minimum and maximum energy structures are computed and the energy range divided into 10 equally sized energy blocks. The minimum energy structure from each block is returned with ranking 1 to 10. We consider the top 3 structures labelled 'Sfold (1&#8211;3)'. In terms of accuracy, the results are very similar to those of the Zuker-Stiegler single sequence methods, although with a slightly higher variance (See figures <figr fid="F3">3</figr> &amp;<figr fid="F4">4</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>Intrinsic limits of single sequence methods</p>
            </st>
            <p>There are systematic limits to the accuracy of single sequence prediction methods. The thermodynamics may not be accurate, as some parameters are extrapolated and parameter measuring conditions <it>in vitro </it>are different from <it>in vivo </it>conditions. Indeed the thermodynamic model itself is an estimate of the real physics of RNA folding. Also, many bases of structural RNAs are chemically modified by sugar methylation, pseudo-uridine, dihydrouracil, etc, these are generally ignored by these methods. Kinetics of folding are also ignored. Given only a single sequence, we have no way to distinguish base-pairs and structure elements important for the consensus from those that are peculiar for the given sequence. Finally, some functional RNAs have bistable structures, while in others, the structure is irrelevant, hence not conserved, and the optimal MFE structure is biologically meaningless. This is some justification of why researchers proceed to comparative methods.</p>
         </sec>
         <sec>
            <st>
               <p>Comparative method: alignment folding (plan A)</p>
            </st>
            <p>To simulate realistic RNA folding studies we use ClustalW <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> to re-align each of our test data-sets, then folded these using each of the methods mentioned below. The resultant predicted structures were then compared to our reconstructed consensus structures.</p>
            <sec>
               <st>
                  <p>RNAalifold</p>
               </st>
               <p>RNAalifold <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B76">76</abbr></abbrgrp> implements an extension of the Zuker-Stiegler algorithm for computing a consensus structure from RNA alignments. The algorithm computes an averaged energy matrix <graphic file="1471-2105-5-140-i9.gif"/> (where <it>N </it>is the number of sequences in the alignment) and a covariation score matrix, augmented with penalties for inconsistent sequences, <it>B</it><sub><it>ij</it></sub>. A standard trace-back procedure is performed to recover a consensus structure with the optimal sum-of-average-energy-and-covariation-score <graphic file="1471-2105-5-140-i10.gif"/>. The algorithm is remarkably efficient <it>O</it>(<it>N</it>&#183;<it>n</it><sup>2 </sup>+ <it>n</it><sup>3</sup>) in time and <it>O</it>(<it>n</it><sup>2</sup>) in memory.</p>
               <p>The sensitivity, selectivity and correlation of the RNAalifold predictions ranged from 57&#8211;91%, 57&#8211;100% and 0.57&#8211;0.95 respectively, showing a significant increase in the accuracy measures when compared to the MFE-methods.</p>
            </sec>
            <sec>
               <st>
                  <p>Pfold</p>
               </st>
               <p>Pfold implements a "stochastic context free grammar" (SCFG) designed to produce a "prior probability distribution of RNA structures" for an RNA alignment input <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr><abbr bid="B77">77</abbr></abbrgrp>. A maximum-likelihood phylogeny is used to weight posterior probabilities computed from large reference data-sets.</p>
               <p>The algorithm is generally accurate and efficient. Hence, the over-all sensitivity, selectivity and correlation of the Pfold predictions ranged from 0&#8211;100%, 0&#8211;100% and 0.0&#8211;1.0, respectively. But removing those points where Pfold predictions were empty structures (LSU rRNA (H &amp; M) and SSU rRNA (M), see figure <figr fid="F3">3</figr>), the prediction accuracies ranged from 66&#8211;100%, 89&#8211;100% and 0.77&#8211;1.0, respectively. The zeros are due to 'under-flow errors', a solution is presently under construction by the authors (pers. commun. Bjarne Knudsen).</p>
            </sec>
            <sec>
               <st>
                  <p>ILM</p>
               </st>
               <p>ILM (iterated loop matching) is one of the few comparative RNA folding algorithms which can return pseudo-knotted structures <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B78">78</abbr></abbrgrp>. It uses a combination of thermodynamic and mutual information content scores <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> to produce a secondary structure. All possible stems ("small" internal loops and bulges inclusive) are generated and ranked according to a combination of thermodynamic and mutual-information scores. The stem with maximal score is selected, scores are updated and stems conflicting the selection removed, then the next highest scoring stem is selected. This algorithm is iterated until no stems remain. ILM generally ranked low in terms of selectivity and was not as sensitive as either RNAalifold or Pfold on the high similarity data, but did improve on the medium similarity data-sets (see figure <figr fid="F3">3</figr>). The over-all sensitivity, selectivity and correlation of ILM predictions ranged from 44&#8211;100%, 37&#8211;75% and 0.40&#8211;0.86, respectively. To ensure the low selectivity values weren't due to the reference-structure being pseudo-knot free we re-evaluated ILM with reference-structures replete with pseudo-knots. The new sensitivity, selectivity and correlation values ranged from 31&#8211;100%, 26&#8211;75% and 0.29&#8211;0.86, in fact evaluating with pseudo-knotted structures did little to increase ILM selectivity. But, keep in mind that the sensitivity of the other (non-knot-inclusive) methods <it>must </it>decrease when a significant proportion of the true base-pairs are engaged in pseudo-knots.</p>
               <p>The inclusion of pseudo-knots prediction vastly increases the number of possible secondary structures, this is why they are generally excluded from exhaustive folding algorithms. In addition, there is a general lack of experimentally derived thermodynamic parameters which include pseudo-knots. ILM is a method still under development, hence the performance may improve once pseudo-knots can be more accurately modelled.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Comparative method: simultaneous sequence alignment and folding (plan B)</p>
            </st>
            <sec>
               <st>
                  <p>Sankoff</p>
               </st>
               <p>The Sankoff algorithm is a dynamic programming approach to obtain a common base-pair list with maximal sum of base-pair weights. Basically, this is a merger of sequence alignment and Nussinov <abbrgrp><abbr bid="B79">79</abbr></abbrgrp> (maximal-pairing) folding dynamic programming methods <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Sankoff's algorithm can be used to obtain both an alignment and consensus structure. Full implementations of the "Sankoff algorithm" for the solution of simultaneous RNA folding, alignment and protosequence problems have proven too computationally taxing (<it>O</it>(<it>n</it><sup>3<it>m</it></sup>) in time, and <it>O</it>(<it>n</it><sup>2<it>m</it></sup>) in space for sequence length <it>n </it>and <it>m </it>sequences) to be practical <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Hence, three restricted versions of this algorithm have been implemented. These are Foldalign <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>, Dynalign <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and recently PMcomp has also been published <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Carnac <abbrgrp><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr></abbrgrp> is another recent innovation designed to detect conserved stems in unaligned sequences, we include it here as a relative of the Sankoff approach.</p>
            </sec>
            <sec>
               <st>
                  <p>Foldalign</p>
               </st>
               <p>Foldalign <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> can be interpreted as "a mixture of local alignment and maximum number of base-pairs algorithm" <abbrgrp><abbr bid="B28">28</abbr><abbr bid="B82">82</abbr></abbrgrp>. A combination of "clustal" <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and "consensus" <abbrgrp><abbr bid="B83">83</abbr></abbrgrp> heuristics are used to build multiple sequence alignments from pair-wise comparisons. Restricting maximum motif size (for this study 50 was used) and forbidding bifurcating structures (multi-loops) reduces the time complexity to <it>O</it>(<it>n</it><sup>4</sup><it>N</it>) in time (where <it>N </it>is the number of sequences and <it>n </it>is the length of the longest sequence). A simple match-based scoring scheme is used to rank putative conserved structure elements.</p>
               <p>The Tool Abuse Caveat generally applies to the tool Foldalign as all of our data-sets contain multi-loops. The use of Foldalign for the prediction of global, multi-looped secondary structures is not recommended-as Foldalign is specifically designed for the location of short regulatory motifs such as IREs <abbrgrp><abbr bid="B84">84</abbr></abbrgrp> where the motifs are only related at the level of (non-bifurcating) structure and not at the level of sequence. Hence the relatively poor sensitivity, selectivity and correlation, which ranged from 5&#8211;24%, 23&#8211;36% and 0.11&#8211;0.27 respectively, for our test data-sets.</p>
            </sec>
            <sec>
               <st>
                  <p>Dynalign</p>
               </st>
               <p>Dynalign <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B85">85</abbr></abbrgrp> is a pairwise implementation of the Sankoff algorithm, which uses a "full energy model" to locate a common low energy structure (including multi-loops) and align two structural RNAs. The computational complexity of the full Sankoff is reduced by restricting the difference in the indices <it>i </it>and <it>j </it>of aligned nucleotides (where <it>i </it>indexes positions in sequence 1 and <it>j </it>indexes sequence 2) to be less than <it>M</it>. In addition, Dynalign uses the same method employed by MFold to reduce the conformation space, by limiting the size of internal loops <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B86">86</abbr></abbrgrp>. The complexity is thus reduced to <it>O</it>(<it>n</it><sup>3</sup><it>M</it><sup>3</sup>).</p>
               <p>The current Dynalign implementation is restricted to pair-wise sequence comparisons. Rather than compute all <graphic file="1471-2105-5-140-i11.gif"/> pairwise foldings we compared all sequences with the reference structure. Due to the computational expense of this algorithm it could only be used to predict tRNA and RNase P structures. Dynalign performed well on the tRNA, medium sequence homology data-set (sensitivity, selectivity and correlation of 94%, 95% and 0.94 respectively, when averaged over all pairwise alignments with the reference sequence). With this one high-scoring point removed, averaged sensitivity, selectivity and correlation values ranged from 32&#8211;54%, 33&#8211;54% and 0.32&#8211;0.54 respectively. Comparing the performances of MFold and Dynalign showed that MFold performance was always superior on the RNase P data-set, Dynalign however did much better on the shorter and more diverse tRNA sequences. Performance gains could be made by investing more computer time and refolding RNase P with larger ' maximum insert size', which was set to 10 during this study. The use of Dynalign on the RNase P data-sets in this study is therefore a case of tool-abuse, as the parameters recommended by the authors of Dynalign were not used (to ensure calculations completed in reasonable time).</p>
            </sec>
            <sec>
               <st>
                  <p>Carnac</p>
               </st>
               <p>The Carnac algorithm, as mentioned previously, is not strictly an implementation of the Sankoff algorithm. A set of filters are employed through which sets of sequences are passed in a pair-wise fashion <abbrgrp><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr><abbr bid="B87">87</abbr></abbrgrp>. Sequences are scanned for stems and "high similarity" regions of sequences (dubbed "anchor points") are identified, a dynamic program is used to select conserved stems using anchor point and covariation information.</p>
               <p>The Carnac algorithm was remarkably selective at base-pair predictions. However, the sensitivity of the algorithm was generally low, although when evaluated with the correlation coefficient it is comparable to RNAalifold and Pfold. Sensitivity, selectivity and correlation values for Carnac predictions ranged from 45&#8211;71%, 92&#8211;100% and 0.65&#8211;0.82 respectively. The sensitivity of Carnac can be increased by constraining a minimum free energy fold (i.e. with "RNAfold-C") with the Carnac predicted structure, but this cost in terms of selectivity. On average this increased the sensitivity by 22.5, decreased the selectivity by 17.2 and slightly increased the correlation by 0.05.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Alignment of predicted structures (plan C)</p>
            </st>
            <sec>
               <st>
                  <p>RNA forester</p>
               </st>
               <p>RNAforester <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B88">88</abbr></abbrgrp> implements the tree alignment model. In contrast to approaches that produce only a similarity value, but no underlying alignment, it computes pairwise alignments of two input structures. RNAforester can produce either global or local alignments; we used the global mode. A structure alignment is itself a branching (tree-like) structure; the set of matched base-pairs can be derived from it and evaluated as with the other approaches.</p>
               <p>We used the tRNA and RNase P data-sets and generated structure single sequence predictions with RNAfold. All predicted structures were aligned pairwise and a neighbour-joining approach used to cluster and align high similarity sequences and structure profiles. The highest scoring alignment was used to derive a predicted consensus that was evaluated against the consensus tRNA model structures. Sensitivity, selectivity and correlation ranges of consensus structures computed from the highest scoring RNAforester alignments were 29&#8211;67%, 27&#8211;67% and 0.26&#8211;0.66 respectively. It seems likely that much of the inaccuracy of this approach is due to MFE structure prediction, however the structure-clustering approach frequently separates mis-folded MFE predictions from the accurate folds.</p>
            </sec>
            <sec>
               <st>
                  <p>MARNA</p>
               </st>
               <p>The MARNA algorithm <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B89">89</abbr></abbrgrp> proceeds by constructing edge weights between nucleotides in a pairwise fashion. Weights are structure-enhanced-sequence-similarities transformed from edit distances proposed by Zhang <abbrgrp><abbr bid="B90">90</abbr></abbrgrp>. Phase two pipes the set of alignment edges into t-coffee <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> for multiple alignment production. The resultant alignments are not strictly structural alignments in the sense defined above. Rather, these are sequence alignments influenced by structure.</p>
               <p>Sensitivity, selectivity and correlation values of consensus structures computed from MARNA alignments of MFE structures ranged from 29&#8211;52%, 32&#8211;84% and 0.30&#8211;0.65 respectively. We also tried trimming high entropy base-pairs from the MFE predictions using the bound <it>Q</it><sub><it>ij </it></sub>> 1, where <graphic file="1471-2105-5-140-i12.gif"/>, <graphic file="1471-2105-5-140-i13.gif"/>, and <it>p</it><sub><it>ij </it></sub>are pair-probabilities computed using McCaskilPs partition function <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. The new accuracy ranges were 29&#8211;71%, 92&#8211;100% and 0.53&#8211;0.84. A related approach for trimming of low probability was recently shown to improve the selectivity of MFE predictions <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. MARNA is generally less dependant upon the accuracy of the input structures hence performs slightly better with the poorly predicted tRNA structures than RNAforester.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>We have evaluated three different strategies for comparative structure prediction, and altogether eight tools (not counting the single sequence methods). The results of which are summarised in figures <figr fid="F3">3</figr> &amp;<figr fid="F4">4</figr>. A surprising discovery given that the test data-sets are so diverse is that algorithm specific clusters formed in sensitivity versus selectivity scatter plots, indicating algorithm-specific eccentricities. A number of algorithms which might have been evaluated here have been excluded, primarily due to the heavy computational costs of the various implementations on our longer data-sets. We favoured recent algorithms which could be compiled on modern computers and those with input and output which could be simply dealt with (for example returning dot-bracket <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B37">37</abbr><abbr bid="B91">91</abbr></abbrgrp> or tabular-connect type formats <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B29">29</abbr><abbr bid="B41">41</abbr></abbrgrp>, rather than coordinates and lengths of stacks or graphic (gif/pdf) representations favoured by a minority of researchers).</p>
         <sec>
            <st>
               <p>Practical recommendations</p>
            </st>
            <p>For well aligned short sequences, both Pfold and RNAalifold generally perform well, PFold performed marginally better than RNAalifold. It is likely that some moderate refinements to RNAalifold would improve accuracy without altering the efficiency, for example, if gaps were not penalised in the free-energy evaluation and a more sophisticated model for scoring mutations was employed, perhaps ribosum matrices <abbrgrp><abbr bid="B92">92</abbr></abbrgrp> could be used to weight base-pair bonuses and penalties. For well aligned, long sequences the performance and speed of RNAalifold was excellent. For data-sets consisting of short (&lt; 200 bases) and diverse sequences Dynalign might do well, as it does not require sequence similarity &#8211; in fact the scoring function does not include sequence comparison. Otherwise, one might choose to use a mixture of RNAalifold and/or Pfold to fold similar clades and RNAforester and/or MARNA to align folded clades. Advocates of plan A should note that many multiple sequence alignment algorithms generally do not favour transitions over transversions or employ ad hoc 2-parameter methods to model these (ClustalW <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> for example). Structural RNA sequences however evolve rapidly via structure neutral mutations which are frequently transitions and rarely transversions <abbrgrp><abbr bid="B92">92</abbr><abbr bid="B93">93</abbr></abbrgrp>. Multiple sequence algorithms which employ more complex yet more accurate models of sequence evolution will undoubtedly produce "better" alignments for folding.</p>
            <p>Carnac produced highly selective structures for all the test data-sets, which if used to constrain a free energy fold produced sensitive predictions with a cost to selectivity. The consistency of Carnac performance is remarkable, for all the data-sets considered here this heuristic approach performed well. It is however unclear how Carnac will perform on highly diverse data-sets.</p>
            <p>For advocates of plan C, we have an encouraging message: Both MARNA and RNAforester perform better on the medium similarity data than on high similarity data. This seems paradoxical at first glance, but one must understand that for an approach purely based on predicted structures, high sequence similarity can be a curse rather than a blessing: If sequences are very similar, they may jointly fold into the wrong MFE structure. With more sequence variation, it becomes more likely that at least some family members have good predictions, which by their mutual similarity can be picked out from the rest. This means that especially in the case of low sequence similarity, where nothing else works, plan C, currently the least explored strategy of all, has a certain promise.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Finally, let us outline some directions for future research.</p>
         <p>An implementation of the single sequence pseudoknot algorithms <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B94">94</abbr></abbrgrp> employing similar strategies to RNAalifold <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> for alignment folding would be most useful. Based upon the RNAalifold results this approach would dramatically increase the accuracy of these algorithms upon certain data-sets. Also, an extension of these allowing constrained foldings to incorporate prior knowledge would be of assistance, this has proved extremely useful for MFE predictions. Sampling structures from reference alignments is also likely to prove beneficial. The implementation of fast and accurate variants of the Sankoff algorithm remains an open problem.</p>
         <p>Again allowing constrained foldings and alignments would be useful. The further development of "BLAST-like" folding heuristics for this should be a priority, obviously Carnac is a good start. The MARNA approach for producing structurally enhanced multiple alignments produced rather selective results after trimming high-entropy base-pairs from MFE predictions. This suggests that weighting edit-distances with partition-function derived probabilities or entropies will produce reasonable RNA alignments. A consensus structure could then be derived from MFE-structures or from PFold or RNAalifold predictions on the resultant alignment. This approach would effectively decouple the Sankoff algorithm into manageable structure-enhanced-alignment and folding stages.</p>
         <sec>
            <st>
               <p>Note added in proof</p>
            </st>
            <p>Two further developments are likely to increase the power of plan C. Pure multiple structure alignment (as opposed to pairwise alignment used here) presented in <abbrgrp><abbr bid="B95">95</abbr></abbrgrp> may leave out some misfolded structures from a progressively constructed profile aligment. A small but representative set of near-optimal structures can now be derived by abstract shape analysis <abbrgrp><abbr bid="B96">96</abbr></abbrgrp>. Combining both approaches, one could consider a progressive multiple alignment approach where these representative, near-optimal structures are included for each sequence.</p>
            <p>More training data is essential for this field to progress, for this homology search tools are essential. Infernal <abbrgrp><abbr bid="B91">91</abbr><abbr bid="B97">97</abbr></abbrgrp> used to construct the Rfam database <abbrgrp><abbr bid="B98">98</abbr><abbr bid="B99">99</abbr></abbrgrp> is an excellent approach but sensitivity might be increased with a phylogenetic approach and RNA-specific sequence search tools. The implementation of methods combining energetics, covariation <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> and co-transcriptional folding <abbrgrp><abbr bid="B100">100</abbr></abbrgrp> in a statistically reasonable manner is also a potentially fruitful direction for development.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>PPG carried out the experiments, the analysis and drafted the manuscript. RG suggested comparing comparative structure prediction methods and assisted in the manuscript preparation. All authors read and approved the final manuscript.</p>
         <tbl id="T2">
            <title>
               <p>Table 2</p>
            </title>
            <caption>
               <p>The following tables display results of several structure predictions using a variety of algorithms upon data-sets containing either <b><it>S. cerevisiae </it>tRNA-PHE, <it>E. coli </it>RNase P, <it>E. coli </it>SSU rRNA </b>or <b><it>E. coli </it>LSU rRNA </b>sequences. Reading columns from left to right we show: prediction method, number of base-pairs in the reference structure, number of base-pairs in the predicted structure, the number of true positive base-pairs in the prediction (% sensitivity as described earlier in parentheses), the number of false positive base-pairs in the prediction (% selectivity as described earlier in parentheses), correlation values are the "Matthews correlation coefficient" (with approximate correlation in parentheses). Each of these MFE-based attempts to predict the famous <it>S. cerevisiae </it>tRNA-PHE structure converges on an alternative lengthy-helix type structure. Adding prior knowledge, such as forcing modified bases in the RNA sequence to be unpaired can produce dramatic improvements.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>S. cerevisiae </it>tRNA-PHE: Single Sequence Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAfold</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>23</p>
                  </c>
                  <c ca="center">
                     <p>4 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>16 (20.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.178 (21.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>4 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>14 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.191 (22.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>22</p>
                  </c>
                  <c ca="center">
                     <p>8 (44.4)</p>
                  </c>
                  <c ca="center">
                     <p>11 (42.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.409 (43.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>23</p>
                  </c>
                  <c ca="center">
                     <p>4 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>16 (20.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.178 (21.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>23</p>
                  </c>
                  <c ca="center">
                     <p>4 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>16 (20.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.178 (21.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>23</p>
                  </c>
                  <c ca="center">
                     <p>4 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>16 (20.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.178 (21.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>4 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>14 (22.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.191 (22.2)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Generally the comparative approaches perform much better than MFE methods at determining <it>S. cerevisiae </it>tRNA-PHE structure. For the consensus predictions of RNAalifold and Carnac we also computed "filled" structures using constrained MFE predictions. This usually improved the sensitivity of the methods. PFold a built-in stem-extension procedure to fill structures. As the tRNA structure contains a multi-loop Foldalign is not expected to perform well here. Dynalign performed well on the most diverse data-set (M) but didn't do well on the high similarity data-set. The structure alignment methods generally did poorly here. Most probably due to the miss-folded MFE structure which were used as input. Trimming high entropy base-pairs from the input structures produced modest improvements.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="left">
                     <p>
                        <b><it>S. cerevisiae </it>tRNA-PHE: Comparative Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan A: ClustalW Alignment</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>20</p>
                  </c>
                  <c ca="center">
                     <p>19 (90.5)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.950 (95.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>21 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>1.000 (100.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>14</p>
                  </c>
                  <c ca="center">
                     <p>14 (77.8)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.880 (88.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>18 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>1.000 (100.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>24</p>
                  </c>
                  <c ca="center">
                     <p>16 (76.2)</p>
                  </c>
                  <c ca="center">
                     <p>7 (69.6)</p>
                  </c>
                  <c ca="center">
                     <p>0.722 (72.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (M)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>30</p>
                  </c>
                  <c ca="center">
                     <p>18 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>6 (75.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.863 (87.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>20 (95.2)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.975 (97.6)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (M)</p>
                  </c>
                  <c ca="center">
                     <p>18</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>18 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>1.000 (100.0)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan B: Unaligned sequences</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>17</p>
                  </c>
                  <c ca="center">
                     <p>15 (71.4)</p>
                  </c>
                  <c ca="center">
                     <p>1 (93.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.815 (82.6)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>19 (90.5)</p>
                  </c>
                  <c ca="center">
                     <p>1 (95.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.925 (92.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>13</p>
                  </c>
                  <c ca="center">
                     <p>12 (57.1)</p>
                  </c>
                  <c ca="center">
                     <p>1 (92.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.722 (74.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>22</p>
                  </c>
                  <c ca="center">
                     <p>16 (76.2)</p>
                  </c>
                  <c ca="center">
                     <p>5 (76.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.757 (76.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Dynalign (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>22.40</p>
                  </c>
                  <c ca="center">
                     <p>11.50 (54.78)</p>
                  </c>
                  <c ca="center">
                     <p>10.20 (54.45)</p>
                  </c>
                  <c ca="center">
                     <p>0.5353 (54.59)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Dynalign (M)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>21.10</p>
                  </c>
                  <c ca="center">
                     <p>19.80 (94.27)</p>
                  </c>
                  <c ca="center">
                     <p>1.20 (95.00)</p>
                  </c>
                  <c ca="center">
                     <p>0.9448 (94.64)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Foldalign (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>16</p>
                  </c>
                  <c ca="center">
                     <p>5 (23.8)</p>
                  </c>
                  <c ca="center">
                     <p>11 (31.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.259 (27.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Foldalign (M)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>16</p>
                  </c>
                  <c ca="center">
                     <p>5 (23.8)</p>
                  </c>
                  <c ca="center">
                     <p>10 (33.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.268 (28.6)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan C: Structure alignment</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>19</p>
                  </c>
                  <c ca="center">
                     <p>6 (28.6)</p>
                  </c>
                  <c ca="center">
                     <p>12 (33.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.295 (31.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA (M)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>22</p>
                  </c>
                  <c ca="center">
                     <p>7 (33.3)</p>
                  </c>
                  <c ca="center">
                     <p>15 (31.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.311 (32.6)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA-trim (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>6</p>
                  </c>
                  <c ca="center">
                     <p>6 (28.6)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.530 (64.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA-trim (M)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>15</p>
                  </c>
                  <c ca="center">
                     <p>15 (71.4)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.843 (85.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAforester (H)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>23</p>
                  </c>
                  <c ca="center">
                     <p>6 (28.6)</p>
                  </c>
                  <c ca="center">
                     <p>16 (27.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.263 (27.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAforester (M)</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>21</p>
                  </c>
                  <c ca="center">
                     <p>14 (66.7)</p>
                  </c>
                  <c ca="center">
                     <p>7 (66.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.659 (66.7)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T4">
            <title>
               <p>Table 4</p>
            </title>
            <caption>
               <p>Note the improvement in prediction accuracy on the supposedly more difficult and longer <it>E. coli </it>RNase P data-set. This shows that MFE methods are less sensitive to folding errors on longer data-sets but are also less likely to resolve the entire structure. There is little difference in algorithm accuracy for each of the methods explored here. Each employs the same energy parameters so differences are due to slightly different implementations.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>E. coli </it>RNase P: Single Sequence Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAfold</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>116</p>
                  </c>
                  <c ca="center">
                     <p>69 (62.7)</p>
                  </c>
                  <c ca="center">
                     <p>46 (60.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.612 (61.4)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>118</p>
                  </c>
                  <c ca="center">
                     <p>67 (60.9)</p>
                  </c>
                  <c ca="center">
                     <p>49 (57.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.591 (59.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>114</p>
                  </c>
                  <c ca="center">
                     <p>67 (60.9)</p>
                  </c>
                  <c ca="center">
                     <p>46 (59.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.599 (60.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>118</p>
                  </c>
                  <c ca="center">
                     <p>76 (69.1)</p>
                  </c>
                  <c ca="center">
                     <p>37 (67.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.680 (68.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>116</p>
                  </c>
                  <c ca="center">
                     <p>73 (66.4)</p>
                  </c>
                  <c ca="center">
                     <p>42 (63.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.647 (64.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>119</p>
                  </c>
                  <c ca="center">
                     <p>86 (78.2)</p>
                  </c>
                  <c ca="center">
                     <p>28 (75.4)</p>
                  </c>
                  <c ca="center">
                     <p>0.767 (76.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>117</p>
                  </c>
                  <c ca="center">
                     <p>61 (55.5)</p>
                  </c>
                  <c ca="center">
                     <p>55 (52.6)</p>
                  </c>
                  <c ca="center">
                     <p>0.538 (54.0)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T5">
            <title>
               <p>Table 5</p>
            </title>
            <caption>
               <p>RNase P is a difficult data-set to study. Five sequences in the high similarity data-set are truncated at both the 5 and 3 prime ends (due to the primers used for sequencing these). Sequences in the medium similarity data-set are full-length but do not align well using traditional tools such as ClustalW. Values corresponding to the re-evaluation of ILM with pseudo-knot inclusive reference structures are indicated by "ILM-pknot".</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>E. coli </it>RNase P: Comparative Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>113</p>
                  </c>
                  <c ca="center">
                     <p>56 (78.9)</p>
                  </c>
                  <c ca="center">
                     <p>16 (77.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.782 (78.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>119</p>
                  </c>
                  <c ca="center">
                     <p>55 (77.5)</p>
                  </c>
                  <c ca="center">
                     <p>16 (77.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.773 (77.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M)</p>
                  </c>
                  <c ca="center">
                     <p>54</p>
                  </c>
                  <c ca="center">
                     <p>66</p>
                  </c>
                  <c ca="center">
                     <p>31 (57.4)</p>
                  </c>
                  <c ca="center">
                     <p>23 (57.4)</p>
                  </c>
                  <c ca="center">
                     <p>0.571 (57.4)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>54</p>
                  </c>
                  <c ca="center">
                     <p>77</p>
                  </c>
                  <c ca="center">
                     <p>33 (61.1)</p>
                  </c>
                  <c ca="center">
                     <p>16 (67.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.639 (64.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>67</p>
                  </c>
                  <c ca="center">
                     <p>47 (66.2)</p>
                  </c>
                  <c ca="center">
                     <p>6 (88.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.765 (77.4)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (M)</p>
                  </c>
                  <c ca="center">
                     <p>54</p>
                  </c>
                  <c ca="center">
                     <p>87</p>
                  </c>
                  <c ca="center">
                     <p>47 (87.0)</p>
                  </c>
                  <c ca="center">
                     <p>4 (92.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.895 (89.6)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>124</p>
                  </c>
                  <c ca="center">
                     <p>31 (43.7)</p>
                  </c>
                  <c ca="center">
                     <p>54 (36.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.395 (40.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (M)</p>
                  </c>
                  <c ca="center">
                     <p>54</p>
                  </c>
                  <c ca="center">
                     <p>133</p>
                  </c>
                  <c ca="center">
                     <p>38 (70.4)</p>
                  </c>
                  <c ca="center">
                     <p>31 (55.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.620 (62.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM-pknot (H)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>124</p>
                  </c>
                  <c ca="center">
                     <p>53 (48.2)</p>
                  </c>
                  <c ca="center">
                     <p>65 (44.9)</p>
                  </c>
                  <c ca="center">
                     <p>0.463 (46.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM-pknot (M)</p>
                  </c>
                  <c ca="center">
                     <p>110</p>
                  </c>
                  <c ca="center">
                     <p>133</p>
                  </c>
                  <c ca="center">
                     <p>44 (40.0)</p>
                  </c>
                  <c ca="center">
                     <p>75 (37.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.382 (38.5)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan B: Unaligned sequences</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>40</p>
                  </c>
                  <c ca="center">
                     <p>36 (50.7)</p>
                  </c>
                  <c ca="center">
                     <p>0 (100.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.712 (75.4)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>116</p>
                  </c>
                  <c ca="center">
                     <p>50 (70.4)</p>
                  </c>
                  <c ca="center">
                     <p>25 (66.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.684 (68.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M)</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>80</p>
                  </c>
                  <c ca="center">
                     <p>63 (64.9)</p>
                  </c>
                  <c ca="center">
                     <p>3 (95.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.787 (80.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>118</p>
                  </c>
                  <c ca="center">
                     <p>78 (80.4)</p>
                  </c>
                  <c ca="center">
                     <p>25 (75.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.779 (78.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Foldalign (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>41</p>
                  </c>
                  <c ca="center">
                     <p>14 (19.7)</p>
                  </c>
                  <c ca="center">
                     <p>25 (35.9)</p>
                  </c>
                  <c ca="center">
                     <p>0.265 (27.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Foldalign (M)</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>24</p>
                  </c>
                  <c ca="center">
                     <p>5 (5.2)</p>
                  </c>
                  <c ca="center">
                     <p>17 (22.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.107 (13.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Dynalign (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>95.13</p>
                  </c>
                  <c ca="center">
                     <p>28.63 (40.31)</p>
                  </c>
                  <c ca="center">
                     <p>41.50 (39.59)</p>
                  </c>
                  <c ca="center">
                     <p>0.3974 (39.96)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Dynalign (M)</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>103.20</p>
                  </c>
                  <c ca="center">
                     <p>31.00 (31.95)</p>
                  </c>
                  <c ca="center">
                     <p>61.50 (32.80)</p>
                  </c>
                  <c ca="center">
                     <p>0.3208 (32.39)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan C: Structure alignment</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>89</p>
                  </c>
                  <c ca="center">
                     <p>37 (52.1)</p>
                  </c>
                  <c ca="center">
                     <p>23 (61.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.566 (56.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA (M)</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>60</p>
                  </c>
                  <c ca="center">
                     <p>48 (49.5)</p>
                  </c>
                  <c ca="center">
                     <p>9 (84.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.645 (66.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA-trim (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>52</p>
                  </c>
                  <c ca="center">
                     <p>37 (52.1)</p>
                  </c>
                  <c ca="center">
                     <p>3 (92.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.694 (72.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>MARNA-trim (M)</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>43</p>
                  </c>
                  <c ca="center">
                     <p>39 (40.2)</p>
                  </c>
                  <c ca="center">
                     <p>1 (97.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.625 (68.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAforester (H)</p>
                  </c>
                  <c ca="center">
                     <p>71</p>
                  </c>
                  <c ca="center">
                     <p>114</p>
                  </c>
                  <c ca="center">
                     <p>40 (56.3)</p>
                  </c>
                  <c ca="center">
                     <p>31 (56.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.562 (56.3)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAforester (M)</p>
                  </c>
                  <c ca="center">
                     <p>97</p>
                  </c>
                  <c ca="center">
                     <p>117</p>
                  </c>
                  <c ca="center">
                     <p>64 (66.0)</p>
                  </c>
                  <c ca="center">
                     <p>44 (59.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.624 (62.6)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T6">
            <title>
               <p>Table 6</p>
            </title>
            <caption>
               <p><it>E. coli </it>SSU rRNA with a length of approximately 1600 nucleotides is beyond the reach of many structure prediction algorithms such as RNAforester and Dynalign. The minimum free energy methods, however, can produce results.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>E. coli </it>SSU rRNA: Single Sequence Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAfold</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>493</p>
                  </c>
                  <c ca="center">
                     <p>207 (44.2)</p>
                  </c>
                  <c ca="center">
                     <p>271 (43.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.437 (43.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>480</p>
                  </c>
                  <c ca="center">
                     <p>240 (51.3)</p>
                  </c>
                  <c ca="center">
                     <p>224 (51.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.515 (51.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>487</p>
                  </c>
                  <c ca="center">
                     <p>242 (51.7)</p>
                  </c>
                  <c ca="center">
                     <p>229 (51.4)</p>
                  </c>
                  <c ca="center">
                     <p>0.515 (51.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>487</p>
                  </c>
                  <c ca="center">
                     <p>202 (43.2)</p>
                  </c>
                  <c ca="center">
                     <p>273 (42.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.428 (42.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>481</p>
                  </c>
                  <c ca="center">
                     <p>232 (49.6)</p>
                  </c>
                  <c ca="center">
                     <p>229 (50.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.499 (49.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>499</p>
                  </c>
                  <c ca="center">
                     <p>231 (49.4)</p>
                  </c>
                  <c ca="center">
                     <p>249 (48.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.487 (48.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>475</p>
                  </c>
                  <c ca="center">
                     <p>232 (49.6)</p>
                  </c>
                  <c ca="center">
                     <p>230 (50.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.498 (49.9)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T7">
            <title>
               <p>Table 7</p>
            </title>
            <caption>
               <p>The probabilistic approach of PFold can, on occasion, suffer from "under-flow" errors caused by multiplying many probabilities together producing numbers too low to be dealt with on modern computers. This is what has happened on the medium similarity data-set.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>E. coli </it>SSU rRNA: Comparative Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan A: ClustalW Alignment</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H)</p>
                  </c>
                  <c ca="center">
                     <p>460</p>
                  </c>
                  <c ca="center">
                     <p>472</p>
                  </c>
                  <c ca="center">
                     <p>275 (59.8)</p>
                  </c>
                  <c ca="center">
                     <p>179 (60.6)</p>
                  </c>
                  <c ca="center">
                     <p>0.601 (60.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>460</p>
                  </c>
                  <c ca="center">
                     <p>483</p>
                  </c>
                  <c ca="center">
                     <p>273 (59.3)</p>
                  </c>
                  <c ca="center">
                     <p>195 (58.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.588 (58.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M)</p>
                  </c>
                  <c ca="center">
                     <p>441</p>
                  </c>
                  <c ca="center">
                     <p>433</p>
                  </c>
                  <c ca="center">
                     <p>372 (84.4)</p>
                  </c>
                  <c ca="center">
                     <p>32 (92.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.881 (88.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>441</p>
                  </c>
                  <c ca="center">
                     <p>469</p>
                  </c>
                  <c ca="center">
                     <p>388 (88.0)</p>
                  </c>
                  <c ca="center">
                     <p>44 (89.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.889 (88.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (H)</p>
                  </c>
                  <c ca="center">
                     <p>460</p>
                  </c>
                  <c ca="center">
                     <p>377</p>
                  </c>
                  <c ca="center">
                     <p>326 (70.9)</p>
                  </c>
                  <c ca="center">
                     <p>26 (92.6)</p>
                  </c>
                  <c ca="center">
                     <p>0.810 (81.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (M)</p>
                  </c>
                  <c ca="center">
                     <p>441</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0 (0.0)</p>
                  </c>
                  <c ca="center">
                     <p>0 (0.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.000 (0.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (H)</p>
                  </c>
                  <c ca="center">
                     <p>460</p>
                  </c>
                  <c ca="center">
                     <p>565</p>
                  </c>
                  <c ca="center">
                     <p>236 (51.3)</p>
                  </c>
                  <c ca="center">
                     <p>313 (43.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.469 (47.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (M)</p>
                  </c>
                  <c ca="center">
                     <p>441</p>
                  </c>
                  <c ca="center">
                     <p>564</p>
                  </c>
                  <c ca="center">
                     <p>264 (59.9)</p>
                  </c>
                  <c ca="center">
                     <p>249 (51.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.554 (55.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM-pknot (H)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>565</p>
                  </c>
                  <c ca="center">
                     <p>236 (50.4)</p>
                  </c>
                  <c ca="center">
                     <p>311 (43.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.466 (46.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM-pknot (M)</p>
                  </c>
                  <c ca="center">
                     <p>468</p>
                  </c>
                  <c ca="center">
                     <p>564</p>
                  </c>
                  <c ca="center">
                     <p>266 (56.8)</p>
                  </c>
                  <c ca="center">
                     <p>258 (50.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.537 (53.8)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan B: Unaligned sequences</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H)</p>
                  </c>
                  <c ca="center">
                     <p>460</p>
                  </c>
                  <c ca="center">
                     <p>233</p>
                  </c>
                  <c ca="center">
                     <p>206 (44.8)</p>
                  </c>
                  <c ca="center">
                     <p>12 (94.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.650 (69.6)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>460</p>
                  </c>
                  <c ca="center">
                     <p>470</p>
                  </c>
                  <c ca="center">
                     <p>332 (72.2)</p>
                  </c>
                  <c ca="center">
                     <p>112 (74.8)</p>
                  </c>
                  <c ca="center">
                     <p>0.734 (73.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M)</p>
                  </c>
                  <c ca="center">
                     <p>448</p>
                  </c>
                  <c ca="center">
                     <p>294</p>
                  </c>
                  <c ca="center">
                     <p>259 (57.8)</p>
                  </c>
                  <c ca="center">
                     <p>18 (93.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.735 (75.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>448</p>
                  </c>
                  <c ca="center">
                     <p>471</p>
                  </c>
                  <c ca="center">
                     <p>337 (75.2)</p>
                  </c>
                  <c ca="center">
                     <p>110 (75.4)</p>
                  </c>
                  <c ca="center">
                     <p>0.753 (75.3)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T8">
            <title>
               <p>Table 8</p>
            </title>
            <caption>
               <p><it>E. coli </it>LSU rRNA is approximately 3350 nucleotides in length. The longest member of our test-set. The highest ranked Sfold prediction is remarkably poor, resolving just 5.8% of the reference structure.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>E. coli </it>LSU rRNA: Single Sequence Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAfold</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>906</p>
                  </c>
                  <c ca="center">
                     <p>435 (51.8)</p>
                  </c>
                  <c ca="center">
                     <p>430 (50.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.510 (51.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>883</p>
                  </c>
                  <c ca="center">
                     <p>458 (54.6)</p>
                  </c>
                  <c ca="center">
                     <p>383 (54.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.545 (54.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>892</p>
                  </c>
                  <c ca="center">
                     <p>480 (57.2)</p>
                  </c>
                  <c ca="center">
                     <p>364 (56.9)</p>
                  </c>
                  <c ca="center">
                     <p>0.570 (57.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Mfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>889</p>
                  </c>
                  <c ca="center">
                     <p>454 (54.1)</p>
                  </c>
                  <c ca="center">
                     <p>392 (53.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.539 (53.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (1)</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>903</p>
                  </c>
                  <c ca="center">
                     <p>49 (5.8)</p>
                  </c>
                  <c ca="center">
                     <p>811 (5.7)</p>
                  </c>
                  <c ca="center">
                     <p>0.057 (5.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (2)</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>878</p>
                  </c>
                  <c ca="center">
                     <p>432 (51.5)</p>
                  </c>
                  <c ca="center">
                     <p>411 (51.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.513 (51.4)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Sfold (3)</p>
                  </c>
                  <c ca="center">
                     <p>839</p>
                  </c>
                  <c ca="center">
                     <p>882</p>
                  </c>
                  <c ca="center">
                     <p>384 (45.8)</p>
                  </c>
                  <c ca="center">
                     <p>463 (45.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.455 (45.6)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <tbl id="T9">
            <title>
               <p>Table 9</p>
            </title>
            <caption>
               <p>Pfold predictions on both the high and medium similarity data-sets underflow on <it>E. coli </it>LSU rRNA. RNAalifold and Carnac, however, produce reasonable results.</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c cspan="5" ca="center">
                     <p>
                        <b><it>E. coli </it>LSU rRNA: Comparative Methods</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Algorithm</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in reference</p>
                  </c>
                  <c ca="center">
                     <p>number of bps in prediction</p>
                  </c>
                  <c ca="center">
                     <p>True Positives (% sensitivity)</p>
                  </c>
                  <c ca="center">
                     <p>False Positives (% selectivity)</p>
                  </c>
                  <c ca="center">
                     <p>Correlation (%)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan A: ClustalW Alignment</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H)</p>
                  </c>
                  <c ca="center">
                     <p>794</p>
                  </c>
                  <c ca="center">
                     <p>879</p>
                  </c>
                  <c ca="center">
                     <p>627 (79.0)</p>
                  </c>
                  <c ca="center">
                     <p>195 (76.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.776 (77.6)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>794</p>
                  </c>
                  <c ca="center">
                     <p>871</p>
                  </c>
                  <c ca="center">
                     <p>629 (79.2)</p>
                  </c>
                  <c ca="center">
                     <p>185 (77.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.782 (78.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M)</p>
                  </c>
                  <c ca="center">
                     <p>819</p>
                  </c>
                  <c ca="center">
                     <p>721</p>
                  </c>
                  <c ca="center">
                     <p>614 (75.0)</p>
                  </c>
                  <c ca="center">
                     <p>53 (92.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.831 (83.5)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>RNAalifold (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>819</p>
                  </c>
                  <c ca="center">
                     <p>790</p>
                  </c>
                  <c ca="center">
                     <p>691 (84.4)</p>
                  </c>
                  <c ca="center">
                     <p>78 (89.9)</p>
                  </c>
                  <c ca="center">
                     <p>0.871 (87.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (H)</p>
                  </c>
                  <c ca="center">
                     <p>794</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0 (0.0)</p>
                  </c>
                  <c ca="center">
                     <p>0 (0.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.000 (0.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Pfold (M)</p>
                  </c>
                  <c ca="center">
                     <p>819</p>
                  </c>
                  <c ca="center">
                     <p>0</p>
                  </c>
                  <c ca="center">
                     <p>0 (0.0)</p>
                  </c>
                  <c ca="center">
                     <p>0 (0.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.000 (0.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (H)</p>
                  </c>
                  <c ca="center">
                     <p>794</p>
                  </c>
                  <c ca="center">
                     <p>1048</p>
                  </c>
                  <c ca="center">
                     <p>389 (49.0)</p>
                  </c>
                  <c ca="center">
                     <p>602 (39.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.438 (44.1)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM (M)</p>
                  </c>
                  <c ca="center">
                     <p>819</p>
                  </c>
                  <c ca="center">
                     <p>1161</p>
                  </c>
                  <c ca="center">
                     <p>560 (68.4)</p>
                  </c>
                  <c ca="center">
                     <p>405 (58.0)</p>
                  </c>
                  <c ca="center">
                     <p>0.630 (63.2)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM-pknot (H)</p>
                  </c>
                  <c ca="center">
                     <p>869</p>
                  </c>
                  <c ca="center">
                     <p>1048</p>
                  </c>
                  <c ca="center">
                     <p>272 (31.3)</p>
                  </c>
                  <c ca="center">
                     <p>759 (26.4)</p>
                  </c>
                  <c ca="center">
                     <p>0.287 (28.8)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>ILM-pknot (M)</p>
                  </c>
                  <c ca="center">
                     <p>869</p>
                  </c>
                  <c ca="center">
                     <p>1161</p>
                  </c>
                  <c ca="center">
                     <p>377 (43.4)</p>
                  </c>
                  <c ca="center">
                     <p>629 (37.5)</p>
                  </c>
                  <c ca="center">
                     <p>0.403 (40.4)</p>
                  </c>
               </r>
               <r>
                  <c cspan="6" ca="left">
                     <p>Plan B: Unaligned sequences</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H)</p>
                  </c>
                  <c ca="center">
                     <p>816</p>
                  </c>
                  <c ca="center">
                     <p>422</p>
                  </c>
                  <c ca="center">
                     <p>390 (47.8)</p>
                  </c>
                  <c ca="center">
                     <p>7 (98.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.685 (73.0)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (H) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>816</p>
                  </c>
                  <c ca="center">
                     <p>873</p>
                  </c>
                  <c ca="center">
                     <p>674 (82.6)</p>
                  </c>
                  <c ca="center">
                     <p>156 (81.2)</p>
                  </c>
                  <c ca="center">
                     <p>0.819 (81.9)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M)</p>
                  </c>
                  <c ca="center">
                     <p>821</p>
                  </c>
                  <c ca="center">
                     <p>508</p>
                  </c>
                  <c ca="center">
                     <p>463 (56.4)</p>
                  </c>
                  <c ca="center">
                     <p>14 (97.1)</p>
                  </c>
                  <c ca="center">
                     <p>0.740 (76.7)</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Carnac (M) + RNAfold-C</p>
                  </c>
                  <c ca="center">
                     <p>821</p>
                  </c>
                  <c ca="center">
                     <p>865</p>
                  </c>
                  <c ca="center">
                     <p>682 (83.1)</p>
                  </c>
                  <c ca="center">
                     <p>147 (82.3)</p>
                  </c>
                  <c ca="center">
                     <p>0.827 (82.7)</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors thank the numerous researchers who provided access, documentation and installation assistance for their algorithms; Notably Ivo Hofacker, Dave Mathews, Bjarne Knudsen, Matthias Hochsmann and Sven Siebert, authors of RNAalifold, Dynalign, Pfold, RNAforester and MARNA respectively. PPG thanks Niels Hansen and Andreas Wilm for useful discussions and advice. PPG was supported by a DFG (German Research Foundation) post-doctoral scholarship and a Carlsberg Foundation Grant (21-00-0680). The basis of much of this work was conceived at the ESF and NIH funded 2003 computational RNA workshop in Benasque, Spain. The authors thank the (mostly) anonymous reviewers for their constructive comments.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The natural chemical repertoire of natural ribozymes</p>
            </title>
            <aug>
               <au>
                  <snm>Doudna</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cech</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <fpage>222</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/418222a</pubid>
                  <pubid idtype="pmpid" link="fulltext">12110898</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The path from the RNA world</p>
            </title>
            <aug>
               <au>
                  <snm>Poole</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Jeffares</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1998</pubdate>
            <volume>46</volume>
            <fpage>1</fpage>
            <lpage>17</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9419221</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Relics from the RNA world</p>
            </title>
            <aug>
               <au>
                  <snm>Jeffares</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Poole</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1998</pubdate>
            <volume>46</volume>
            <fpage>18</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9419222</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Large-scale transcriptional activity in chromosomes 21 and 22</p>
            </title>
            <aug>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cawley</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Strausberg</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fodor</snm>
                  <fnm>SPA</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>916</fpage>
            <lpage>919</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1068597</pubid>
                  <pubid idtype="pmpid" link="fulltext">11988577</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Novel RNAs identified from an in-depth analysis of the transcriptome of human chromosomes 21 and 22</p>
            </title>
            <aug>
               <au>
                  <snm>Kampa</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Yamanaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brubaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cawley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Piccolboni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Helt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tammana</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>TR</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>331</fpage>
            <lpage>342</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.2094104</pubid>
                  <pubid idtype="pmpid" link="fulltext">14993201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Unbiased mapping of transcription factor binding sites along human chromosomes 21 and 22 points to widespread regulation of noncoding RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Cawley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bekiranov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kapranov</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sekinger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Kampa</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Piccolboni</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sementchenko</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Drenkow</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Yamanaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Patel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brubaker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tammana</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Helt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Struhl</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gingeras</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2004</pubdate>
            <volume>116</volume>
            <issue>4</issue>
            <fpage>499</fpage>
            <lpage>509</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0092-8674(04)00127-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">14980218</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The evolution of controlled multitasked gene networks: The role of introns and other noncoding RNAs in the development of complex organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Mattick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gagen</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <fpage>1611</fpage>
            <lpage>1630</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11504843</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Non-coding RNAs: the architects of eukaryotic complexity</p>
            </title>
            <aug>
               <au>
                  <snm>Mattick</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>EMBO Reports</source>
            <pubdate>2001</pubdate>
            <volume>2</volume>
            <fpage>986</fpage>
            <lpage>991</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/embo-reports/kve230</pubid>
                  <pubid idtype="pmpid" link="fulltext">11713189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>How RNA folds</p>
            </title>
            <aug>
               <au>
                  <snm>Tinoco</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bustamante</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>293</volume>
            <issue>2</issue>
            <fpage>271</fpage>
            <lpage>281</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.3001</pubid>
                  <pubid idtype="pmpid" link="fulltext">10550208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>RNA folding and unfolding</p>
            </title>
            <aug>
               <au>
                  <snm>Onoa</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Tinoco</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>374</fpage>
            <lpage>379</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.sbi.2004.04.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">15193319</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>NMR spectroscopy of RNA</p>
            </title>
            <aug>
               <au>
                  <snm>F&#252;rtig</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Richter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>W&#246;hnert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schwalbe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Chembiochem</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>10</issue>
            <fpage>936</fpage>
            <lpage>962</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/cbic.200300700</pubid>
                  <pubid idtype="pmpid" link="fulltext">14523911</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information</p>
            </title>
            <aug>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stiegler</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1981</pubdate>
            <volume>9</volume>
            <fpage>133</fpage>
            <lpage>148</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">326673</pubid>
                  <pubid idtype="pmpid">6163133</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Fast folding and comparison of RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Fontana</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Bonhoeffer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>PF</fnm>
               </au>
            </aug>
            <source>Monatshefte fur Chemie</source>
            <pubdate>1994</pubdate>
            <volume>125</volume>
            <fpage>167</fpage>
            <lpage>188</lpage>
         </bibl>
         <bibl id="B14">
            <aug>
               <au>
                  <snm>Woese</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pace</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>The RNA World, chap. Probing RNA structure, function, and history by comparative analysis</source>
            <publisher>Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY</publisher>
            <pubdate>1993</pubdate>
            <fpage>91</fpage>
            <lpage>117</lpage>
         </bibl>
         <bibl id="B15">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid">7984417</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>T-COFFEE: A novel method for fast and accurate multiple alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Notredame</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Heringa</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Biology</source>
            <pubdate>2000</pubdate>
            <volume>302</volume>
            <fpage>205</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4042</pubid>
                  <pubid idtype="pmpid" link="fulltext">10964570</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Multiple sequence alignment: algorithms and applications</p>
            </title>
            <aug>
               <au>
                  <snm>Gotoh</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Adv Biophys</source>
            <pubdate>1999</pubdate>
            <volume>36</volume>
            <fpage>159</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10463075</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Inferring consensus structure from nucleic acid sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Chiu</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Kolodziejczak</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1991</pubdate>
            <volume>7</volume>
            <fpage>347</fpage>
            <lpage>352</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1913217</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Identifying constraints on the higher-order structure of RNA: continued development and application of comparative sequence analysis methods</p>
            </title>
            <aug>
               <au>
                  <snm>Gutell</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Power</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hertz</snm>
                  <fnm>GZ</fnm>
               </au>
               <au>
                  <snm>Putz</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>GD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1992</pubdate>
            <volume>20</volume>
            <fpage>5785</fpage>
            <lpage>5795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">334417</pubid>
                  <pubid idtype="pmpid">1454539</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Displaying the information contents of structural RNA alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Gorodkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Heyer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1997</pubdate>
            <volume>13</volume>
            <fpage>583</fpage>
            <lpage>586</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9475985</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Secondary structure prediction for aligned RNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Fekete</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Biology</source>
            <pubdate>2002</pubdate>
            <volume>319</volume>
            <issue>5</issue>
            <fpage>1059</fpage>
            <lpage>1066</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0022-2836(02)00308-X</pubid>
                  <pubid idtype="pmpid" link="fulltext">12079347</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>An iterated loop matching approach to the prediction of RNA secondary structures with pseudoknots</p>
            </title>
            <aug>
               <au>
                  <snm>Ruan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>58</fpage>
            <lpage>66</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg373</pubid>
                  <pubid idtype="pmpid" link="fulltext">14693809</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Pfold: RNA secondary structure prediction using stochastic context-free grammars</p>
            </title>
            <aug>
               <au>
                  <snm>Knudsen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3423</fpage>
            <lpage>3428</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">169020</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824339</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg614</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>RNA secondary structure prediction using stochastic context-free grammars and evolutionary history</p>
            </title>
            <aug>
               <au>
                  <snm>Knudsen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <issue>6</issue>
            <fpage>446</fpage>
            <lpage>454</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.6.446</pubid>
                  <pubid idtype="pmpid" link="fulltext">10383470</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Simultaneous solution of the RNA folding, alignment and protosequence problems</p>
            </title>
            <aug>
               <au>
                  <snm>Sankoff</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>SIAM Journal on Applied Mathematics</source>
            <pubdate>1985</pubdate>
            <volume>45</volume>
            <fpage>810</fpage>
            <lpage>825</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Alignment of RNA base pairing probability matrices</p>
            </title>
            <aug>
               <au>
                  <snm>Hofacker</snm>
                  <fnm>IL</fnm>
               </au>
               <au>
                  <snm>Bernhart</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>2222</fpage>
            <lpage>7</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth229</pubid>
                  <pubid idtype="pmpid" link="fulltext">15073017</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Finding the most significant common sequence and structure motifs in a set of RNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Gorodkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Heyer</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>18</issue>
            <fpage>3724</fpage>
            <lpage>3732</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146942</pubid>
                  <pubid idtype="pmpid" link="fulltext">9278497</pubid>
                  <pubid idtype="doi">10.1093/nar/25.18.3724</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Discovering common stem-loop motifs in unaligned RNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Gorodkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stricklin</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>GD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>10</issue>
            <fpage>2135</fpage>
            <lpage>2144</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">55461</pubid>
                  <pubid idtype="pmpid" link="fulltext">11353083</pubid>
                  <pubid idtype="doi">10.1093/nar/29.10.2135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Dynalign: An algorithm for finding the secondary structure common to two RNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Mathews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Biology</source>
            <pubdate>2002</pubdate>
            <volume>317</volume>
            <issue>2</issue>
            <fpage>191</fpage>
            <lpage>203</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2001.5351</pubid>
                  <pubid idtype="pmpid" link="fulltext">11902836</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The tree-to-tree correction problem</p>
            </title>
            <aug>
               <au>
                  <snm>Tai</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Journal of the ACM</source>
            <pubdate>1979</pubdate>
            <volume>26</volume>
            <fpage>422</fpage>
            <lpage>433</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1145/322139.322143</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>An algorithm for comparing multiple RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>Shapiro</snm>
                  <fnm>BA</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1988</pubdate>
            <volume>4</volume>
            <fpage>387</fpage>
            <lpage>393</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2458170</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Comparing multiple RNA secondary structures using tree comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Shapiro</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>CABIOS</source>
            <pubdate>1990</pubdate>
            <volume>6</volume>
            <fpage>309</fpage>
            <lpage>318</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1701685</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Simple fast algorithms for the editing distance between trees and related problems</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shasha</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>SIAM Journal of Computing</source>
            <pubdate>1989</pubdate>
            <volume>18</volume>
            <issue>6</issue>
            <fpage>1245</fpage>
            <lpage>1262</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A more efficient approximation scheme for tree alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Gusfield</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>SIAM J Comput</source>
            <pubdate>2000</pubdate>
            <volume>30</volume>
            <fpage>283</fpage>
            <lpage>299</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1137/S0097539796313507</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Alignment of trees &#8211; an alternative to tree edit</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Theor Comput Sci</source>
            <pubdate>1995</pubdate>
            <volume>143</volume>
            <fpage>137</fpage>
            <lpage>148</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0304-3975(95)80015-8</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>RNA-related tools on the Bielefeld Bioinformatics Server</p>
            </title>
            <aug>
               <au>
                  <snm>Sczyrba</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kruger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mersch</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>13</issue>
            <fpage>3767</fpage>
            <lpage>3770</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">168982</pubid>
                  <pubid idtype="pmpid" link="fulltext">12824414</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg576</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Local similarity of RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>H&#246;chsmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>T&#246;ller</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kurtz</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Proc of the IEEE Bioinformatics Conference</source>
            <pubdate>2003</pubdate>
            <fpage>159</fpage>
            <lpage>168</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Alignment between two RNA structures</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Lecture Notes in Computer Science</source>
            <pubdate>2001</pubdate>
            <volume>2136</volume>
            <fpage>690</fpage>
            <lpage>703</lpage>
         </bibl>
         <bibl id="B39">
            <title>
               <p>MARNA A server for multiple alignment of RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Siebert</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Backofen</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>In Proceedings of the German Conference on Bioinformatics</source>
            <pubdate>2003</pubdate>
            <fpage>135</fpage>
            <lpage>140</lpage>
         </bibl>
         <bibl id="B40">
            <title>
               <p>A graph theoretical approach for predicting common RNA secondary structure motifs including pseudoknots in unaligned sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Ji</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>10</issue>
            <fpage>1591</fpage>
            <lpage>1602</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth131</pubid>
                  <pubid idtype="pmpid" link="fulltext">14962926</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>A statistical sampling algorithm for RNA secondary structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Ding</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>24</issue>
            <fpage>7280</fpage>
            <lpage>7301</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">297010</pubid>
                  <pubid idtype="pmpid" link="fulltext">14654704</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg938</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>A partition function algorithm for nucleic acid secondary structure, including pseudoknots</p>
            </title>
            <aug>
               <au>
                  <snm>Dirks</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pierce</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Journal of Computational Chemistry</source>
            <pubdate>2003</pubdate>
            <volume>24</volume>
            <fpage>1664</fpage>
            <lpage>1677</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/jcc.10296</pubid>
                  <pubid idtype="pmpid" link="fulltext">12926009</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Design, implementation and evaluation of a practical pseudoknot folding algorithm based on thermodynamics</p>
            </title>
            <aug>
               <au>
                  <snm>Reeder</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>104</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514697</pubid>
                  <pubid idtype="pmpid" link="fulltext">15294028</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-104</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Some measures of comparative performance in the three CASPs</p>
            </title>
            <aug>
               <au>
                  <snm>Venclovas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zemla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fidelis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>1999</pubdate>
            <volume>Suppl 3</volume>
            <fpage>231</fpage>
            <lpage>237</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1097-0134(1999)37:3+&lt;231::AID-PROT30>3.3.CO;2-T</pubid>
                  <pubid idtype="pmpid">10526374</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Comparison of performance in successive CASP experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Venclovas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zemla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fidelis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2001</pubdate>
            <volume>Suppl 5</volume>
            <fpage>163</fpage>
            <lpage>170</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.10053</pubid>
                  <pubid idtype="pmpid" link="fulltext">11835494</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>CAFASP3: the third critical assessment of fully automated structure prediction methods</p>
            </title>
            <aug>
               <au>
                  <snm>Fischer</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rychlewski</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Dunbrack</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ortiz</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Elofsson</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <issue>6</issue>
            <fpage>503</fpage>
            <lpage>516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.10538</pubid>
                  <pubid idtype="pmpid" link="fulltext">14579340</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Assessment of progress over the CASP experiments</p>
            </title>
            <aug>
               <au>
                  <snm>Venclovas</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zemla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fidelis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Moult</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <issue>6</issue>
            <fpage>585</fpage>
            <lpage>595</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/prot.10530</pubid>
                  <pubid idtype="pmpid" link="fulltext">14579350</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Evaluation of gene structure prediction programs</p>
            </title>
            <aug>
               <au>
                  <snm>Burset</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1996</pubdate>
            <volume>34</volume>
            <issue>3</issue>
            <fpage>353</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.1996.0298</pubid>
                  <pubid idtype="pmpid" link="fulltext">8786136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Evaluation of gene prediction software using a genomic data set: application to <it>Arabidopsis thaliana </it>sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Pavy</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dehais</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mathe</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ramana</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Leroy</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <issue>11</issue>
            <fpage>887</fpage>
            <lpage>899</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.11.887</pubid>
                  <pubid idtype="pmpid" link="fulltext">10743555</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>An assessment of gene prediction accuracy in large DNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Guigo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Agarwal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Abril</snm>
                  <fnm>JF</fnm>
               </au>
               <au>
                  <snm>Burset</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fickett</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <issue>10</issue>
            <fpage>1631</fpage>
            <lpage>1642</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.122800</pubid>
                  <pubid idtype="pmpid" link="fulltext">11042160</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>A comprehensive comparison of multiple sequence alignment programs</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Poch</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <issue>13</issue>
            <fpage>2682</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148477</pubid>
                  <pubid idtype="pmpid" link="fulltext">10373585</pubid>
                  <pubid idtype="doi">10.1093/nar/27.13.2682</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Poch</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>87</fpage>
            <lpage>88</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.1.87</pubid>
                  <pubid idtype="pmpid" link="fulltext">10068696</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>BAliBASE (Benchmark Alignment dataBASE): enhancements for repeats, transmembrane sequences and circular permutations</p>
            </title>
            <aug>
               <au>
                  <snm>Bahr</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thompson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thierry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Poch</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>323</fpage>
            <lpage>326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29792</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125126</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.323</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Quality assessment of multiple alignment programs</p>
            </title>
            <aug>
               <au>
                  <snm>Lassmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>FEBS Lett</source>
            <pubdate>2002</pubdate>
            <volume>529</volume>
            <fpage>126</fpage>
            <lpage>130</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0014-5793(02)03189-7</pubid>
                  <pubid idtype="pmpid" link="fulltext">12354624</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Gene regulation by riboswitches</p>
            </title>
            <aug>
               <au>
                  <snm>Mandal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Breaker</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nat Rev Mol Cell Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>6</issue>
            <fpage>451</fpage>
            <lpage>463</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrm1403</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Riboswitches exert genetic control through metabolite-induced conformational change</p>
            </title>
            <aug>
               <au>
                  <snm>Soukup</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Soukup</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>3</issue>
            <fpage>344</fpage>
            <lpage>349</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.sbi.2004.04.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">15193315</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Riboswitches: the oldest mechanism for the regulation of gene expression?</p>
            </title>
            <aug>
               <au>
                  <snm>Vitreschak</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rodionov</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Mironov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gelfand</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>44</fpage>
            <lpage>50</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2003.11.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">14698618</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Prediction and visualization of structural switches in RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Haase</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rehmsmeier</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Pacific Symposium on Biocomputing</source>
            <pubdate>1999</pubdate>
            <fpage>126</fpage>
            <lpage>137</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10380191</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Evaluating the predictability of conformational switching in RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Voss</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <fpage>1573</fpage>
            <lpage>82</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth129</pubid>
                  <pubid idtype="pmpid" link="fulltext">14962925</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Riboswitch finder &#8211; a tool for identification of riboswitch RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Bengert</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Dandekar</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Web Server issue</issue>
            <fpage>W154</fpage>
            <lpage>159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441490</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215370</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>The accuracy of ribosomal RNA comparative structure models</p>
            </title>
            <aug>
               <au>
                  <snm>Gutell</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Connone</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>301</fpage>
            <lpage>310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-440X(02)00339-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">12127448</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>PHYLIP (Phylogeny inference package) version 3.6a3</p>
            </title>
            <aug>
               <au>
                  <snm>Felsenstein</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Department of Genome Sciences, University of Washington, Seattle</source>
            <pubdate>2002</pubdate>
            <note>[Distributed by the author].</note>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Assessing the accuracy of prediction algorithms for classication: an overview</p>
            </title>
            <aug>
               <au>
                  <snm>Baldi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chauvin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <fpage>412</fpage>
            <lpage>424</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.5.412</pubid>
                  <pubid idtype="pmpid" link="fulltext">10871264</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Dowell</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>71</fpage>
            <lpage>71</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">442121</pubid>
                  <pubid idtype="pmpid" link="fulltext">15180907</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-71</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Using an RNA secondary structure partition function to determine confidence in base pairs predicted by free energy minimization</p>
            </title>
            <aug>
               <au>
                  <snm>Mathews</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2004</pubdate>
            <volume>10</volume>
            <issue>8</issue>
            <fpage>1178</fpage>
            <lpage>1190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1261/rna.7650904</pubid>
                  <pubid idtype="pmpid" link="fulltext">15272118</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>Statistics of RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>Fontana</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Konings</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Stadler</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Biopolymers</source>
            <pubdate>1993</pubdate>
            <volume>33</volume>
            <issue>9</issue>
            <fpage>1389</fpage>
            <lpage>1404</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7691201</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Metrics on RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>Moulton</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Steel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pointon</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Penny</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2000</pubdate>
            <volume>7</volume>
            <issue>1&#8211;2</issue>
            <fpage>277</fpage>
            <lpage>292</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/10665270050081522</pubid>
                  <pubid idtype="pmpid" link="fulltext">10890402</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Expanded sequence dependence of thermodynamic parameters provides robust prediction of RNA secondary structure</p>
            </title>
            <aug>
               <au>
                  <snm>Mathews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sabina</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>288</volume>
            <fpage>911</fpage>
            <lpage>940</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.2700</pubid>
                  <pubid idtype="pmpid" link="fulltext">10329189</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>A comparison of thermodynamic foldings with comparatively derived structures of 16S and 16S-like rRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Konings</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gutell</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>1995</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>559</fpage>
            <lpage>574</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7489516</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>An analysis of large rRNA sequences folded by a thermodynamic method</p>
            </title>
            <aug>
               <au>
                  <snm>Fields</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Gutell</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Fold Des</source>
            <pubdate>1996</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>419</fpage>
            <lpage>430</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9080188</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Doshi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cannone</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cobaugh</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gutell</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>105</fpage>
            <lpage>105</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514602</pubid>
                  <pubid idtype="pmpid" link="fulltext">15296519</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-5-105</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Mfold</p>
            </title>
            <url>http://www.bioinfo.rpi.edu/applications/mfold/</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>RNAfold</p>
            </title>
            <url>http://www.tbi.univie.ac.at/~ivo/RNA/</url>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Sfold</p>
            </title>
            <url>http://www.bioinfo.rpi.edu/applications/sfold/srna.pl</url>
         </bibl>
         <bibl id="B75">
            <title>
               <p>The equilibrium partition function and base pair binding probabilities for RNA secondary structures</p>
            </title>
            <aug>
               <au>
                  <snm>McCaskill</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Biopolymers</source>
            <pubdate>1990</pubdate>
            <volume>29</volume>
            <fpage>1105</fpage>
            <lpage>1119</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1695107</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B76">
            <title>
               <p>RNAalifold</p>
            </title>
            <url>http://www.tbi.univie.ac.at/~ivo/RNA/</url>
         </bibl>
         <bibl id="B77">
            <title>
               <p>Pfold</p>
            </title>
            <url>http://www.daimi.au.dk/~compbio/rnafold/</url>
         </bibl>
         <bibl id="B78">
            <title>
               <p>ILM</p>
            </title>
            <url>http://www.cs.wustl.edu/~zhang/projects/rna/ilm/</url>
         </bibl>
         <bibl id="B79">
            <title>
               <p>Algorithms for loop matchings</p>
            </title>
            <aug>
               <au>
                  <snm>Nussinov</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Piecznik</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Grigg</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Kleitman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>SIAM Journal on Applied Mathematics</source>
            <pubdate>1978</pubdate>
            <volume>35</volume>
            <fpage>68</fpage>
            <lpage>82</lpage>
         </bibl>
         <bibl id="B80">
            <title>
               <p>Finding the common structure shared by two homologous RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Perriquet</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Touzet</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dauchet</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <fpage>108</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/19.1.108</pubid>
                  <pubid idtype="pmpid" link="fulltext">12499300</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B81">
            <title>
               <p>CARNAC: folding families of related RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Touzet</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Perriquet</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>Web Server issue</issue>
            <fpage>W142</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">441553</pubid>
                  <pubid idtype="pmpid" link="fulltext">15215367</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B82">
            <title>
               <p>FOLDalign</p>
            </title>
            <url>http://www.bioinf.au.dk/FOLDALIGN/</url>
         </bibl>
         <bibl id="B83">
            <title>
               <p>Identification of consensus patterns in unaligned DNA sequences known to be functionally related</p>
            </title>
            <aug>
               <au>
                  <snm>Hertz</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hartzell</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Stormo</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1990</pubdate>
            <volume>6</volume>
            <fpage>81</fpage>
            <lpage>92</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2193692</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B84">
            <title>
               <p>Molecular control of vertebrate iron metabolism: mRNA-based regulatory circuits operated by iron, nitric oxide, and oxidativestress</p>
            </title>
            <aug>
               <au>
                  <snm>Hentze</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <fpage>8175</fpage>
            <lpage>8182</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">38642</pubid>
                  <pubid idtype="pmpid" link="fulltext">8710843</pubid>
                  <pubid idtype="doi">10.1073/pnas.93.16.8175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B85">
            <title>
               <p>Dynalign</p>
            </title>
            <url>http://rna.urmc.rochester.edu/</url>
         </bibl>
         <bibl id="B86">
            <title>
               <p>Fast evaluation of internal loops in RNA secondary structure prediction</p>
            </title>
            <aug>
               <au>
                  <snm>Lyngs0</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <issue>6</issue>
            <fpage>440</fpage>
            <lpage>445</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/15.6.440</pubid>
                  <pubid idtype="pmpid" link="fulltext">10383469</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B87">
            <title>
               <p>caRNAc</p>
            </title>
            <url>http://bioinfo.lifl.fr/carnac/</url>
         </bibl>
         <bibl id="B88">
            <title>
               <p>RNAforester</p>
            </title>
            <url>http://bibiserv.techfak.uni-bielefeld.de/rnaforester/</url>
         </bibl>
         <bibl id="B89">
            <title>
               <p>MARNA</p>
            </title>
            <url>http://www.bio.inf.uni-jena.de/Software/MARNA/index.html</url>
         </bibl>
         <bibl id="B90">
            <title>
               <p>A general edit distance between RNA structures</p>
            </title>
            <aug>
               <au>
                  <snm>Jiang</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Journal of Computational Biology</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>2</issue>
            <fpage>371</fpage>
            <lpage>388</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1089/10665270252935511</pubid>
                  <pubid idtype="pmpid" link="fulltext">12015887</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B91">
            <title>
               <p>A memory-efficient dynamic programming algorithm for optimal structural alignment of a sequence to an RNA secondary structure</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>18</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">119854</pubid>
                  <pubid idtype="pmpid" link="fulltext">12095421</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-3-18</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B92">
            <title>
               <p>RSEARCH: finding homologs of single structured RNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Klein</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>44</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">239859</pubid>
                  <pubid idtype="pmpid" link="fulltext">14499004</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-44</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B93">
            <title>
               <p>RNA secondary structure: physical and computational aspects</p>
            </title>
            <aug>
               <au>
                  <snm>Higgs</snm>
                  <fnm>PG</fnm>
               </au>
            </aug>
            <source>Quarterly Reviews of BioPhysics</source>
            <pubdate>2000</pubdate>
            <volume>33</volume>
            <issue>3</issue>
            <fpage>199</fpage>
            <lpage>253</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0033583500003620</pubid>
                  <pubid idtype="pmpid">11191843</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B94">
            <title>
               <p>The language of RNA: a formal grammar that includes pseudoknots</p>
            </title>
            <aug>
               <au>
                  <snm>Rivas</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>4</issue>
            <fpage>334</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.4.334</pubid>
                  <pubid idtype="pmpid" link="fulltext">10869031</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B95">
            <title>
               <p>Pure multiple RNA secondary structure alignments: A progressive profile approach</p>
            </title>
            <aug>
               <au>
                  <snm>H&#246;chsmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>IEEE/ACM Transactions on Computational Biology and Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>1</volume>
            <fpage>53</fpage>
            <lpage>62</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1109/TCBB.2004.11</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B96">
            <title>
               <p>Abstract shapes of RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Giegerich</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rehmsmeier</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>NAR</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>4843</fpage>
            <lpage>4851</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1093/nar/gkh779</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B97">
            <title>
               <p>Infernal</p>
            </title>
            <url>http://www.genetics.wustl.edu/eddy/infernal/</url>
         </bibl>
         <bibl id="B98">
            <title>
               <p>Rfam: an RNA family database</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <fpage>439</fpage>
            <lpage>441</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165453</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520045</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg006</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B99">
            <title>
               <p>Rfam</p>
            </title>
            <url>http://www.sanger.ac.uk/Software/Rfam/index.shtml</url>
         </bibl>
         <bibl id="B100">
            <title>
               <p>Co-transcriptional folding is encoded within RNA genes</p>
            </title>
            <aug>
               <au>
                  <snm>Meyer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Mikl&#243;s</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>BMC Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>10</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">514895</pubid>
                  <pubid idtype="pmpid" link="fulltext">15298702</pubid>
                  <pubid idtype="doi">10.1186/1471-2199-5-10</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B101">
            <title>
               <p>The European large subunit ribosomal RNA database</p>
            </title>
            <aug>
               <au>
                  <snm>Wuyts</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>De Rijk</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Winkelmans</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>De Wachter</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>175</fpage>
            <lpage>177</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29789</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125083</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B102">
            <title>
               <p>The European large subunit ribosomal RNA database</p>
            </title>
            <url>http://oberon.fvms.ugent.be:8080/rRNA/lsu/</url>
         </bibl>
         <bibl id="B103">
            <title>
               <p>The comparative RNA web (CRW) site: An online database of comparative sequence and structure information for ribosomal, intron, and other RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Cannone</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Subramanian</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Schnare</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Collett</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>D'Souza</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Du</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Madabusi</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pande</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Shang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gutell</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>BMC</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>2</fpage>
            <xrefbib>
               <pubid idtype="doi">10.1186/1471-2105-3-2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B104">
            <title>
               <p>Gutell lab comparative RNA web site</p>
            </title>
            <url>http://www.rna.icmb.utexas.edu/</url>
         </bibl>
         <bibl id="B105">
            <title>
               <p>The European database on small subunit ribosomal RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Wuyts</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Van de Peer</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Winkelmans</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>De Wachter</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <fpage>183</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99113</pubid>
                  <pubid idtype="pmpid" link="fulltext">11752288</pubid>
                  <pubid idtype="doi">10.1093/nar/30.1.183</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B106">
            <title>
               <p>The European database on small subunit ribosomal RNA</p>
            </title>
            <url>http://oberon.fvms.ugent.be:8080/rRNA/ssu/</url>
         </bibl>
         <bibl id="B107">
            <title>
               <p>The ribonuclease P database</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>314</fpage>
            <lpage>314</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148169</pubid>
                  <pubid idtype="pmpid" link="fulltext">9847214</pubid>
                  <pubid idtype="doi">10.1093/nar/27.1.314</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B108">
            <title>
               <p>The ribonuclease P database</p>
            </title>
            <url>http://www.mbio.ncsu.edu/RNaseP/home.html</url>
         </bibl>
         <bibl id="B109">
            <title>
               <p>A simple model for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Evolution</source>
            <pubdate>1980</pubdate>
            <volume>16</volume>
            <fpage>111</fpage>
            <lpage>120</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7463489</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
