<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-5-5</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Identification and comparative analysis of components from the signal recognition particle in protozoa and fungi</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Rosenblad</snm>
               <mnm>Alm</mnm>
               <fnm>Magnus</fnm>
               <insr iid="I1"/>
               <email>magnus.alm@medkem.gu.se</email>
            </au>
            <au id="A2">
               <snm>Zwieb</snm>
               <fnm>Christian</fnm>
               <insr iid="I2"/>
               <email>zwieb@uthct.edu</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Samuelsson</snm>
               <fnm>Tore</fnm>
               <insr iid="I1"/>
               <email>tore.samuelsson@medkem.gu.se</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Medical Biochemistry, Goteborg University, Box 440, SE-405 30 Goteborg, Sweden</p>
            </ins>
            <ins id="I2">
               <p>Department of Molecular Biology, The University of Texas Health Science Center at Tyler, 11937 US Highway 271, Tyler TX 75708-3154, U.S.A</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2004</pubdate>
         <volume>5</volume>
         <issue>1</issue>
         <fpage>5</fpage>
         <url>http://www.biomedcentral.com/1471-2164/5/5</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">14720308</pubid>
               <pubid idtype="doi">10.1186/1471-2164-5-5</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>25</day>
               <month>9</month>
               <year>2003</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>13</day>
               <month>1</month>
               <year>2004</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>13</day>
               <month>1</month>
               <year>2004</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2004</year>
         <collab>Rosenblad et al; licensee BioMed Central Ltd. This is an Open Access article: verbatim copying and redistribution of this article are permitted in all media for any purpose, provided this notice is preserved along with the article's original URL.</collab>
      </cpyrt>
      <kwdg>
         <kwd>SRP protozoa fungi RNA</kwd>
      </kwdg>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The signal recognition particle (SRP) is a ribonucleoprotein complex responsible for targeting proteins to the ER membrane. The SRP of metazoans is well characterized and composed of an RNA molecule and six polypeptides. The particle is organized into the S and Alu domains. The Alu domain has a translational arrest function and consists of the SRP9 and SRP14 proteins bound to the terminal regions of the SRP RNA. So far, our understanding of the SRP and its evolution in lower eukaryotes such as protozoa and yeasts has been limited. However, genome sequences of such organisms have recently become available, and we have now analyzed this information with respect to genes encoding SRP components.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A number of SRP RNA and SRP protein genes were identified by an analysis of genomes of protozoa and fungi. The sequences and secondary structures of the Alu portion of the RNA were found to be highly variable. Furthermore, proteins SRP9/14 appeared to be absent in certain species. Comparative analysis of the SRP RNAs from different <it>Saccharomyces </it>species resulted in models which contain features shared between all SRP RNAs, but also a new secondary structure element in SRP RNA helix 5. Protein SRP21, previously thought to be present only in <it>Saccharomyces</it>, was shown to be a constituent of additional fungal genomes. Furthermore, SRP21 was found to be related to metazoan and plant SRP9, suggesting that the two proteins are functionally related.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>Analysis of a number of not previously annotated SRP components show that the SRP Alu domain is subject to a more rapid evolution than the other parts of the molecule. For instance, the RNA portion is highly variable and the protein SRP9 seems to have evolved into the SRP21 protein in fungi. In addition, we identified a secondary structure element in the <it>Sacccharomyces </it>RNA that has been inserted close to the Alu region. Together, these results provide important clues as to the structure, function and evolution of SRP.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The mammalian signal recognition particle (SRP) plays a critical role in targeting of proteins to the ER membrane. SRP first binds the N-terminal signal sequence of the nascent chain as it appears on the surface of translating ribosomes. As a result, protein synthesis is arrested and the ribosome-nascent chain-SRP complex is targeted to the ER membrane through interaction with the SRP receptor <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In a series of events that are accompanied by GTP hydrolysis, the SRP is released, protein synthesis is resumed and translocation of the secretory protein is initiated.</p>
         <p>The mammalian SRP is composed of six polypeptides named SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72 which form a complex with a single RNA molecule (originally referred to as 7SL RNA) of approximately 300 nucleotide residues. The S domain (Fig. <figr fid="F1">1</figr>) of SRP is responsible for signal sequence recognition and contains the central region of SRP RNA and proteins SRP19, SRP54, SRP68 and SRP72. SRP54 is a highly conserved protein which is responsible for signal sequence binding and it interacts with the helix 8 region of the RNA. The Alu domain of the SRP (Fig. <figr fid="F1">1</figr>) functions in translational arrest and is composed of proteins SRP9/14 bound to the terminal regions of SRP RNA <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. High-resolution three-dimensional structures of the Alu domain and critical parts of the S domain were obtained recently <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp> and provided considerable insight into structure and function of the mammalian SRP.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Schematic view of SRP in metazoans</p>
            </caption>
            <text>
               <p><b>Schematic view of SRP in metazoans</b>. The protein subunits with molecular weights 9, 14, 19, 54, 68 and 72 kDa are shown as well as RNA helix numbers 3, 4, 5, 6 and 8 that are referred to in the text. The human SRP RNA sequence is shown. The Alu domain contains the 5' and 3' terminal regions of the RNA as well as the SRP9 and 14 proteins. The S domain contains the central region of the RNA and the four remaining SRP proteins.</p>
            </text>
            <graphic file="1471-2164-5-5-1"/>
         </fig>
         <p>Components of the SRP have been identified in all three domains of life <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. The genomes of the Archaea were shown to contain SRP RNAs which closely resemble the sequences and secondary structures of the SRP RNAs of metazoans, but only two SRP protein genes (SRP19 and SRP54) could be identified <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The bacterial SRP consists of protein SRP54 (referred to as <it>Ffh</it>) and a 4.5S RNA which corresponds in large part to SRP RNA helix 8 of mammalian SRP. Significantly larger bacterial SRP RNAs (6S RNAs) which contain an Alu-like region are present in a restricted number of taxa such as <it>Bacillus </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. A rationale for the high level of conservation of SRP54 and SRP RNA helix 8 in every SRP has been provided by the high-resolution structure of the <it>E. coli </it>SRP which suggested that the signal peptide binds within a hydrophobic groove formed by the M-domain of SRP54 as well as to SRP RNA <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>.</p>
         <p>Our understanding of the SRP components of the lower eukaryotes has suffered from a lack of data required for comparative sequence analysis. In addition, the greater diversity of this phylogenetic group has made it difficult to identify the SRP RNAs and SRP proteins even after the genome sequences became available. For instance, despite the detailed biochemical characterization of the yeast <it>S. cerevisiae </it>SRP <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, an understanding of its structure has been hampered by the fact that yeast SRP RNA is nearly twice as long (519 nucleotide residues) with no obvious homology to other known SRP RNA sequences <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>Here we report an analysis of protozoan and fungal genomes to identify several not previously annotated SRP components. We have been able to identify several novel SRP RNAs and compare their two-dimensional structures. The analysis has lead us to propose that protein SRP21 of <it>S. cerevisiae </it>is a homolog of SRP9 and thus might form a heterodimer with SRP14 which binds to the Alu domain. These studies provide not only inroads into the comprehensive molecular characterization of the SRP but also clues as to the early evolution and origin of SRP and its Alu domain.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <p>In aiming to produce a comprehensive inventory of SRP components in protozoa and fungi we considered Euglenozoans (<it>Entosiphon</it>, <it>Trypanosoma</it>, and <it>Leishmania), </it>Alveolata (<it>Plasmodium</it>, <it>Eimeria</it>, <it>Theileria</it>), <it>Chlamydomonas</it>, <it>Giardia</it>, <it>Entamoeba </it>and <it>Encephalitozoon</it>. Complete genome sequences and preliminary gene annotation were available for <it>P. falciparum </it><url>http://www.plasmodb.org</url>, <it>C. reinhardtii </it><url>http://genome.jgi-psf.org/chlre1/chlre1.home.html</url>, and <it>Encephalitozoon cuniculi </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Significant portions of the other genomes had been sequenced as indicated in Table <tblr tid="T1">1</tblr>. For <it>Entosiphon </it>only a very limited amount of sequence data was available. A schematic phylogenetic tree involving the organisms discussed here is shown in Fig. <figr fid="F2">2</figr>. An overview of the results of our inventory of SRP RNA and proteins in protozoa and fungi is shown in Table <tblr tid="T1">1</tblr>. A significant number of these were not previously annotated.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Overview of inventory of SRP in protozoa and fungi. SRP RNAs and proteins were predicted as described in the text. Symbols are as follows: +) subunit found, -) subunit not found and genome complete, *) previously reported subunit, M) multiple SRP RNA-like sequences were found and P) only partial RNA sequence found. Empty cells are instances where subunit has not been found and where there is no complete genome assembly and preliminary gene annotation.</p>
            </caption>
            <tblbdy cols="11">
               <r>
                  <c ca="left">
                     <p>Taxonomic group</p>
                  </c>
                  <c ca="left">
                     <p>Organism</p>
                  </c>
                  <c ca="center">
                     <p>Genome size (MB)</p>
                  </c>
                  <c ca="center">
                     <p>Genome sequence data available in this work (MB)</p>
                  </c>
                  <c ca="center">
                     <p>SRP RNA</p>
                  </c>
                  <c ca="center">
                     <p>SRP9/21</p>
                  </c>
                  <c ca="center">
                     <p>SRP14</p>
                  </c>
                  <c ca="center">
                     <p>SRP19</p>
                  </c>
                  <c ca="center">
                     <p>SRP54</p>
                  </c>
                  <c ca="center">
                     <p>SRP68</p>
                  </c>
                  <c ca="center">
                     <p>SRP72</p>
                  </c>
               </r>
               <r>
                  <c cspan="11">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Euglenozoa</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Leishmania major</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>34</p>
                  </c>
                  <c ca="center">
                     <p>43.7</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Trypanosoma cruzi</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>?</p>
                  </c>
                  <c ca="center">
                     <p>43.0</p>
                  </c>
                  <c ca="center">
                     <p>P</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Alveolata</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Plasmodium falciparum</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>23</p>
                  </c>
                  <c ca="center">
                     <p>23.1</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Theileria annulata</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>10</p>
                  </c>
                  <c ca="center">
                     <p>8.5</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Eimeria tenella</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>60</p>
                  </c>
                  <c ca="center">
                     <p>52.3</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Viridiplantae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Chlamydomonas reinhardtii</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>95?</p>
                  </c>
                  <c ca="center">
                     <p>97.3</p>
                  </c>
                  <c ca="center">
                     <p>M</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Diplomonadida</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Giardia lamblia</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>120</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Entamoebidae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Entamoeba histolytica</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>20</p>
                  </c>
                  <c ca="center">
                     <p>86.7</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Fungi-Microsporidia</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Encephalitozoon cuniculi</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>2.5</p>
                  </c>
                  <c ca="center">
                     <p>2.5</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>Fungi-Ascomycota</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>S. pombe</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>14</p>
                  </c>
                  <c ca="center">
                     <p>14</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Neurospora crassa</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>40</p>
                  </c>
                  <c ca="center">
                     <p>38</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Aspergillus nidulans</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>28</p>
                  </c>
                  <c ca="center">
                     <p>30</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Candida albicans</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>16</p>
                  </c>
                  <c ca="center">
                     <p>17.8</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
                  <c ca="center">
                     <p>+</p>
                  </c>
               </r>
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Saccharomyces</it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>12</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
                  <c ca="center">
                     <p>+*</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Phylogenetic tree</p>
            </caption>
            <text>
               <p><b>Phylogenetic tree. </b>A schematic tree is shown that includes the yeasts and protozoa referred to in the text. It was based on the tree shown in Baldauf et al <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Branch lengths are not proportional to evolutionary distance.</p>
            </text>
            <graphic file="1471-2164-5-5-2"/>
         </fig>
         <sec>
            <st>
               <p>Identification and analysis of SRP RNA genes</p>
            </st>
            <p>To predict SRP RNA genes from protozoans and fungi we used a method previously described <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The first step is a heuristic pattern-based search for conserved features of the helix 8 region as described under "Methods". The pattern was relatively degenerate and in many cases the result included a number of false positives. In a second step the candidates from the pattern-based search were screened with a COVE model of SRP RNA. COVE is an implementation of the algorithms described by Eddy &amp; Durbin <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> that make use of probabilistic models to describe the sequence and secondary structure consensus of an RNA family. The COVE analysis is much more stringent than the first pattern-based step and we expect very few false positives among the high-scoring hits from this analysis.</p>
            <p>The identified SRP RNA candidates aligned well to a COVE model for eukaryotic RNAs in the conserved S domain region, i.e. the part that corresponded to the helices 5, 6 and 8. The Alu domain displayed a higher degree of variation and in many instances did not align well to the COVE model. As a consequence, the prediction of the 5' and 3' ends of the molecule were in most cases unreliable.</p>
            <p>The secondary structure of all candidates were also predicted with MFOLD <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> with default parameters or specifying constraints consistent with known conserved elements of SRP RNA. MFOLD was able to fold the S domain of the SRP RNA in a manner which was consistent with the secondary structure predicted by COVE. However, when used without constraints, MFOLD typically predicted a secondary structure for the Alu domain that was inconsistent with our identifications. As a consequence, for the prediction of Alu domains as well as their folding, we relied on consensus features, such as the presence of a conserved sequence motif UGUNR (where N is any base and R is purine, typically an A) motif and the general secondary structure outline (Fig. <figr fid="F1">1</figr>). In summary, in our prediction and folding of SRP RNA we combined pattern matching, COVE, and MFOLD, and we checked that known consensus motifs of SRP RNAs were present in the predicted RNAs. Finally, we used BLAST to show that the predicted SRP RNA genes did not overlap with predicted protein-coding regions or any other annotated features. Therefore, we believe that the final candidates presented here represent sequences that are evolutionary related to SRP RNA. Still, it should be noted that we cannot distinguish between a bona fide SRP RNA gene and pseudogenes that are known to occur in plants <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B17">17</abbr></abbrgrp> and in mammals. Examples are the two SRP RNA gene candidates that we identified in the <it>C. reinhardtii </it>genome. The covariance models did not allow us to predict the 5' and 3' ends and the folding of the Alu domain of these two sequences. Therefore, it remains to be seen which of these candidates, if any, represents a functional RNA.</p>
            <p>We were not able to identify an SRP RNA in <it>Giardia lamblia </it>and <it>Entamoeba histolytica</it>. However, as SRP proteins were identified in these organisms we expect that, as the genomes are completed, SRP RNA genes will be discovered.</p>
         </sec>
         <sec>
            <st>
               <p>SRP RNAs of Euglenozoa and Alveolates display a large variation in the Alu domain</p>
            </st>
            <p>An SRP RNA gene was identified in <it>Entosiphon sulcatum </it>(Fig. <figr fid="F3">3</figr>). Its Alu domain was found to contain a very short helix 4 and in this respect resembled the structure of the Alu domain of the trypanosomatids. This relationship was consistent with the known close evolutionary relationship between euglenids and trypanosomatids <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>). Interestingly, the <it>E. sulcatum </it>SRP RNA gene was shown to be part of a cluster which also contained genes for 5S rRNA, U1, U2 and U5 snRNAs <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>). This gene organization is reminiscent of that of <it>T. brucei </it>and <it>Leishmania </it>where SRP RNA genes were found to be located adjacent to other RNA genes <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Predicted secondary structures of protozoan SRP RNA Alu domains</p>
               </caption>
               <text>
                  <p><b>Predicted secondary structures of protozoan SRP RNA Alu domains. </b>Members of the groups Euglenozoa, Alveolata, and Fungi/Microsporidia are shown. The conserved UGUNR motif characteristic of the Alu domain is indicated by shading. Model of <it>E. cuniculi </it>is highly tentative. The complete structures including the S domain are shown in the web supplement at <url>http://bio.lundberg.gu.se/srp03/</url>.</p>
               </text>
               <graphic file="1471-2164-5-5-3"/>
            </fig>
            <p>In the group of the Alveolates we found an <it>Eimeria tenella </it>SRP RNA (Fig. <figr fid="F3">3</figr>) with a predicted Alu domain structure similar to that of the metazoans. Interestingly, the Alu domain of <it>Theileria annulata </it>was reminiscent of the SRP RNA Alu domain previously identified in the Ciliophora <it>Tetrahymena </it><abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, in the respect that the helix 4 appeared to be absent.</p>
            <p>It has previously been reported that the genome of the malaria parasite <it>Plasmodium falciparum </it>encodes several SRP proteins <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Here, we were able to identify the corresponding SRP RNA (Fig. <figr fid="F3">3</figr>). The secondary structure of the Alu domain of this RNA was predicted by combining COVE and MFOLD procedures. In addition, the RNA of two other Plasmodium species, <it>P. yoelii </it>and <it>P. knowlesi</it>, were predicted to form the same structure despite significant differences in their primary sequences (Fig. <figr fid="F3">3</figr>). The Alu domain of <it>Plasmodium </it>SRP RNA was different in that it possessed an internal loop in helix 4. Therefore, even within the Alveolates, a considerable variation in the predicted folding of the Alu domain was observed.</p>
         </sec>
         <sec>
            <st>
               <p>Saccharomyces SRP RNAs has an insert in helix 5 adjacent to a highly conserved Alu hairpin motif</p>
            </st>
            <p>We previously identified SRP RNA genes in <it>C. albicans </it>and <it>N. crassa </it><abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Here we also found an SRP RNA candidate in <it>Aspergillus nidulans</it>. As shown for <it>Yarrowia </it>SRP RNA in Fig. <figr fid="F4">4</figr>, in all of these fungi, including <it>S. pombe, </it>the SRP RNA secondary structure were shown to be very similar. For the unusually large (519 nts) SRP RNA of <it>S. cerevisiae</it><abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, the COVE model predicted an S domain with helices 5, 6 and 8 as for other eukaryotic RNAs. MFOLD also folded this part of the molecule in accordance with the consensus 2D structure of the S domain. As the 5' and 3' terminal sequences were shown to be related to Alu, we concluded that the <it>S. cerevisiae </it>SRP RNA contained at least one insert as compared to other fungi. Secondary structures for the SRP RNAs including these inserts were constructed for <it>S. mikatae</it>, <it>S. kudriavzevii</it>, <it>S. bayanus</it>, <it>S. castellii </it>and <it>S. kluyveri. </it>The sequences were identified by BLAST using the <it>S. cerevisiae </it>sequence as query. For the prediction of the 5' end of the RNA we took advantage of the fact that the highly conserved Alu domain was present at the very 5' end of the RNA. For prediction of the 3' end we considered a T-rich region which was conserved in all six <it>Saccharomyces </it>strains and likely is part of a transcription termination signal.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Proposed secondary structures of <it>Saccharomyces </it>SRP RNAs</p>
               </caption>
               <text>
                  <p><b>Proposed secondary structures of <it>Saccharomyces </it>SRP RNAs. </b>Models of the RNAs of <it>S. kluyveri</it>, <it>S. cerevisiae </it>and <it>Y. lipolytica </it>are shown. <it>Saccharomyces </it>RNAs have regions, 5c-g and 5h-i (shaded) not found in other fungi. Inset with shaded background with helices 5c-g are showing compensatory base changes in this domain (bases with white background). Also indicated is the location of the 5c-g insert as related to <it>Yarrowia </it>(arrow) as well as conserved sequence elements of the fungal Alu domain (box). The complete models of other <it>Saccharomyces </it>species are shown in the web supplement at <url>http://bio.lundberg.gu.se/srp03/</url>.</p>
               </text>
               <graphic file="1471-2164-5-5-4"/>
            </fig>
            <p>Since the number of available <it>Saccharomyces </it>SRP RNA sequences was too low for using covariation or mutual information analysis, and a COVE model that would predict the pairing in the insert regions could not be obtained, MFOLD was used first with each full-length sequence to predict the secondary structure. As expected, all <it>Saccharomyces </it>sequences folded into structures which contained helices 5, 6 and 8 as predicted for <it>S. cerevisiae</it>. Furthermore, all <it>Saccharomyces </it>RNAs possessed a hairpin structure with the Alu UGUNR motif at the 5' end similar to what was observed in the Alu domains of the other fungi. A multiple alignment was obtained using procedures described under Methods and is available in the web supplement to this paper <url>http://bio.lundberg.gu.se/srp03/</url>.</p>
            <p>As for the SRP RNA insertions specific to <it>Saccharomyces</it>, MFOLD predicted the structure shown in Fig. <figr fid="F4">4</figr> containing the helices that we here refer to as 5c-g (Fig. <figr fid="F4">4</figr>). The helices 5h-i were also characteristic of the <it>Saccharomyces </it>RNAs. Smaller corresponding inserts reminiscent of these were found in <it>Yarrowia</it>, <it>Neurospora </it>and <it>Aspergillus</it>. The predicted secondary structure of the 5c-g region was very similar in all <it>Saccharomyces </it>species although there was significant variation in sequence. The bases involved in compensatory base changes in this part of the RNA (Fig. <figr fid="F4">4</figr>) offer support to the predicted folding.</p>
            <p>The 5c-g insertion appeared to be specific to <it>Saccharomyces </it>and indicated that at some point during evolution this piece of the RNA was inserted near the 5' end as indicated in Fig. <figr fid="F4">4</figr>. The evolution of <it>Saccharomyces </it>also involved the enlargement of helices 5h and 5i. One may speculate that these species have developed some additional mechanisms related to SRP-mediated translational arrest or additional unknown functions.</p>
            <p>Our findings suggested that all fungal SRP RNAs contain a hairpin structure with a Alu UGUNR motif at the 5' end. This part of the RNA was found to be conserved and there was phylogenetic support for the hairpin structure from the covariations indicated in Fig. <figr fid="F4">4</figr>. The fungal hairpin motif was distinct from all other known non-fungal Alu domains where the conserved UGUNR motif was found to be part of a more elaborate pseudoknot.</p>
            <p>It has been shown previously that a 5' terminal 99 nt fragment of the <it>S. cerevisiae </it>RNA was able to bind <it>in vitro </it>to the SRP14 protein. A tentative model of this portion of the SRP RNA has been presented where only the 5' terminal portion formed the Alu domain <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. However, based on the analysis presented here it is likely that also the 3' portion is part of the Alu domain.</p>
         </sec>
         <sec>
            <st>
               <p>An SRP RNA candidate in <it>E. cuniculi</it></p>
            </st>
            <p>There is strong evidence that the Microsporidia, such as <it>E. cuniculi</it>, are phylogenetically related to Fungi <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. An analysis of the <it>E. cuniculi </it>genome revealed a candidate SRP RNA in a 396 nt intergenic region (chromosome X, positions 138833&#8211;139226) which contained helices 6 and 8, and aligned well with our eukaryotic COVE model. However, we were unable to identify a typical metazoan Alu domain. Fig. <figr fid="F3">3</figr> shows a tentative model of the Alu domain which is similar to that of other fungi.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of SRP proteins</p>
            </st>
            <p>We used a range of tools to identify and inventory SRP proteins in protozoa and fungi. Genome sequences were analyzed for genes encoding SRP proteins using BLAST <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> or FASTA <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> using previously known eukaryotic SRP proteins as query sequences. In addition, we performed PSI-BLAST <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> searches where a SRP protein sequence, typically the human ortholog, was used to search a database with the proteins in a public protein sequence database combined with the proteins obtained by translating all possible open reading frames of the genome being analyzed. Genomes were also analyzed using Genscan <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> or GlimmerM <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and predicted peptide sequences were used in a BLAST or PSI-BLAST procedure as above. The results of our findings are shown in Table <tblr tid="T1">1</tblr>. One should keep in mind that several genomes were incomplete although large portions were covered by raw sequence data or contigs. The fact that we failed to identify a certain component of SRP was therefore not entirely conclusive in these cases (indicated by empty cells in Table <tblr tid="T1">1</tblr>).</p>
            <p>As previously noted for other species, SRP54 was shown to be ubiquitous and highly conserved. Also SRP19, SRP68 and SRP72 were found in most of the genomes analyzed here (Table <tblr tid="T1">1</tblr>) although we were unable to identify a SRP72 homolog among the Alveolates. In <it>E. cuniculi </it>we failed to identify SRP68/72, as described further below. We obtained evidence that the SRP9/14 proteins were subject to a more rapid evolution, as discussed in the following.</p>
         </sec>
         <sec>
            <st>
               <p>Absence of SRP9/14 in certain protozoa and Microsporidia</p>
            </st>
            <p>SRP9 and 14 homologs were identified in <it>Plasmodium falciparum </it>and <it>Chlamydomonas reinhardtii</it>. The SRP14 homolog in <it>P. falciparum </it>has been identified previously but annotated as a hypothetical protein (<url>http://www.plasmodb.org</url>, accession PFL0160w). SRP14 was found also in <it>E. tenella </it>and <it>E. histolytica</it>. However, we were unable to identify SRP9/14 in <it>Leishmania major, Trypanosoma cruzi, Theileria annulata, Giardia lamblia </it>and <it>E. cuniculi</it>. Although an intriguing observation, we could not formally rule out the possibility that SRP9/14 homologs would be discovered during the completion of the genome assemblies and that the SRP9/14 protein sequences in these organisms have strongly diverged from known members of this protein family.</p>
            <p>Evidence has been provided that <it>Leptomonas </it>and <it>T. brucei </it>possess a tRNA-like RNA that associates with the SRP <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. It has been speculated that this RNA compensates for the loss of portions of the Alu domain <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. The possibility that the tRNA-like RNA took the role of proteins SRP9/14 was considered as well. Upon completion of additional trypanosomatid genome sequences it will be interesting to determine if SRP in all these organisms carry a tRNA-like component and if they all lack SRP9/14.</p>
            <p>The microsporidian <it>E. cuniculi </it>has a highly compact genome that seem to have been under a pressure to eliminate non-essential material. As SRP9/14, 68 and 72 seem to be missing, the evolution of this organism could have involved the loss of these genes. The lack of these proteins would suggest that they are less critical for SRP function. It is interesting to note that in this respect <it>E. cuniculi </it>resembles archaea which also appear to lack SRP9/14, 68 and 72.</p>
         </sec>
         <sec>
            <st>
               <p>Yeast SRP21 is related to metazoan and plant SRP9</p>
            </st>
            <p>Homologs of SRP14, SRP19, SRP54, SRP68, and SRP72 were identified in all the <it>Saccharomyces </it>species (Table <tblr tid="T1">1</tblr>). It has been reported previously that <it>S. cerevisiae </it>possess SRP21 which was thought to be a new family of SRP proteins <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> unique to <it>Saccharomyces</it>. We have here reexamined the relationship of SRP21 to other proteins, including those of the SRP. Homologs to <it>S. cerevisiae </it>SRP21 in <it>S. mikatae</it>, <it>S. kudriavzevii</it>, <it>S. bayanus</it>, <it>S. castellii</it>, <it>S. kluyveri </it>and <it>S. paradoxus </it>sequences were initially identified using BLAST. The E-values ranged from 1e-77 (<it>S. paradoxus</it>) to 1e-29 (<it>S. kluyveri</it>). A homolog to SRP21 was found also in <it>Candida albicans </it>(E-value 8e-06). Open reading frames in the region of the BLAST hit were identified to obtain a likely full-length protein sequence for the <it>C. albicans </it>SRP21 (Fig. <figr fid="F5">5</figr>). We also identified the <it>S. pombe </it>protein YE07_SCHPO (T37873) as a distantly related homolog, with a E-value of 6.3. Interestingly, this protein displayed a distant homology to metazoan protein SRP9 and was previously listed in the SRPDB as a potential SRP9 homolog. Using the <it>S. pombe </it>protein as query we identified a possible homolog in <it>Neurospora crassa </it>(E-value 5e-6) using TBLASTN to search <it>N. crassa </it>genomic sequences (Fig. <figr fid="F5">5</figr>).</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Alignment of <it>Saccharomyces </it>SRP21 proteins with <it>C. albicans</it>, <it>N. crassa </it>and <it>S. pombe </it>homologs, metazoan and plant SRP9 proteins as well SRP14 proteins</p>
               </caption>
               <text>
                  <p><b>Alignment of <it>Saccharomyces </it>SRP21 proteins with <it>C. albicans</it>, <it>N. crassa </it>and <it>S. pombe </it>homologs, metazoan and plant SRP9 proteins as well SRP14 proteins</b>. The alignment of SRP9 to SRP14 is that previously shown in Birse et al <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. The alignment of SRP9 to SRP21 is based on the results of profile-based searches like the one shown in Fig. <figr fid="F6">6</figr> and CLUSTALW. The secondary structure from <it>Birse et al</it>. is shown with gray cylinders for &#945;-helices and arrows for &#946;-strands. The region between &#946;-1 and &#946;-2 (box) is not aligned. Boxed residues are basic residues and cysteines protruding from the &#946;-sheet into the solvent, according to <it>Birse et al</it>. Highly conserved residues are shown in dark gray, residues with conservative substitutions having the same physico-chemical properties are shown in light gray. Where organism names are given (<it>Saccharomyces</it>, <it>C. albicans</it>, <it>N. crassa </it>and <it>S. pombe) </it>they refer to the SRP21 proteins identified in this work whereas the SRP9 and SRP14 proteins are denoted by their Swissprot/TREMBL names.</p>
               </text>
               <graphic file="1471-2164-5-5-5"/>
            </fig>
            <p>We considered that SRP21 might bind to the regions inserted specifically into <it>Saccharomyces </it>SRP RNA, i.e helices 5c-g or 5h-i. However, this possibility appeared unlikely because SRP21 homologs were identified also in those fungi that did not possess these helix 5 expansions.</p>
            <p>To further examine the SRP21 homologs and their relationship with SRP9 we carried out profile-based searches. First, all novel putative yeast SRP21 homologs identified here were merged with the sequences of public protein sequence databases (Genpept and SWISSPROT/TREMBL). The resulting databases were used in profile-based searches including PSI-BLAST, PROFILESEARCH, and hmmsearch. For instance, when the <it>S. cerevisiae </it>SRP21 sequence was used as query in a PSI-BLAST search, after two iterations the <it>S. pombe </it>protein YE07_SCHPO (T37873) was identified above the default threshold (E-value 0.01) together with the <it>Candida </it>and <it>Saccharomyces </it>SRP21 proteins (not shown). Second, PROFILESEARCH was used with a profile based on a multiple alignment of the <it>Saccharomyces </it>sequences (Fig. <figr fid="F6">6</figr>, sequences shown in lighter gray) obtained by CLUSTALW to search SWISSPROT/TREMBL (approximately 1 million protein sequences as of March 2003). In the result of this search the <it>Candida</it>, <it>Neurospora</it>, and <it>S. pombe </it>sequences followed immediately after the <it>Saccharomyces </it>SRP21 sequences (Fig. <figr fid="F6">6</figr>). Interestingly, the SRP9 proteins of maize, <it>Arabidopsis</it>, and <it>C. elegans </it>ranked closely to the top-scoring yeast sequences, although they had scores lower than the other SRP21-related proteins. Similar results were obtained with hmmsearch (not shown).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Profile-based search reveals relationship between SRP9 and SRP21</p>
               </caption>
               <text>
                  <p><b>Profile-based search reveals relationship between SRP9 and SRP21</b>. PROFILESEARCH (Wisconsin package version 10.2, Genetics Computer Group (GCG), Madison, Wisc.) with default parameters was used with a profile based on a multiple alignment of SRP21 from <it>Saccharomyces </it>species created with CLUSTALW. The database queried was one where all the SRP21-related proteins identified here were merged with Swissprot/TREMBL. Swissprot/TREMBL names are given for the sequences except for the six <it>Saccharomyces</it>, <it>N. crassa </it>and <it>C. albicans </it>sequences identified here. YE07_SCHPO is the <it>S. pombe </it>SRP21 / SRP9 homolog. The <it>Saccharomyces sequences </it>at the top of the list (lighter gray) were those used to create the profile.</p>
               </text>
               <graphic file="1471-2164-5-5-6"/>
            </fig>
            <p>A multiple alignment of SRP21 and SRP9 as well as SRP14 protein sequences is shown in Fig. <figr fid="F5">5</figr>. The alignment of SRP9 to SRP14 is the structural alignment of Birse et al. <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and the alignment of SRP9 to SRP21 was the result of a CLUSTALW analysis which was consistent with the results obtained from the profile searches described above. Interestingly, many of the positions that were conserved in the SRP9/14 structural alignment <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> were occupied by the same category of amino acids in the SRP21 proteins (Fig. <figr fid="F5">5</figr>). These data indicated that SRP21 is structurally similar to the SRP9/14 proteins and provided further evidence of the homology between SRP21 and SRP9.</p>
            <p>In mammalian SRP, proteins SRP9 and SRP14 were shown to form a heterodimer and share a &#945;&#946;&#946;&#946;&#945; topology <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. To determine if the secondary structure predicted for fungal SRP21 was similar to the SRP9 and SRP14 structure, we made predictions with PSIPRED <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>. <it>Saccharomyces </it>SRP21 sequences were used as input and the results are shown in Fig. <figr fid="F7">7</figr>. For human SRP9 and SRP14 the boxed regions indicate the positions of the secondary structure elements as known from the structure of the proteins. The corresponding regions for the fungal proteins are also shown in boxes and are based on the alignment in Fig. <figr fid="F5">5</figr>. The results showed that the predicted secondary structure of SRP21 was remarkably similar to the predicted or known structures of SRP9 and SRP14. With the exception of the first &#945;-helix of <it>Saccharomyces </it>SRP21 all the predicted secondary structure elements were consistent with those of the SRP9/14 structure. Similar results were obtained with the secondary structure prediction method SAMT02 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> (not shown).</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Protein secondary structure predictions of SRP21, SRP9 and SRP14 proteins</p>
               </caption>
               <text>
                  <p><b>Protein secondary structure predictions of SRP21, SRP9 and SRP14 proteins. </b>Protein sequences, SRP21 and potential homologs from the organisms indicated as well as human SRP9 and SRP14, were subjected to secondary structure prediction by PSI-PRED <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. Sequences and predictions are shown unaligned. &#945;-helices and &#946;-strands are indicated by cylinders and arrows, respectively. For human SRP9 and SRP14 the boxed regions indicate the positions of the secondary structure elements as known from the structure of the proteins. The corresponding regions for the fungal proteins are also shown in boxes and are based on the alignment in Fig. <figr fid="F5">5</figr>.</p>
               </text>
               <graphic file="1471-2164-5-5-7"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Potential interactions of yeast SRP21</p>
            </st>
            <p>In metazoan cells, SRP9/14 forms a complex with the Alu domain of SRP RNA. The structure of this complex has been determined at high resolution <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B32">32</abbr></abbrgrp>. On the other hand, the Alu domain of yeast is less well characterized. An analysis of yeast SRP showed that it was missing an obvious SRP9 homolog <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. SRP21 was not believed to be the SRP9 equivalent in yeast because its sequence similarity to SRP9 was not recognized and SRP21 did not appear to be stably associated with SRP14. Furthermore, evidence was provided that SRP14 is present in two copies in the yeast SRP <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and SRP14 was shown to form a homodimer which bound to the Alu domain <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. On the basis of these observations it was assumed that the SRP14 homodimer was functionally equivalent to the SRP9/14 heterodimer. In contrast, we suggest that SRP21 is not only structurally but also functionally related to SRP9. We suggest that the protein is an integral component not only of <it>Saccharomyces</it>, but of every yeast SRP. In the light of these findings it will be important to reexamine experimentally the role of SRP21 in yeast.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>In the process of identifying and analyzing numerous SRP RNAs in protozoa and fungi we have demonstrated that the RNA portion of the Alu domain is highly variable both in sequence and secondary structure. Although the RNAs possess a conserved UGUNR motif, other parts of the Alu domain show a large degree of variation. One striking example of the plasticity of Alu is apparent in the Alveolates where the <it>Plasmodium </it>Alu domain is distinct from all other members of that group. Furthermore, in fungi the Alu domain is a simple hairpin motif as compared to all other species where the Alu domain is more elaborate. We have also identified secondary structure element insertions in the <it>Sacccharomyces </it>SRP RNAs towards the terminal regions which could be considered as expansions of the Alu domain.</p>
         <p>The Alu associated SRP9 and SRP14 proteins appear to have been subject to rapid evolution as well. One example is the evolution of the fungal SRP21 protein. Using sensitive profile-based searches, we have presented evidence that SRP21 is homologous to the metazoan SRP9. In addition, it seems that SRP9 and SRP14 are missing in some protozoa and fungi, and it is known from previous studies that the Bacillus type eubacteria are missing these proteins and sequence analysis has so far failed to reveal archaebacterial homologs. Based on these findings it is tempting to speculate that the ancestral Alu domain was built solely from RNA, and proteins SRP9/14 were added to adjust Alu function in subsequent evolutionary events.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sources of genomic sequences</p>
            </st>
            <p>Some of the genomic sequences used in this work were unfinished sequences and represent as yet unpublished material. In these cases permission to present the results in this paper was obtained from the respective research groups. SRP RNA sequences from <it>S. mikatae</it>, <it>S. kudriavzevii</it>, <it>S. bayanus</it>, <it>S. castellii </it>and <it>S. kluyveri </it>were retrieved using BLAST searches against the corresponding genomes using the <it>S. cerevisiae </it>sequence as query using the BLAST server at the Genome Sequencing Center at Washington University <url>http://www.genome.wustl.edu/blast/yeast_client.cgi</url>. <it>Saccharomyces </it>protein sequences were identified using the same BLAST server and the Synteny viewer at the Saccharomyces Genome Database <url>http://www.yeastgenome.org</url>. Sequences of <it>Candida albicans </it>were from the Stanford Genome Technology Center website at <url>http://www-sequence.stanford.edu/group/candida</url> (Assembly 6). <it>Neurospora </it>sequences were from the Neurospora Sequencing Project, Whitehead Institute/MIT Center for Genome Research <url>http://www-genome.wi.mit.edu</url>. The dataset used in these studies was <url>http://www.broad.mit.edu/ftp/pub/annotation/neurospora/assembly3/neurospora_3.fasta.gz</url>. <it>Aspergillus nidulans </it>sequences were downloaded from the Whitehead Institute, Center for Genome Research at <url>http://www-genome.wi.mit.edu/cgi-bin/annotation/aspergillus/download_license.cgi</url>. <it>Plasmodium falciparum </it>sequences were from PlasmoDB at <url>http://www.plasmodb.org</url>.</p>
            <p><it>Trypanosoma cruzi </it>and <it>Entamoeba histolytica </it>were downloaded with permission from TIGR.</p>
            <p><it>Eimeria tenella </it><url>http://www.sanger.ac.uk/Projects/E_tenella/</url>,</p>
            <p><it>Theileria annulata </it><url>http://www.sanger.ac.uk/Projects/T_annulata/</url>,</p>
            <p><it>Leishmania major </it><url>http://www.sanger.ac.uk/Projects/L_major/</url> were from the Sanger Centre.</p>
            <p>Other sources were <it>Chlamydomonas reinhardtii </it><url>http://genome.jgi-psf.org/chlre1/chlre1.home.html</url>,</p>
            <p>
               <it>Giardia lamblia </it>
               <url>http://jbpc.mbl.edu/Giardia-HTML/index2.html</url>
            </p>
            <p>and <it>Encephalitozoon cuniculi</it></p>
            <p><url>ftp://ftp.ncbi.nih.gov/genomes/Encephalitozoon_cuniculi/</url>.</p>
         </sec>
         <sec>
            <st>
               <p>Identification of SRP RNA sequences and prediction of RNA secondary structure</p>
            </st>
            <p>SRP RNA genes were predicted as described previously <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> by applying a combination of pattern searches using rnabob <url>http://www.genetics.wustl.edu/eddy/software/#rnabob</url> and COVE <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. The rnabob searches made use of descriptors based on consensus features of the helix 8 region of the RNA. The search pattern was typically (XX) YYAGR (NNN) GRRA (N'N'N') AGCAR (X'X') or minor variations of it, where X pairs with X' and N with N'. RNA secondary structure predictions were carried out by COVE and MFOLD <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> and were complemented by analysis of compensatory base changes <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. Saccharomyces RNAs were first aligned with CLUSTALW <abbrgrp><abbr bid="B37">37</abbr></abbrgrp> using a gap extension penalty of 0, and this alignment was manually edited to be consistent with observations from MFOLD predictions to assure that bases involved in the formation of helices were properly aligned.</p>
         </sec>
         <sec>
            <st>
               <p>Identification and analysis of SRP protein sequences</p>
            </st>
            <p>The program 'sixpack' of the EMBOSS package <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> was used to obtain all possible translation products of genome sequences. PROFILESEARCH was part of the GCG package (Wisconsin package version 10.2, Genetics Computer Group (GCG), Madison, Wisc.). Hmmer package, developed by S. Eddy, was obtained from <url>http://hmmer.wustl.edu/</url>. Genscan was downloaded from <url>http://genes.mit.edu/GENSCANinfo.html</url> and GlimmerM from <url>http://www.tigr.org/software/glimmerm/</url>. Protein secondary structure was predicted using PSIPRED <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp> and SAM-T02 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>.</p>
            <p>All predicted RNA and protein sequences, secondary structures and multiple alignments are shown in a web supplement at <url>http://bio.lundberg.gu.se/srp03/</url>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>MAR and TS carried out bioinformatics analyses. TS and CZ conceived of the study and drafted the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by NIH grant GM49034 to C.Z. and the EU grant QLK3-CT-200-00082 to TS.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Keenan</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Freymann</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Stroud</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Annu Rev Biochem</source>
            <pubdate>2001</pubdate>
            <volume>70</volume>
            <fpage>755</fpage>
            <lpage>775</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.biochem.70.1.755</pubid>
                  <pubid idtype="pmpid" link="fulltext">11395422</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>New insights into signal recognition and elongation arrest activities of the signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Bui</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Strub</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Biol Chem</source>
            <pubdate>1999</pubdate>
            <volume>380</volume>
            <issue>2</issue>
            <fpage>135</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10195420</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Elongation arrest is a physiologically important function of signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Mason</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ciufo</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>2000</pubdate>
            <volume>19</volume>
            <issue>15</issue>
            <fpage>4164</fpage>
            <lpage>4174</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/emboj/19.15.4164</pubid>
                  <pubid idtype="pmpid" link="fulltext">10921896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Structure of the SRP19 RNA complex and implications for signal recognition particle assembly</p>
            </title>
            <aug>
               <au>
                  <snm>Hainzl</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sauer-Eriksson</snm>
                  <fnm>AE</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>417</volume>
            <issue>6890</issue>
            <fpage>767</fpage>
            <lpage>771</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature00768</pubid>
                  <pubid idtype="pmpid" link="fulltext">12050674</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Structure and assembly of the Alu domain of the mammalian signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Weichenrieder</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Wild</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Strub</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cusack</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <issue>6890</issue>
            <fpage>167</fpage>
            <lpage>173</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11089964</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Crystal structure of SRP19 in complex with the S domain of SRP RNA and its implication for the assembly of the signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Oubridge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kuglstatter</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Jovine</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mol Cell</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>6</issue>
            <fpage>1251</fpage>
            <lpage>1261</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12086622</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Induced structural changes of 7SL RNA during the assembly of human signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Kuglstatter</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oubridge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nagai</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Nat Struct Biol</source>
            <pubdate>2002</pubdate>
            <volume>9</volume>
            <issue>10</issue>
            <fpage>740</fpage>
            <lpage>744</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nsb843</pubid>
                  <pubid idtype="pmpid" link="fulltext">12244299</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>SRPDB: Signal Recognition Particle Database</p>
            </title>
            <aug>
               <au>
                  <snm>Rosenblad</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Gorodkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Knudsen</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zwieb</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Samuelsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>363</fpage>
            <lpage>364</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165554</pubid>
                  <pubid idtype="pmpid" link="fulltext">12520023</pubid>
                  <pubid idtype="doi">10.1093/nar/gkg107</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Getting on target: The archaeal signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Zwieb</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Eichler</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Archaea</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <fpage>27</fpage>
            <lpage>34</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Crystal structure of the ribonucleoprotein core of the signal recognition particle</p>
            </title>
            <aug>
               <au>
                  <snm>Batey</snm>
                  <fnm>RT</fnm>
               </au>
               <au>
                  <snm>Rambo</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Lucast</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rha</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Doudna</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>287</volume>
            <issue>5456</issue>
            <fpage>1232</fpage>
            <lpage>1239</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.287.5456.1232</pubid>
                  <pubid idtype="pmpid" link="fulltext">10678824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Subunits of the Saccharomyces cerevisiae signal recognition particle required for its functional expression</p>
            </title>
            <aug>
               <au>
                  <snm>Brown</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Hann</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Medzihradszky</snm>
                  <fnm>KF</fnm>
               </au>
               <au>
                  <snm>Niwa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Burlingame</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>1994</pubdate>
            <volume>13</volume>
            <issue>18</issue>
            <fpage>4390</fpage>
            <lpage>4400</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7925282</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>The most abundant small cytoplasmic RNA of Saccharomyces cerevisiae has an important function required for normal cell growth</p>
            </title>
            <aug>
               <au>
                  <snm>Felici</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cesareni</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hughes</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Mol Cell Biol</source>
            <pubdate>1989</pubdate>
            <volume>9</volume>
            <issue>8</issue>
            <fpage>3260</fpage>
            <lpage>3268</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2477683</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genome sequence and gene compaction of the eukaryote parasite Encephalitozoon cuniculi</p>
            </title>
            <aug>
               <au>
                  <snm>Katinka</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Duprat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cornillot</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Metenier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Thomarat</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Prensier</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Peyretaillade</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Brottier</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>414</volume>
            <issue>6862</issue>
            <fpage>450</fpage>
            <lpage>453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35106579</pubid>
                  <pubid idtype="pmpid" link="fulltext">11719806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Prediction of signal recognition particle RNA genes</p>
            </title>
            <aug>
               <au>
                  <snm>Regalia</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rosenblad</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Samuelsson</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>15</issue>
            <fpage>3368</fpage>
            <lpage>3377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">137091</pubid>
                  <pubid idtype="pmpid" link="fulltext">12140321</pubid>
                  <pubid idtype="doi">10.1093/nar/gkf468</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>RNA sequence analysis using covariance models</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <issue>11</issue>
            <fpage>2079</fpage>
            <lpage>2088</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8029015</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>On finding all suboptimal foldings of an RNA molecule</p>
            </title>
            <aug>
               <au>
                  <snm>Zuker</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1989</pubdate>
            <volume>244</volume>
            <issue>4900</issue>
            <fpage>48</fpage>
            <lpage>52</lpage>
            <xrefbib>
               <pubid idtype="pmpid">2468181</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>cDNA cloning of the wheat germ SRP 7S RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Marshallsay</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Prehn</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zwieb</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1989</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>1771</fpage>
            <xrefbib>
               <pubid idtype="pmpid">2537964</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A kingdom-level phylogeny of eukaryotes based on combined protein data</p>
            </title>
            <aug>
               <au>
                  <snm>Baldauf</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Wenk-Siefert</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2000</pubdate>
            <volume>290</volume>
            <issue>5493</issue>
            <fpage>972</fpage>
            <lpage>977</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.290.5493.972</pubid>
                  <pubid idtype="pmpid" link="fulltext">11062127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Characterization of trans-splicing in Euglenoids</p>
            </title>
            <aug>
               <au>
                  <snm>Frantz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ebel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Paulus</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Imbault</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>2000</pubdate>
            <volume>37</volume>
            <issue>6</issue>
            <fpage>349</fpage>
            <lpage>355</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002940000116</pubid>
                  <pubid idtype="pmpid" link="fulltext">10905424</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Trans-splicing and cis-splicing in the colourless Euglenoid, Entosiphon sulcatum</p>
            </title>
            <aug>
               <au>
                  <snm>Ebel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Frantz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Paulus</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Imbault</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Curr Genet</source>
            <pubdate>1999</pubdate>
            <volume>35</volume>
            <issue>5</issue>
            <fpage>542</fpage>
            <lpage>550</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002940050451</pubid>
                  <pubid idtype="pmpid" link="fulltext">10369962</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>tRNAs of Trypanosoma brucei. Unusual gene organisation and mitochondrial importation</p>
            </title>
            <aug>
               <au>
                  <snm>Bell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Barry</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1991</pubdate>
            <volume>266</volume>
            <issue>27</issue>
            <fpage>18313</fpage>
            <lpage>18317</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">1717447</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Upstream tRNA genes are essential for expression of small nuclear and cytoplasmic RNA genes in trypanosomes</p>
            </title>
            <aug>
               <au>
                  <snm>Naakar</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Dare</snm>
                  <fnm>AO</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ullu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Tscudi</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <pubdate>1994</pubdate>
            <volume>14</volume>
            <issue>10</issue>
            <fpage>6736</fpage>
            <lpage>6742</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7523857</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Sequence and structure of Tetrahymena SRP RNA</p>
            </title>
            <aug>
               <au>
                  <snm>Brennwald</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Siegel</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wise</snm>
                  <fnm>JA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1991</pubdate>
            <volume>19</volume>
            <issue>8</issue>
            <fpage>1942</fpage>
            <xrefbib>
               <pubid idtype="pmpid">1709498</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Genome sequence of the human malaria parasite Plasmodium falciparum</p>
            </title>
            <aug>
               <au>
                  <snm>Gardner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fung</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hyman</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <issue>6906</issue>
            <fpage>498</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01097</pubid>
                  <pubid idtype="pmpid" link="fulltext">12368864</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The Alu domain homolog of the yeast signal recognition particle consists of an Srp14p homodimer and a yeast-specific RNA structure</p>
            </title>
            <aug>
               <au>
                  <snm>Strub</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Fornallaz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bui</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Rna</source>
            <pubdate>1999</pubdate>
            <volume>5</volume>
            <issue>10</issue>
            <fpage>1333</fpage>
            <lpage>1347</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S1355838299991045</pubid>
                  <pubid idtype="pmpid" link="fulltext">10573124</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Flexible sequence similarity searching with the FASTA3 program package</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Methods Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>132</volume>
            <fpage>185</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10547837</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Prediction of complete gene structures in human genomic DNA</p>
            </title>
            <aug>
               <au>
                  <snm>Burge</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Karlin</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1997</pubdate>
            <volume>268</volume>
            <issue>1</issue>
            <fpage>78</fpage>
            <lpage>94</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1997.0951</pubid>
                  <pubid idtype="pmpid" link="fulltext">9149143</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Interpolated Markov models for eukaryotic gene finding</p>
            </title>
            <aug>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Pertea</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1999</pubdate>
            <volume>59</volume>
            <issue>1</issue>
            <fpage>24</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.1999.5854</pubid>
                  <pubid idtype="pmpid" link="fulltext">10395796</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Identification of a tRNA-like molecule that copurifies with the 7SL RNA of Trypanosoma brucei</p>
            </title>
            <aug>
               <au>
                  <snm>Beja</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Ullu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Michaeli</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>1993</pubdate>
            <volume>57</volume>
            <issue>2</issue>
            <fpage>223</fpage>
            <lpage>229</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0166-6851(93)90198-7</pubid>
                  <pubid idtype="pmpid">8433714</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The Trypanosomatid Signal Recognition Particle Consists of Two RNA Molecules, a 7SL RNA Homologue and a Novel tRNA-like Molecule</p>
            </title>
            <aug>
               <au>
                  <snm>Liu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Ben-Shlomo</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>YX</fnm>
               </au>
               <au>
                  <snm>Stern</snm>
                  <fnm>MZ</fnm>
               </au>
               <au>
                  <snm>Goncharov</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Michaeli</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>2003</pubdate>
            <volume>278</volume>
            <issue>20</issue>
            <fpage>18271</fpage>
            <lpage>18280</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1074/jbc.M209215200</pubid>
                  <pubid idtype="pmpid" link="fulltext">12606550</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The crystal structure of the signal recognition particle Alu RNA binding heterodimer, SRP9/14</p>
            </title>
            <aug>
               <au>
                  <snm>Birse</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Kapp</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Strub</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Cusack</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Aberg</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Embo J</source>
            <pubdate>1997</pubdate>
            <volume>16</volume>
            <issue>13</issue>
            <fpage>3757</fpage>
            <lpage>3766</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/emboj/16.13.3757</pubid>
                  <pubid idtype="pmpid" link="fulltext">9233785</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The PSIPRED protein structure prediction server</p>
            </title>
            <aug>
               <au>
                  <snm>McGuffin</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Bryson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>4</issue>
            <fpage>404</fpage>
            <lpage>405</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/16.4.404</pubid>
                  <pubid idtype="pmpid" link="fulltext">10869041</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Protein secondary structure prediction based on position-specific scoring matrices</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1999</pubdate>
            <volume>292</volume>
            <issue>2</issue>
            <fpage>195</fpage>
            <lpage>202</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1999.3091</pubid>
                  <pubid idtype="pmpid" link="fulltext">10493868</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>What is the value added by human intervention in protein structure prediction?</p>
            </title>
            <aug>
               <au>
                  <snm>Karplus</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Karchin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cline</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Grate</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Casper</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hughey</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proteins</source>
            <pubdate>2001</pubdate>
            <volume>Suppl</volume>
            <issue>5</issue>
            <fpage>86</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1002/prot.10021</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>SRP-RNA sequence alignment and secondary structure</p>
            </title>
            <aug>
               <au>
                  <snm>Larsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Zwieb</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1991</pubdate>
            <volume>19</volume>
            <issue>2</issue>
            <fpage>209</fpage>
            <lpage>215</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1707519</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubid idtype="pmpid">7984417</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>EMBOSS opens up sequence analysis. European Molecular Biology Open Software Suite</p>
            </title>
            <aug>
               <au>
                  <snm>Olson</snm>
                  <fnm>SA</fnm>
               </au>
            </aug>
            <source>Brief Bioinform</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <issue>1</issue>
            <fpage>87</fpage>
            <lpage>91</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12002227</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

