<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-375</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>A motif-based search in bacterial genomes identifies the ortholog of the small RNA Yfr1 in all lineages of cyanobacteria</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Vo&#223;</snm>
               <fnm>Bj&#246;rn</fnm>
               <insr iid="I1"/>
               <email>bjoern.voss@biologie.uni-freiburg.de</email>
            </au>
            <au id="A2">
               <snm>Gierga</snm>
               <fnm>Gregor</fnm>
               <insr iid="I1"/>
               <email>gregor.gierga@biologie.uni-freiburg.de</email>
            </au>
            <au id="A3">
               <snm>Axmann</snm>
               <mi>M</mi>
               <fnm>Ilka</fnm>
               <insr iid="I2"/>
               <email>ilka.Axmann@gmx.de</email>
            </au>
            <au id="A4" ca="yes">
               <snm>Hess</snm>
               <mi>R</mi>
               <fnm>Wolfgang</fnm>
               <insr iid="I1"/>
               <email>wolfgang.hess@biologie.uni-freiburg.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>University of Freiburg, Faculty of Biology, Experimental Bioinformatics, Sch&#228;nzlestr. 1, D-79104 Freiburg, Germany</p>
            </ins>
            <ins id="I2">
               <p>Humboldt University Berlin, Institute for Theoretical Biology, Invalidenstrasse 43, D-10115 Berlin, Germany</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>375</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/375</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17941988</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-375</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>29</day>
               <month>3</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>17</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>17</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Vo&#223; et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Non-coding RNAs (ncRNA) are regulators of gene expression in all domains of life. They control growth and differentiation, virulence, motility and various stress responses. The identification of ncRNAs can be a tedious process due to the heterogeneous nature of this molecule class and the missing sequence similarity of orthologs, even among closely related species. The small ncRNA Yfr1 has previously been found in the <it>Prochlorococcus/Synechococcus </it>group of marine cyanobacteria.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Here we show that screening available genome sequences based on an RNA motif and followed by experimental analysis works successfully in detecting this RNA in all lineages of cyanobacteria. Yfr1 is an abundant ncRNA between 54 and 69 nt in size that is ubiquitous for cyanobacteria except for two low light-adapted strains of <it>Prochlorococcus</it>, MIT 9211 and SS120, in which it must have been lost secondarily. Yfr1 consists of two predicted stem-loop elements separated by an unpaired sequence of 16&#8211;20 nucleotides containing the ultraconserved undecanucleotide 5'-ACUCCUCACAC-3'.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Starting with an ncRNA previously found in a narrow group of cyanobacteria only, we show here the highly specific and sensitive identification of its homologs within all lineages of cyanobacteria, whereas it was not detected within the genome sequences of <it>E. coli </it>and of 7 other eubacteria belonging to the alpha-proteobacteria, chlorobiaceae and spirochaete. The integration of RNA motif prediction into computational pipelines for the detection of ncRNAs in bacteria appears as a promising step to improve the quality of such predictions.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Non-coding RNAs (ncRNAs) are sequence-specific regulators of gene expression, mediating a plethora of cellular responses to changing environmental conditions and morphological differentiation. In bacteria, ncRNAs are a heterogeneous group of functional RNA molecules normally (but not always) lacking a protein-coding function. They are frequently smaller than 200 nt in size, and act to regulate mRNA translation/decay but can also bind to proteins and thereby modify protein function (recent reviews <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>). In many cases, these ncRNAs function through sequence-specific base pairing; hence they frequently have a (partial) base complementarity to their target RNA molecules. The vast majority of known ncRNAs is encoded at genomic locations far away from the target genes (<it>trans</it>-encoded ncRNAs). However, a small number of ncRNAs is transcribed from the reverse complementary strand of the respective target and hence these are fully or partially overlapping with their target RNAs. This latter class of ncRNAs is called <it>cis</it>-encoded or antisense RNAs.</p>
         <p>As a result of recent systematic searches, more than 70 ncRNAs are now known in <it>E. coli</it>, most of which had been overlooked by traditional genome analysis. Many, or possibly most major stress responses in <it>E. coli </it>include at least one small regulatory RNA as part of the regulon <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Systematic biochemical, genetic, genomic or computational searches for ncRNAs are still lacking for most eubacterial phyla outside the enterobacteria. In general, genes encoding ncRNAs are not annotated during standard genome analysis procedures. The efforts to accomplish their identification in bacteria can broadly be divided into (i) sequencing the population of sRNAs (directly or after cloning) as comprehensively as possible (RNomics) or (ii) prediction by bioinformatics tools (mostly) followed by experimental verification (see <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> for a recent review).</p>
         <p>There is currently only very scarce information on regulatory RNAs and their genes in cyanobacteria. In addition to the more common types of ncRNA (tmRNA, 6S RNA, RNAse P RNA and <it>ffs </it>RNA), less than 10 different ncRNAs and only three antisense RNAs have been described for this whole group of bacteria so far <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. Among the known ncRNAs are Yfr1-7 (c<ul>Y</ul>anobacterial <ul>F</ul>unctional <ul>R</ul>NA). The existence of these ncRNAs was predicted for one (Yfr4 and Yfr5), two (Yfr3 and Yfr6), three (Yfr1) or four (Yfr2 and Yfr7) strains of the <it>Prochlorococcus/Synechococcus </it>lineage in a comparative genomics-based approach and their actual presence was demonstrated under various growth and stress conditions that they encounter in the natural environment <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. However, functions and phylogenetic distribution of these ncRNAs have remained unknown. There are 19 genome sequences from the <it>Prochlorococcus/Synechococcus </it>clade in the public domain now, providing an excellent data set for comparative genomics-based computational prediction of ncRNAs. However, for more distantly related cyanobacteria there are only few such data sets available, rendering the direct identification of ncRNAs through comparative genomics difficult. An exception is ncRNA Yfr7. Due to its exceptionally highly conserved sequence and based on an extended structural model, this RNA was identified as the cyanobacterial homolog of a ubiquitous eubacterial riboregulator, the RNA polymerase &#8211; interacting 6S RNA <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Molecular details of its expression in synchronized cell cultures of <it>Prochlorococcus </it>MED4 are given in more detail separately <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The RNAs Yfr2 through Yfr5 in <it>Prochlorococcus </it>MED4 constitute a family of closely related ncRNAs with a length of 89&#8211;95 nt <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Gene copy numbers of Yfr2-5 homologues in other <it>Prochlorococcus </it>and <it>Synechococcus </it>genomes seem to differ widely and there is only very little synteny among them. Therefore, here we selected Yfr1, with 54, 57 and 56 nucleotides in <it>Prochlorococcus </it>MED4, MIT 9313 and <it>Synechococcus </it>WH 8102 the smallest ncRNA detected in cyanobacteria so far <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, to address the question if it is possible to find more orthologs of a short bacterial ncRNA in an alternative computational-experimental approach. This approach is likely to be successful also in other cases in which comparative data suggest a highly conserved ncRNA secondary structure that can be taken as a starting point.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Yfr1 in marine unicellular cyanobacteria</p>
            </st>
            <p>Cyanobacteria are phototrophic bacteria which perform oxygenic photosynthesis and populate widely diverse environments such as freshwater, the oceans, rock surfaces, desert soil or the Polar Regions. Therefore, various types of regulatory RNA can be expected that interplay with the different signal transduction pathways and stress responses. Antisense RNAs found within the gas vesicle operon in <it>Calothrix </it>PCC 7601 <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>, or covering the <it>ferric uptake regulator </it>gene <it>furA </it>in <it>Anabaena </it>PCC 7120 over its full length <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, or regulating the light-absorbing protein IsiA under conditions of iron limitation and redox stress in the unicellular <it>Synechocystis </it>PCC 6803 <abbrgrp><abbr bid="B5">5</abbr></abbrgrp> are three known examples for such RNAs in cyanobacteria. Based on comparative computational analysis, we predicted a whole set of putative trans-acting ncRNAs with unknown function, which we called Yfr1-Yfr7 <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
            <p>We have previously shown the existence of Yfr1 in three out of four tested marine cyanobacteria belonging to the genera <it>Prochlorococcus </it>and <it>Synechococcus </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Unicellular marine cyanobacteria of these genera provide an excellent dataset for computational predictions that require comparative genome information since currently 19 different genome sequences from very closely related isolates are available. Thus, there is an extensive dataset to take compensatory base mutations into account for the prediction of novel ncRNA candidates. In these genomes, orthologs of Yfr1 can be found even by simple BLAST searches (Fig. <figr fid="F1">1A</figr>).</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Comparison of putative Yfr1 RNAs from 15 different <it>Prochlorococcus </it>and <it>Synechococcus</it></p>
               </caption>
               <text>
                  <p><b>Comparison of putative Yfr1 RNAs from 15 different <it>Prochlorococcus </it>and <it>Synechococcus</it></b>. A. Sequence alignment of the top scoring sequences obtained by BlastN using the known Yfr1 sequences [7] as a query. The respective strain numbers are prefixed "Pro" and "Syn" for <it>Prochlorococcus </it>and <it>Synechococcus</it>. B. Sequence/structure model for putative Yfr1 RNAs from 15 different <it>Prochlorococcus </it>and <it>Synechococcus </it>as shown in part A. Sequence is given in IUPAC-notation (R: A or G; Y: C or U; S: G or C; K: G or U; B: G, U or C; V: G, C or A; D: G, U or A; H: A, C or U). Base pair colors indicate the number of different base pairs which occur in the different sequences at this position (red = 1, yellow = 2, green = 3) and their shading resembles the frequency of base pairing, i.e. the number of sequences where this base pair is not present.</p>
               </text>
               <graphic file="1471-2164-8-375-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Prediction of Yfr1 homologs in different classes of cyanobacteria based on an RNA motif</p>
            </st>
            <p>Despite some recent progress <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, marine unicellular cyanobacteria are not trivial in functional studies requiring genetic manipulation. Therefore, finding possible orthologs of Yfr1 in one of the cyanobacterial model species amenable for genetic manipulation would be desirable. The direct identification of such an ncRNA is not intuitively possible based on sequence similarity alone. In Table <tblr tid="T1">1</tblr>, BlastN hits are given for three such model strains, <it>Synechocystis </it>PCC 6803, <it>Synechococcus </it>PCC 7942/6301, <it>Anabaena </it>PCC 7120 and the marine <it>Synechococcus </it>WH 7803. Using the <it>yfr1 </it>full length sequence of <it>Prochlorococcus </it>MED4 (5'-atgggggaaaccccatactcctcacacaccaaatcgcccgatttatcgggcttt-3') as a query, only for <it>Synechocystis </it>PCC 6803 a hit to the correct sequence was found, ranking only at the third position. Similar results were obtained when the <it>yfr1 </it>full length sequence of <it>Synechococcus </it>WH 8102 was taken, except that the ortholog in the very closely related <it>Synechococcus </it>WH 7803 was found with clear statistical support (score = 77.8 bits, e-value = 4e-16; not shown).</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Search for homologs of <it>Prochlorococcus </it>MED4 Yfr1 within the genome sequences of four different cyanobacterial model species based on sequence similarity</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Species</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Match</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Score (bits)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>E value</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Location</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b><it>Synechocystis </it>PCC 6803</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i1.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>30.2</p>
                     </c>
                     <c ca="left">
                        <p>0.13</p>
                     </c>
                     <c ca="left">
                        <p>overlapping the start codon of slr1277 <it>gspD </it>in antisense orientation</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i2.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>30.2</p>
                     </c>
                     <c ca="left">
                        <p>0.13</p>
                     </c>
                     <c ca="left">
                        <p>intergenic spacer sll0185_slr0199</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i3.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>28.2</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>0.50</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>ssl3769-<it>trxA </it>intergenic spacer</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b><it>Anabaena </it>PCC 7120</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i4.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>32.2</p>
                     </c>
                     <c ca="left">
                        <p>0.058</p>
                     </c>
                     <c ca="left">
                        <p>mRNA, cobalt transport protein</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i5.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>28.2</p>
                     </c>
                     <c ca="left">
                        <p>0.91</p>
                     </c>
                     <c ca="left">
                        <p>mRNA, hypothetical protein</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i6.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>28.2</p>
                     </c>
                     <c ca="left">
                        <p>0.91</p>
                     </c>
                     <c ca="left">
                        <p>Intergenic spacer</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><b><it>Synechococcus </it>PCC 630</b>1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i7.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>26.3</p>
                     </c>
                     <c ca="left">
                        <p>1.4</p>
                     </c>
                     <c ca="left">
                        <p>antisense orientation to mRNA of syc1906_d "hypothetical protein"</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i8.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>26.3</p>
                     </c>
                     <c ca="left">
                        <p>1.4</p>
                     </c>
                     <c ca="left">
                        <p>mRNA,<it>ftsY</it></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i9.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>24.3</p>
                     </c>
                     <c ca="left">
                        <p>5.4</p>
                     </c>
                     <c ca="left">
                        <p>mRNA, <it>hisC</it></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b><it>Synechococcus </it>WH 7803</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i10.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>24.3</p>
                     </c>
                     <c ca="left">
                        <p>4.8</p>
                     </c>
                     <c ca="left">
                        <p>mRNA, Ferredoxin-dependent glutamate synthase</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i11.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>24.3</p>
                     </c>
                     <c ca="left">
                        <p>4.8</p>
                     </c>
                     <c ca="left">
                        <p>mRNA, NAD/NADP transhydrogenase subunit alpha part 1</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <inline-formula>
                              <graphic file="1471-2164-8-375-i12.gif"/>
                           </inline-formula>
                        </p>
                     </c>
                     <c ca="left">
                        <p>22.3</p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>mRNA, glycosyltransferase of family GT4</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The three top hits (S, subject sequence; Q, query sequence) obtained by BlastN are given for each genome.</p>
               </tblfn>
            </tbl>
            <p>Thus, sequence homology alone is not sufficient and we reasoned to complement it by comparative structure information. Taking up this idea, the initial set of the three previously identified Yfr1 sequences from <it>Prochlorococcus </it>MED4, MIT9313 and <it>Synechococcus </it>WH8102 plus the novel putative orthologs from the marine <it>Synechococcus/Prochlorococcus </it>were subjected to a comparative sequence/structure analysis. As a result we derived an initial sequence/structure-model (Fig. <figr fid="F1">1B</figr>) consisting of two stem loops and a central unpaired section of 18&#8211;19 nt. Interestingly, both loop sequences appeared unconserved and therefore these were set to aNy nucleotide in our subsequent searches. In contrast, we noted the presence of an eleven nt highly conserved sequence stretch within the unpaired central section. We reasoned if this sequence element would be essential and tried different RNA motif descriptors in which the undecanucleotide had to be fully conserved or allowed one or two mismatches. Furthermore, we relaxed the required length of the central unpaired section to a range of 12&#8211;25 nt and the lengths of both terminal helices to 5&#8211;10 base pairs and permitted a single bulge or mismatch in the 5' helix as observed in the data in Fig. <figr fid="F1">1</figr>. This model was used to search for orthologs within genomes of other cyanobacteria and also other eubacteria. Candidates found by this method were incorporated into the model and, finally, this procedure led to the identification of putative Yfr1 orthologs in 31 (28 new) out of 33 available cyanobacterial genomes. All here identified Yfr1 candidates are summarized in a multiple sequence/structure alignment shown in Fig. <figr fid="F2">2</figr>. An outstanding property appearing in this alignment is the perfectly conserved 11 nt long sequence motif (5'-ACUCCUCACAC-3') within the unpaired region. The reason for this perfect conservation might be that this short sequence is targeting a conserved sequence element in one or several other RNAs, mRNAs or ncRNAs, through base-pairing interactions as it is known for many ncRNAs. Alternatively, it could also be a protein binding motif. The rather high concentration of Yfr1 in the different organisms is rather untypical for an ncRNA acting through base-pairing.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Multiple alignment of Yfr1 RNA from 31 cyanobacteria</p>
               </caption>
               <text>
                  <p><b>Multiple alignment of Yfr1 RNA from 31 cyanobacteria</b>. Blue and green background colors indicate participation in a base pair of the consensus structure, which is given on the last line. The perfectly conserved sequence motif within the unpaired region is given in capital, red letters.</p>
               </text>
               <graphic file="1471-2164-8-375-2"/>
            </fig>
            <p>From this set of orthologs, a comprehensive sequence/structure-model for Yfr1 was derived (Fig. <figr fid="F3">3</figr>). The 5'-end forms a 6&#8211;10 bp stem-loop, which may be interrupted by one bulge or internal loop. At the 3'-end the sequences show a 6&#8211;9 bp GC-rich hairpin, followed by a U-tail. These features make this element very likely to be a Rho-independent terminator. Furthermore, together with the 5'-terminal helix the 3'-helix may ensure the unpaired state of the conserved sequence motif and/or protect the RNA against degradation. One would expect approximately 0.5 instances of a specific 11 nt motif in a 2 MB genome and equal base distribution of 0.25. The here identified ultraconserved 5'-ACUCCUCACAC-3' occurs more often then by chance in the investigated genomes (for instance, three times in <it>Prochlorococcus </it>sp. MED4, genome size 1.66 MB; four times in <it>Synechocystis </it>PCC6803, genome size 3.57 MB, but also twice in <it>Prochlorococcus </it>sp. SS120 where it is not related to Yfr1 and is not located within a comparable context of possible secondary structure). Therefore, this motif alone is not sufficient to find homologs of Yfr1. However, it is interesting to note that the two additional occurrences in <it>Prochlorococcus </it>MED4 both are located in intergenic spacer regions, close to two other ncRNAs: 160&#8211;150 nt upstream the mapped 5' end of Yfr2 and 128&#8211;138 nt upstream of Yfr5 <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The only two cyanobacterial genomes for which no Yfr1 ortholog was found were <it>Prochlorococcus </it>sp. SS120 and MIT9211. These latter two examples were chosen to test when the RNA motif prediction would find likely false positives. When the score was reduced to 0.0 still no candidate Yfr1 homologs were found in these two genomes. Only when in addition one or two mismatches were allowed in the central consensus element, candidates were found (see additional file <supplr sid="S1">1</supplr>). However, these candidate sequences could not be aligned to each other or to the other Yfr1 sequences. In case of <it>Prochlorococcus </it>sp. SS120 also a previous experimental screen had remained negative <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Therefore, the candidates found under these very relaxed conditions (additional file <supplr sid="S1">1</supplr>) do not appear realistic. Thus, the here desribed search method allows for an  excellent discrimination between true positives and false  positives. This finding is in agreement with studies in which RNA motif prediction was integrated into computational pipelines for the high-throughput prediction of <it>cis</it>-regulatory RNA sequences <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Results and conditions of an RNA motif-based search for Yfr1 homologs in <it>Prochlorococcus </it>sp. SS120 and MIT9211 with relaxed conditions.</p>
               </text>
               <file name="1471-2164-8-375-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Sequence/structure model for Yfr1 RNA based on a structural alignment created from a ClustalW multiple sequence alignment and an RNAlishapes consensus secondary structure from 31 cyanobacteria</p>
               </caption>
               <text>
                  <p><b>Sequence/structure model for Yfr1 RNA based on a structural alignment created from a ClustalW multiple sequence alignment and an RNAlishapes consensus secondary structure from 31 cyanobacteria</b>. Sequence is given in IUPAC-notation (R: A or G; Y: C or U; M: A or C; S: G or C; B: G, U or C; V: G, C or A; D: G, T or A). Base pair colors indicate the number of different base pairs which occur in the different sequences at this position (red = 1, yellow = 2, green = 3 and blue = 4) and their shading resembles the frequency of base pairing, i.e. the number of sequences where this base pair is not present. The unpaired sequence is given as a sequence logo prepared using WebLogo [23].</p>
               </text>
               <graphic file="1471-2164-8-375-3"/>
            </fig>
            <p>Conservation of genomic location and flanking genes can also be a powerful tool for finding related ncRNAs. Indeed, the <it>yfr1 </it>gene is at a conserved position in the majority of the here investigated genomes. In 25 out of 31 genomes, it is found upstream a <it>guaB </it>gene, coding for inosine-5'-monophosphate dehydrogenase (Table <tblr tid="T2">2</tblr>). Even more obvious is the frequent association with a downstream located <it>trxA </it>gene (27 of 31 genomes, Table <tblr tid="T2">2</tblr>).</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Genomic location and orientation of <it>yfr1 </it>in cyanobacteria</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Organism</b>
                        </p>
                     </c>
                     <c cspan="3" ca="center">
                        <p>
                           <b>Orientation</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>all <it>Prochlorococcus </it>(8 strains)</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>all marine <it>Synechococcus </it>(9 strains)</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Thermosyn. elongatus</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>tlr1779*&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8592;tll1780*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Gloeobacter violaceu</it>s</p>
                     </c>
                     <c ca="center">
                        <p><it>trpE</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Synechococcus </it>Bprime</p>
                     </c>
                     <c ca="center">
                        <p><it>ligA</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>clpX</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Synechococcus </it>Aprime</p>
                     </c>
                     <c ca="center">
                        <p><it>ligA</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>clpX</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Synechococcus </it>PCC7942/6301</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Crocosphaera watsonii </it>WH 8501</p>
                     </c>
                     <c ca="center">
                        <p>&#8592;<it>pcrA</it></p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Synechocystis </it>PCC 6803</p>
                     </c>
                     <c ca="center">
                        <p>&#8592;sll0586*</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Trichodesmium eryt</it>. IMS101</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>&#8592;Tery3307*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Lyngbya </it>PCC7419</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Anabaena variabilis </it>ATCC 29413</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Anabaena </it>PCC7120</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Nostoc punctiforme </it>PCC73102</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p><it>Nodularia </it>PCC9350</p>
                     </c>
                     <c ca="center">
                        <p><it>guaB</it>&#8594;</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b><it>yfr1</it>&#8594; </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p><it>trxA</it>&#8594;</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>See Fig. 2 for the strain names of marine <it>Synechococcus </it>and <it>Prochlorococcus;</it>*hypothetical protein.</p>
               </tblfn>
            </tbl>
            <p>We extended the search also to non-cyanobacterial photosynthetic bacteria, such as the alpha-proteobacteria <it>Erythrobacter litoralis</it>, <it>Rhodobacter sphaeroides</it>, <it>Silicibacter sp</it>. TM1040, the chlorobiaceae <it>Chlorobium tepidum</it>, <it>Chlorobium chlorochromatii </it>and <it>Cytophaga hutchinsonii</it>, as well as one spirochaete (<it>Leptospira borgpeterseni</it>) and, finally, <it>Escherichia coli</it>, but did not identify a reasonable ortholog of Yfr1 within these. So far, the occurrence of Yfr1 seems to be restricted to cyanobacteria. Since cyanobacteria represent a separate eubacterial phylum for at least 2.5 billion years of evolution it is very likely that Yfr1 originated very early in the progenitors of this group and must have co-evolved with its (hypothetical) target(s) for a very long time.</p>
         </sec>
         <sec>
            <st>
               <p>Experimental verification of Yfr1 in phylogenetically different groups of cyanobacteria</p>
            </st>
            <p>According to morphologic criteria, cyanobacteria can be organized into five different sections <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. These sections are only partially congruent with molecular phylogenetic data, therefore we chose six cyanobacterial species which clearly are very different from each other, judged by their morphology, 16S rRNA sequence and life style, to test our result by Northern hybridization. We also included RNA from <it>Microcystis </it>PCC7806 for which no genome sequence was available and, hence, no candidate predicted. The results are given in Fig. <figr fid="F4">4</figr> and clearly show that Yfr1 is present in all tested species. Together with the three previous identified ones <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> our experiment raises the number of validated Yfr1 homologs to ten. In our previous study we did not identify an Yfr1 homolog in <it>Prochlorococcus </it>SS120. This is substantiated by the result of this study as we also do not predict one here. Summarizing, we found reasonable orthologs of Yfr1 in 31 out of 33 cyanobacterial strains and, furthermore, their existence could be validated for ten out of ten in this and a previous study.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Detection of Yfr1 RNA in seven different freshwater cyanobacteria</p>
               </caption>
               <text>
                  <p><b>Detection of Yfr1 RNA in seven different freshwater cyanobacteria</b>. <b>A</b>. About 10 &#956;g of total RNA from <it>Synechococcus </it>sp. PCC 7942 (Syn 7942), <it>Thermosynechococcus elongatus </it>BP1 (Tsyn. elong.), <it>Synechocystis </it>PCC 6803 (Sync. 6803), <it>Microcystis aeruginosa </it>PCC 7806 (Mic. 7806), <it>Nostoc </it>sp. PCC 7120 (Nos. 7120), <it>Nostoc punctiforme </it>(Nos. punct.) and <it>Gloeobacter violaceus </it>PCC 7421 (Gloe. 7421) was analyzed by staining a 10% polyacrylamide gel with ethidium bromide.<b>B</b>. Northern blot hybridization with DNA oligonucleotides for the presence of Yfr1 (lower part) and, as a control, the 6S RNA (upper part).</p>
               </text>
               <graphic file="1471-2164-8-375-4"/>
            </fig>
            <p>Superimposing our results with a phylogenetic analysis based on 16S rRNA yields the tree shown in Fig. <figr fid="F5">5</figr>. The widespread existence of Yfr1 over large evolutionary distances points to an important function of Yfr1 throughout the cyanobacterial domain. Interestingly, <it>Prochlorococcus </it>SS120 and MIT9211, neither of which is predicted to hold an Yfr1 homolog, appear in one separate cluster. This lets us conclude, that these two or a common ancestor of them lost Yfr1 secondarily. This may be related to their special habitat. Different <it>Prochlorococcus </it>isolates can broadly become subdivided in an ecotype adapted to low light and another one adapted to high light. These ecotypes are genetically and physiologically distinct <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> and show a distinct distribution under natural conditions <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>. <it>Prochlorococcus </it>SS120 and MIT9211 have both been isolated from greater depths in the water column and are adapted to very low light conditions.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>16S rRNA-based phylogenetic tree summarizing the presence or absence of Yfr1 in all major cyanobacterial lineages</p>
               </caption>
               <text>
                  <p><b>16S rRNA-based phylogenetic tree summarizing the presence or absence of Yfr1 in all major cyanobacterial lineages</b>. Cyanobacteria for which genome sequences are currently available from public databases are in boldface letters. Those strains, for which a Yfr1 ncRNA was computationally predicted are labeled Yfr1+. Those, for which we provide experimental evidence for the presence of Yfr1 are labeled Yfr1*. We have shown the presence of Yfr1 in the <it>Prochlorococcus </it>strains MED4 and MIT9313 and in <it>Synechococcus </it>WH 8102 before [7]. The tree is based on a multiple sequence alignment of 16S rRNA sequences analyzed in MEGA3.1 [24]. Statistic support values were generated by bootstrapping and are only shown if > 70. The 16S rRNA of <it>Chlorobium tepidum </it>TLS served as an outgroup. The sections I-V of cyanobacterial taxonomy [12] are indicated.</p>
               </text>
               <graphic file="1471-2164-8-375-5"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>Starting with a known ncRNA from one lineage we show here the highly specific and sensitive identification of its homologs within all lineages of cyanobacteria. The integration of RNA motif prediction into computational pipelines for the detection of ncRNAs in bacteria appears as a promising step to improve the quality of such predictions. The only cyanobacteria that appear to lack Yfr1 are  <it>Prochlorococcus</it> SS120 and MIT9211, which both are adapted  to live at very low light intensities and must have lost  Yfr1 secondarily. Therefore, one essential function of Yfr1 might have to do with the ability to cope with high photosynthetic energy fluxes or redox conditions.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Cultures of cyanobacteria</p>
            </st>
            <p>Marine unicellular cyanobacteria were grown as previously published <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Cultures of <it>Microcystis aeruginosa</it>, <it>Synechococcus elongatus </it>PCC 7942, <it>Thermosynechococcus elongatus </it>BP-1, <it>Synechocystis </it>sp. PCC 6803, <it>Nostoc </it>sp. PCC 7120 and <it>Gloeobacter violaceus </it>were obtained from the Pasteur Culture Collection and one, <it>Nostoc punctiforme </it>ATCC 29133, from American Tissue Culture Collection. All cultures were grown as recommended by the Pasteur Culture Collection.</p>
         </sec>
         <sec>
            <st>
               <p>Extraction and analysis of RNA</p>
            </st>
            <p>Total RNA was isolated as previously described <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> but with modified lysis conditions for PCC strains, <it>Anabaena </it>and <it>Nostoc </it>as follows: Disruption by adding an equal volume of glass beads, 33.3 &#956;l 20 % SDS solution and 583 &#956;l acidic phenol to 500 &#956;l concentrated cell solution, following several cycles of vigorous agitation, freezing in liquid nitrogen and thawing in a water bath. Centrifugation of the mixture for 15 min at maximal speed at 4&#176;C yielded an upper aqueous phase which could be cleaned up by standard phenol-chloroform-extraction. Finally, the phases were separated by centrifugation at 9.000 rpm for 15 minutes at 4&#176;C. The RNA was precipitated from the aqueous phase with 100 &#956;l isopropanol for several hours at -20&#176;C, pelleted by centrifugation, washed with 75% ethanol, dried and resuspended in 100 &#956;l of DEPC-treated water. Total RNA was separated in 10 % polyacrylamide-urea gels. Polyacrylamide gels were stained with ethidium bromide (0.3 &#956;g/l) in 1&#215; TBE buffer, rinsed with water and analyzed with a Lumi-Imager F1 system (Roche). Transcript sizes were determined by correlation to <it>Msp</it>I-digested DNA of plasmid pUC19.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence data</p>
            </st>
            <p>The <it>Nostoc punctiforme </it>ATCC 29133 (=PCC 73102) genome sequence was downloaded from the JGI website <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. <it>Synechococcus </it>RCC307 and WH7803 were obtained from the Genoscope <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
            <p>All other sequences were obtained from the finished or unfinished genomes website at Genbank <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> with the following accession numbers: <it>Anabaena variabilis </it>ATCC 29413, CP000117; <it>Crocosphaera watsonii </it>WH8501, AADV00000000; <it>Gloeobacter violaceus </it>PCC 7421, BA000045; <it>Lyngbya aestuarii </it>CCY9616; <it>Nodularia spumigena </it>CCY9414, <it>Anabaena </it>PCC 7120, <ext-link ext-link-type="gen" ext-link-id="NC_003272">NC_003272</ext-link>; <it>Prochlorococcus </it>strains: MIT9312, <ext-link ext-link-type="gen" ext-link-id="NC_007577">NC_007577</ext-link>; NATL2A, <ext-link ext-link-type="gen" ext-link-id="NC_007335">NC_007335</ext-link>; MIT9313, <ext-link ext-link-type="gen" ext-link-id="NC_005071">NC_005071</ext-link>; SS120, <ext-link ext-link-type="gen" ext-link-id="NC_005042">NC_005042</ext-link>; MIT9211, AALP00000000; MIT9303, <ext-link ext-link-type="gen" ext-link-id="NC_008820">NC_008820</ext-link>; AS9601, <ext-link ext-link-type="gen" ext-link-id="NC_008816">NC_008816</ext-link>; NATL1A, <ext-link ext-link-type="gen" ext-link-id="NC_008819">NC_008819</ext-link>; MIT9515, <ext-link ext-link-type="gen" ext-link-id="NC_008817">NC_008817</ext-link>; MED4 (=CCMP1986), <ext-link ext-link-type="gen" ext-link-id="NC_005072">NC_005072</ext-link>; <it>Synechococcus </it>strains: PCC7942, <ext-link ext-link-type="gen" ext-link-id="NC_007604">NC_007604</ext-link>; PCC 6301, <ext-link ext-link-type="gen" ext-link-id="NC_006576">NC_006576</ext-link>; OS-B' (JA-2-3B'a(2-13)), <ext-link ext-link-type="gen" ext-link-id="NC_007776">NC_007776</ext-link>; OS-A (JA-3-3Ab), <ext-link ext-link-type="gen" ext-link-id="NC_007775">NC_007775</ext-link>; CC9605, <ext-link ext-link-type="gen" ext-link-id="NC_007516">NC_007516</ext-link>; CC9902, <ext-link ext-link-type="gen" ext-link-id="NC_007513">NC_007513</ext-link>; WH8102, <ext-link ext-link-type="gen" ext-link-id="NC_005070">NC_005070</ext-link>; WH7805, AAOK00000000; WH5701, AANO00000000; RS9917 (=RCC556), AANP00000000; RS9916 (=RCC555), <ext-link ext-link-type="gen" ext-link-id="NZ_AAUA00000000">NZ_AAUA00000000</ext-link>; <it>Synechococcus </it>BL107, <ext-link ext-link-type="gen" ext-link-id="NZ_AATZ00000000">NZ_AATZ00000000</ext-link>; <it>Synechococcus </it>CC9311, <ext-link ext-link-type="gen" ext-link-id="NC_008319">NC_008319</ext-link>; <it>Synechocystis </it>sp. PCC 6803, <ext-link ext-link-type="gen" ext-link-id="NC_000911">NC_000911</ext-link>; <it>Thermosynechococcus elongatus </it>BP-1, <ext-link ext-link-type="gen" ext-link-id="NC_004113">NC_004113</ext-link>; <it>Trichodesmium erythraeum </it>IMS101, <ext-link ext-link-type="gen" ext-link-id="NC_008312">NC_008312</ext-link>.</p>
         </sec>
         <sec>
            <st>
               <p>Comparative sequence/structure analysis</p>
            </st>
            <p>Multiple sequence alignments were generated using ClustalW <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> with default parameters for DNA. Comparative structure prediction was done with RNAlishapes <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, a tool which predicts a consensus structure for a set of aligned sequences by taking covariance and free energy into account. The resulting consensus structure was analysed together with the multiple sequence alignment using RALEE <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. The latter served also for manual optimisation of the alignment and the consensus structure, respectively, and for the production of color annotated alignments. Homology searches were done using RNAMotif <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> based on a combined sequence structure motif description for Yfr1. The final RNAMotif descriptor was:</p>
            <p>descr</p>
            <p>&#160;&#160;&#160;h5(minlen = 5, maxlen = 10, mispair = 1, pair+={"g:u","u:g"}) ss(minlen = 3, maxlen = 8) h3</p>
            <p>&#160;&#160;&#160;ss(minlen = 12, maxlen = 20, seq="ACTCCTCACAC", mismatch = 0)</p>
            <p>&#160;&#160;&#160;h5(minlen = 6, maxlen = 9, mispair = 0, pair+={"g:u","u:g"}) ss(minlen = 3, maxlen = 15) h3</p>
            <p>&#160;&#160;&#160;ss(seq=" ^ [AUCG]UU$")</p>
            <p>score</p>
            <p>&#160;&#160;&#160;{ gcnt = 0;</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;len = length(h5 [5]);</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;loop = length(ss [6]);</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;for(i = 1; i &lt; = len; i++){</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;j = len - i + 1;</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;b1 = h5 [5, i, 1];</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;b2 = h3 [7, j, 1];</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;if((b1 == "g" &amp;&amp; b2 == "c") || (b1 == "c" &amp;&amp; b2 == "g"))</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;gcnt++;</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;}</p>
            <p>&#160;&#160;&#160;# require 65% GC in the stem!</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;SCORE = 1.0 * gcnt/len;</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;if(SCORE &lt; .65)</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;REJECT;</p>
            <p>&#160;&#160;&#160;}</p>
         </sec>
         <sec>
            <st>
               <p>Oligonucleotides for hybridization</p>
            </st>
            <p>Yfr1_multi_2 5'-GTGTGGTGTGAGGAGTGAACGGAA-3'</p>
            <p>Yfr1_gloeo_2 5'-CATGGTGTGAGGAGTGAACGGAAAC-3'</p>
            <p>SsA_Thermorev 5'-GCGACGCCGTTTTACCT-3'</p>
            <p>SsaA_6803rev 5'-CACCACGCCGTTTTACCT-3'</p>
            <p>SsaA_7120rev 5'-CGCAACGCCGTTTTACCT-3'</p>
         </sec>
         <sec>
            <st>
               <p>Hybridization conditions</p>
            </st>
            <p>Hybridization was performed at 54&#176;C for Yfr1 and 50&#176;C for 6S RNA in hybridization buffer (50% deionized formamide, 7% SDS, 250 mM NaCl, 120 mM Na(PO<sub>4</sub>), pH 7.2).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>ncRNA, non-coding RNA; sRNA, small RNA</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>GG and IMA performed RNA analyses in the laboratory, BV designed and carried out bioinformatic analyses and participated in drafting the manuscript. WRH designed research and wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the Deutsche Forschungsgemeinschaft Focus program "Sensory and regulatory RNAs in Prokaryotes" SPP1258 (project HE 2544/4-1), Marine Genomics Europe (GOCE-CT-2004-505403), and by the Landesstiftung Baden-Wuerttemberg (P-LS-RNS/40).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The small RNA regulators of Escherichia coli: roles and mechanisms</p>
            </title>
            <aug>
               <au>
                  <snm>Gottesman</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>58</volume>
            <fpage>303</fpage>
            <lpage>328</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.58.030603.123841</pubid>
                  <pubid idtype="pmpid" link="fulltext">15487940</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Micros for microbes: non-coding regulatory RNAs in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Gottesman</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>7</issue>
            <fpage>399</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2005.05.008</pubid>
                  <pubid idtype="pmpid" link="fulltext">15913835</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>How to find small non-coding RNAs in bacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sharma</snm>
                  <fnm>CM</fnm>
               </au>
            </aug>
            <source>Biol Chem</source>
            <pubdate>2005</pubdate>
            <volume>386</volume>
            <fpage>1219</fpage>
            <lpage>1238</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1515/BC.2005.140</pubid>
                  <pubid idtype="pmpid" link="fulltext">16336117</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Transcriptional analysis of the cyanobacterial gvpABC operon in differentiated cells: occurrence of an antisense RNA complementary to three overlapping transcripts</p>
            </title>
            <aug>
               <au>
                  <snm>Csiszar</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Houmard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Damerval</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Tandeau de Marsac</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1987</pubdate>
            <volume>60</volume>
            <issue>1</issue>
            <fpage>29</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0378-1119(87)90210-1</pubid>
                  <pubid idtype="pmpid">2450053</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>An internal antisense RNA regulates expression of the photosynthesis gene isiA</p>
            </title>
            <aug>
               <au>
                  <snm>D&#252;hring</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Axmann</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Wilde</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <fpage>7054</fpage>
            <lpage>7058</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1459017</pubid>
                  <pubid idtype="pmpid" link="fulltext">16636284</pubid>
                  <pubid idtype="doi">10.1073/pnas.0600927103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Identification of a furA cis antisense RNA in the cyanobacterium Anabaena sp. PCC 7120</p>
            </title>
            <aug>
               <au>
                  <snm>Hernandez</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Muro-Pastor</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Flores</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bes</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Peleato</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Fillat</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>355</volume>
            <fpage>325</fpage>
            <lpage>334</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2005.10.079</pubid>
                  <pubid idtype="pmpid" link="fulltext">16324715</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Identification of cyanobacterial non-coding RNAs by comparative genome analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Axmann</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Kensche</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kohl</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Herzel</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <issue>9</issue>
            <fpage>R73</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1242208</pubid>
                  <pubid idtype="pmpid" link="fulltext">16168080</pubid>
                  <pubid idtype="doi">10.1186/gb-2005-6-9-r73</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>6S RNA is a widespread regulator of eubacterial RNA polymerase that resembles an open promoter</p>
            </title>
            <aug>
               <au>
                  <snm>Barrick</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Sudarsan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Weinberg</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Ruzzo</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Breaker</snm>
                  <fnm>RR</fnm>
               </au>
            </aug>
            <source>RNA</source>
            <pubdate>2005</pubdate>
            <volume>11</volume>
            <issue>5</issue>
            <fpage>774</fpage>
            <lpage>784</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1370762</pubid>
                  <pubid idtype="pmpid" link="fulltext">15811922</pubid>
                  <pubid idtype="doi">10.1261/rna.7286705</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Two distinct types of 6S RNA in Prochlorococcus</p>
            </title>
            <aug>
               <au>
                  <snm>Axmann</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Holtzendorff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Voss</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kensche</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2007</pubdate>
            <fpage>epub ahead of print: doi:10.1016/j.gene.2007.06.011 </fpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Genetic manipulation of Prochlorococcus MIT9313: GFP expression on an RSF1010 plasmid and Tn5 transposition</p>
            </title>
            <aug>
               <au>
                  <snm>Tolonen</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Liszt</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Hess</snm>
                  <fnm>WR</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2006</pubdate>
            <volume>72</volume>
            <fpage>7607</fpage>
            <lpage>7613</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1694220</pubid>
                  <pubid idtype="pmpid" link="fulltext">17041154</pubid>
                  <pubid idtype="doi">10.1128/AEM.02034-06</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>A Computational Pipeline for High- Throughput Discovery of cis-Regulatory Noncoding RNA in Prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Yao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Barrick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Weinberg</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Neph</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Breaker</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tompa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ruzzo</snm>
                  <fnm>WL</fnm>
               </au>
            </aug>
            <source>PLoS Comput Biol</source>
            <pubdate>2007</pubdate>
            <volume>3</volume>
            <issue>7</issue>
            <fpage>e126</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1913097</pubid>
                  <pubid idtype="pmpid" link="fulltext">17616982</pubid>
                  <pubid idtype="doi">10.1371/journal.pcbi.0030126</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Generic assignments, strain histories and properties of pure cultures of cyanobacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Rippka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Deruelles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Waterbury</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Herdmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Stanier</snm>
                  <fnm>RY</fnm>
               </au>
            </aug>
            <source>J Gen Microbiol</source>
            <pubdate>1979</pubdate>
            <volume>111</volume>
            <fpage>1</fpage>
            <lpage>61</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Physiology and molecular phylogeny of coexisting Prochlorococcus ecotypes</p>
            </title>
            <aug>
               <au>
                  <snm>Moore</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Rocap</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chisholm</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1998</pubdate>
            <volume>393</volume>
            <issue>6684</issue>
            <fpage>464</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/30861</pubid>
                  <pubid idtype="pmpid" link="fulltext">9624000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Rapid diversification of marine picophytoplankton with dissimilar light-harvesting structures inferred from sequences of Prochlorococcus and Synechococcus (Cyanobacteria)</p>
            </title>
            <aug>
               <au>
                  <snm>Urbach</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Scanlan</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Distel</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Waterbury</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Chisholm</snm>
                  <fnm>SW</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1998</pubdate>
            <volume>46</volume>
            <issue>2</issue>
            <fpage>188</fpage>
            <lpage>201</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006294</pubid>
                  <pubid idtype="pmpid" link="fulltext">9452521</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Niche-partitioning of Prochlorococcus populations in a stratified water column in the eastern north Atlantic ocean</p>
            </title>
            <aug>
               <au>
                  <snm>West</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Scanlan</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>65</volume>
            <issue>6</issue>
            <fpage>2585</fpage>
            <lpage>2591</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">91382</pubid>
                  <pubid idtype="pmpid" link="fulltext">10347047</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>JGI website</p>
            </title>
            <url>http://genome.jgi-psf.org/finished_microbes/nospu/nospu.home.html</url>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Genoscope</p>
            </title>
            <url>http://www.genoscope.cns.fr/</url>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Genbank</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/genomes/lproks.cgi</url>
         </bibl>
         <bibl id="B19">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <issue>22</issue>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Structural analysis of aligned RNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Voss</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>19</issue>
            <fpage>5471</fpage>
            <lpage>5481</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1636479</pubid>
                  <pubid idtype="pmpid" link="fulltext">17020924</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>RALEE--RNA ALignment editor in Emacs</p>
            </title>
            <aug>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>2</issue>
            <fpage>257</fpage>
            <lpage>259</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/bth489</pubid>
                  <pubid idtype="pmpid" link="fulltext">15377506</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>RNAMotif, an RNA secondary structure definition and search algorithm</p>
            </title>
            <aug>
               <au>
                  <snm>Macke</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Ecker</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Gutell</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Gautheret</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Case</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sampath</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <issue>22</issue>
            <fpage>4724</fpage>
            <lpage>4735</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">92549</pubid>
                  <pubid idtype="pmpid" link="fulltext">11713323</pubid>
                  <pubid idtype="doi">10.1093/nar/29.22.4724</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>WebLogo: a sequence logo generator</p>
            </title>
            <aug>
               <au>
                  <snm>Crooks</snm>
                  <fnm>GE</fnm>
               </au>
               <au>
                  <snm>Hon</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chandonia</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Brenner</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>6</issue>
            <fpage>1188</fpage>
            <lpage>1190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419797</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173120</pubid>
                  <pubid idtype="doi">10.1101/gr.849004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>MEGA: Molecular Evolutionary Genetics Analysis software for microcomputers</p>
            </title>
            <aug>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1994</pubdate>
            <volume>10</volume>
            <issue>2</issue>
            <fpage>189</fpage>
            <lpage>191</lpage>
            <xrefbib>
               <pubid idtype="pmpid">8019868</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
