<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2229-6-2</ui>
   <ji>1471-2229</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>EST analysis of the scaly green flagellate <it>Mesostigma viride </it>(Streptophyta): Implications for the evolution of green plants (Viridiplantae)</p>
         </title>
         <aug>
            <au id="A1" ce="yes">
               <snm>Simon</snm>
               <fnm>Andreas</fnm>
               <insr iid="I1"/>
               <email>andreas.simon@uni-koeln.de</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Gl&#246;ckner</snm>
               <fnm>Gernot</fnm>
               <insr iid="I2"/>
               <email>gernot@imb-jena.de</email>
            </au>
            <au id="A3">
               <snm>Felder</snm>
               <fnm>Marius</fnm>
               <insr iid="I2"/>
               <email>mfelder@imb-jena.de</email>
            </au>
            <au id="A4">
               <snm>Melkonian</snm>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <email>michael.melkonian@uni-koeln.de</email>
            </au>
            <au id="A5" ca="yes">
               <snm>Becker</snm>
               <fnm>Burkhard</fnm>
               <insr iid="I1"/>
               <email>b.becker@uni-koeln.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Botanical Institute, University of Cologne, Gyrhofstr. 15, 50931 Cologne, Germany</p>
            </ins>
            <ins id="I2">
               <p>Genome Analysis, Leibniz Institute for Age Research &#8211; Fritz Lipmann Institute, Beutenbergstr. 11, 07745 Jena, Germany</p>
            </ins>
         </insg>
         <source>BMC Plant Biology</source>
         <issn>1471-2229</issn>
         <pubdate>2006</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>2</fpage>
         <url>http://www.biomedcentral.com/1471-2229/6/2</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16476162</pubid>
               <pubid idtype="doi">10.1186/1471-2229-6-2</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>31</day>
               <month>10</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>13</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>13</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Simon et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The Viridiplantae (land plants and green algae) consist of two monophyletic lineages, the Chlorophyta and the Streptophyta. The Streptophyta include all embryophytes and a small but diverse group of freshwater algae traditionally known as the Charophyceae (e.g. Charales, <it>Coleochaete </it>and the Zygnematales). The only flagellate currently included in the Streptophyta is <it>Mesostigma viride </it>Lauterborn. To gain insight into the genome evolution in streptophytes, we have sequenced 10,395 ESTs from <it>Mesostigma </it>representing 3,300 independent contigs and compared the ESTs of <it>Mesostigma </it>with available plant genomes (<it>Arabidopsis, Oryza</it>, <it>Chlamydomonas</it>), with ESTs from the bryophyte <it>Physcomitrella</it>, the genome of the rhodophyte <it>Cyanidioschyzon</it>, the ESTs from the rhodophyte <it>Porphyra</it>, and the genome of the diatom <it>Thalassiosira</it>.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The number of expressed genes shared by <it>Mesostigma </it>with the embryophytes (90.3 % of the expressed genes showing similarity to known proteins) is higher than with <it>Chlamydomonas </it>(76.1 %). In general, cytosolic metabolic pathways, and proteins involved in vesicular transport, transcription, regulation, DNA-structure and replication, cell cycle control, and RNA-metabolism are more conserved between <it>Mesostigma </it>and the embryophytes than between <it>Mesostigma </it>and <it>Chlamydomonas</it>. However, plastidic and mitochondrial metabolic pathways, cytoskeletal proteins and proteins involved in protein folding are more conserved between <it>Mesostigma </it>and <it>Chlamydomonas </it>than between <it>Mesostigma </it>and the embryophytes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our EST-analysis of <it>Mesostigma </it>supports the notion that this organism should be a suitable unicellular model for the last flagellate common ancestor of the streptophytes. <it>Mesostigma </it>shares more genes with the embryophytes than with the chlorophyte <it>Chlamydomonas reinhardtii</it>, although both organisms are flagellate unicells. Thus, it seems likely that several major physiological changes (e.g. in the regulation of photosynthesis and photorespiration) took place early during the evolution of streptophytes, i.e. before the transition to land.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The Viridiplantae (literally meaning green plants) include all green algae and embryophyte plants. They represent a monophyletic group of organisms, which display a surprising diversity with respect to their morphology, cell architecture, life histories and reproduction, and their biochemistry. The colonization of the terrestrial habitat by streptophyte algae 450 &#8211; 470 million years ago [reviewed in <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>] was undoubtedly one of the most important steps in the evolution of life on earth <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, which paved the way for the evolution of the various groups of land plants (embryophytes = bryophytes, pteridophytes and spermatophytes) resulting in our current terrestrial ecosystems <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>.</p>
         <p>A thorough understanding of the evolution of land plants requires knowledge about the phylogeny of green algae and embryophytes as well as insight into the evolution of plant genomes with special reference to developmental processes. Whereas our knowledge about the phylogeny of the Viridiplantae has greatly increased over the last years, the latter has hardly been addressed to date.</p>
         <p>The Viridiplantae are grouped into two divisions: the Chlorophyta and the Streptophyta <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The Chlorophyta comprise the vast majority of green algae including most scaly green flagellates (e.g. <it>Pyramimonas</it>, <it>Tetraselmis</it>), the Ulvophyceae (e.g. <it>Ulva</it>, <it>Acetabularia</it>), Chlorophyceae (e.g. <it>Chlamydomonas</it>, <it>Volvox</it>) and Trebouxiophyceae (e.g. <it>Chlorella</it>) <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. The Streptophyta include all embryophyte plants and a diverse paraphyletic assemblage of freshwater green algae, the Charales (stoneworts), <it>Coleochaete</it>, the Zygnematophyceae and a few other taxa <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Currently, the Charales are thought to be the sister group of the embryophytes suggesting that the evolution of true land plants already started with a complex organism <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Remarkably, only a single scaly green flagellate <it>Mesostigma viride </it>Lauterborn, has been found to belong to the Streptophyta <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. The exact phylogenetic position of <it>Mesostigma viride</it>, however, is still controversial <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. <it>Mesostigma </it>has recently attracted much attention as a putative key organism for the understanding of the early evolution of the Streptophyta <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>.</p>
         <p>Two aspects in the evolution of land plants seem to be important in this respect. First, many key evolutionary inventions of plants took already place within the streptophyte algae. According to Graham et al. <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> one can distinguish several major transitions in the evolution of land plants starting with a <it>Mesostigma</it>-like flagellate ancestor: development of a cellulosic cell wall, multicellularity, cytokinesis by a phragmoplast, plasmodesmata, apical meristematic cell and apical cell proliferation leading to branching, asymmetric cell division, cell differentiation, retention of zygotes, heteromorphic life history, and a root meristem. Of these distinguishing features only the latter two evolved not until the embryophytes emerged. Second, the colonization of the terrestrial habitat with its exposure to air, increased solar radiation and life in a desiccating environment led to adaptations of cell architecture, metabolism and body plan to survive in the terrestrial ecosystems <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. The evolutionary history of these adaptations is currently not known. Important questions are: How did the green algal progenitor adapt to the terrestrial habitat? Which genomic changes were associated with this transition? And which of these genes are derived from streptophyte green algae? To gain insight into these questions we have started to analyze ESTs from various streptophyte algal lineages.</p>
         <p>Here, we present an analysis of 10,395 ESTs representing 3306 non-redundant expressed genes obtained from <it>Mesostigma viride</it>. We show that the number of genes shared is higher between <it>Mesostigma </it>and the embryophytes than between <it>Mesostigma </it>and <it>Chlamydomonas</it>. Comparison of expressed genes from <it>Mesostigma </it>with the genomes of <it>Arabidopsis, Chlamydomonas</it>, the red alga <it>Cyanidioschyzon</it>, and rice as well as ESTs from <it>Physcomitrella </it>and <it>Porphyra </it>allowed us to identify conserved and derived cellular functions within the different evolutionary lines and to obtain a first insight into the metabolic capabilities of the flagellate ancestor of green plants.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Preparation and characterization of libraries</p>
            </st>
            <p>Total RNA was isolated from an axenic culture of <it>Mesostigma viride </it>during the light phase. The culture contained about 5 % cell division stages. The isolated RNA was used for the construction of 4 different cDNA libraries (Meso 1 &#8211; Meso 4). Meso 1 and 2 differed in the size of the cloned inserts. For Meso 3 and 4 full-length enriched cDNA was prepared and normalized prior to cloning. Meso 3 was obtained from the total normalized full-length enriched cDNA, whereas for Meso 4 the normalized full-length enriched cDNA was size-fractionated by gel permeation chromatography to remove small fragments. The basic characteristics of the four libraries are given in Table <tblr tid="T1">1</tblr>.</p>
            <tbl id="T1" hint_layout="double">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p><it>Mesostigma viride </it>cDNA libraries used.</p>
               </caption>
               <tblbdy cols="7">
                  <r>
                     <c ca="left">
                        <p>Name</p>
                     </c>
                     <c ca="center">
                        <p>cDNA</p>
                     </c>
                     <c ca="center">
                        <p>Number of primary clones</p>
                     </c>
                     <c ca="center">
                        <p>Percentage recombinant clones</p>
                     </c>
                     <c ca="center">
                        <p>size of inserts<sup>1 </sup>(bp)</p>
                     </c>
                     <c ca="center">
                        <p>average size of inserts<sup>1 </sup>(bp)</p>
                     </c>
                     <c ca="center">
                        <p>number of ESTs sequenced</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Meso1</p>
                     </c>
                     <c ca="center">
                        <p>small size fraction</p>
                     </c>
                     <c ca="center">
                        <p>310 000</p>
                     </c>
                     <c ca="center">
                        <p>90</p>
                     </c>
                     <c ca="center">
                        <p>250&#8211;2000</p>
                     </c>
                     <c ca="center">
                        <p>706</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Meso2</p>
                     </c>
                     <c ca="center">
                        <p>large size fraction</p>
                     </c>
                     <c ca="center">
                        <p>295 000</p>
                     </c>
                     <c ca="center">
                        <p>88</p>
                     </c>
                     <c ca="center">
                        <p>600&#8211;3200</p>
                     </c>
                     <c ca="center">
                        <p>1142</p>
                     </c>
                     <c ca="center">
                        <p>4954</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Meso3</p>
                     </c>
                     <c ca="center">
                        <p>total cDNA normalized</p>
                     </c>
                     <c ca="center">
                        <p>106 000</p>
                     </c>
                     <c ca="center">
                        <p>63</p>
                     </c>
                     <c ca="center">
                        <p>100&#8211;1100</p>
                     </c>
                     <c ca="center">
                        <p>650</p>
                     </c>
                     <c ca="center">
                        <p>535</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Meso4</p>
                     </c>
                     <c ca="center">
                        <p>large size fraction normalized</p>
                     </c>
                     <c ca="center">
                        <p>304 000</p>
                     </c>
                     <c ca="center">
                        <p>56</p>
                     </c>
                     <c ca="center">
                        <p>400&#8211;6600</p>
                     </c>
                     <c ca="center">
                        <p>2025</p>
                     </c>
                     <c ca="center">
                        <p>2403 cDNAs = 4806 ESTs<sup>2</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>1 </sup>determined by agarose gel electrophoresis, <sup>2</sup>sequenced from 3' and 5'end</p>
               </tblfn>
            </tbl>
            <p>Initially, about 100&#8211;500 ESTs were sequenced from all libraries and analyzed by BLASTX against the Swissprot and translated Genbank databases. Since the Meso 2 and 4 libraries containing the larger inserts gave more promising results, we subsequently sequenced about 4000 additional ESTs from the Meso 2 and Meso 4 libraries, respectively yielding a total of 10,395 reads (5,527,413 bp). Based on comparison with published sequences from <it>Mesostigma viride </it>the rate of sequencing error was determined to be generally between 1% and 7 % (average 4 %) depending on the quality of the sequence.</p>
            <p>ESTs were assembled using the PHRAP software yielding 3300 contigs with an average size of 769 bases (57 &#8211; 4452 bases) after manual curation. Further analysis based on sequence similarity searches revealed that 294 of these contigs were of plastidic, mitochondrial, or possibly bacterial origin (sequences showing the highest similarity to organellar or bacterial genomes, Table <tblr tid="T2">2</tblr>). These contigs were excluded from the data set. 1315 of the 3006 contigs analyzed (44%) showed significant similarity at the protein level to sequences from the public databases (Table <tblr tid="T2">2</tblr>). Hence, approximately 56% of the contigs represent either novel sequences with unknown function or untranslated regions of a gene. However, when the 1691 contigs with no significant similarity to known proteins were searched against the Interpro protein motif database, 574 (33.9%) of these contigs contained a recognizable protein motif (Table <tblr tid="T2">2</tblr>). The most common protein motifs found in all 3006 expressed gene sequences were bipartite nuclear localization signals (IPR001472, 197x), proline-rich regions (IPR000694, 150x) and cytochrome c heme-binding sites (IPR000345, 99x).</p>
            <tbl id="T2" hint_layout="single">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Summary of <it>Mesostigma viride </it>expressed genes obtained from four cDNA libraries (Meso 1 &#8211; Meso 4).</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Category</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>No of contigs</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mitochondrial<sup>1</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plastidic<sup>1</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>65</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Bacterial<sup>2</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>193</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Novel</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>1691</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>with recognizable protein motif</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>574</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>no protein motifs</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>1117</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Similarity</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>1315</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>with known function<sup>3</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>574</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>unknown function<sup>4</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>395</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>low similarity<sup>5</sup></p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>346</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total Contigs</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>3300</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>1 </sup>sequences showing only similarity to organelle genomes. <sup>2 </sup>sequences showing only similarity to bacterial sequences or the highest similarity to bacterial sequences; the origin of these putative bacterial contaminations is currently not clear, as bacteria-free cultures of <it>Mesostigma </it>were used. <sup>3</sup>similarity to proteins with a well-defined function (BLAST score >100). <sup>4</sup>similarity to conserved proteins with no established function (BLAST score >100). <sup>5</sup>low similarity to proteins from a few organisms (BLAST score generally between 100 and 200); might reflect conserved protein domains.</p>
               </tblfn>
            </tbl>
            <p>A functional catalogue was assembled using the 3006 <it>Mesostigma </it>contigs and the KOG-database and is presented in Table <tblr tid="T3">3</tblr>. As expected for an interphase cell, genes in the categories (1) translation, ribosomal structure and biogenesis (168), (2) posttranslational modification, protein turnover, chaperones (101), and (3) energy production and conversion (87) are represented by the largest number of contigs (Table <tblr tid="T3">3</tblr>). In the following, the assembled contigs are referred to as (expressed) genes.</p>
            <tbl id="T3" hint_layout="single">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Functional classification of 3006 <it>Mesostigma viride </it>contigs using the KOG system [43] and an expectation threshold of e = 10<sup>-7</sup>.</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Functional Category</p>
                     </c>
                     <c ca="center">
                        <p>No. of Contigs</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>INFORMATION STORAGE AND PROCESSING</p>
                     </c>
                     <c indent="1" ca="center">
                        <p>236</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[J] Translation, ribosomal structure and biogenesis</p>
                     </c>
                     <c ca="center">
                        <p>168</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[A] RNA processing and modification</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[K] Transcription</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[L] Replication, recombination and repair</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[B] Chromatin structure and dynamics</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CELLULAR PROCESSES AND SIGNALING</p>
                     </c>
                     <c indent="1" ca="center">
                        <p>209</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[D] Cell cycle control, cell division, chromosome partitioning</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[Y] Nuclear structure</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[V] Defense mechanisms</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[T] Signal transduction mechanisms</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[M] Cell wall/membrane/envelope biogenesis</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[N + Z] Cytoskeleton and cell motility</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[W] Extracellular structures</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[U] Intracellular trafficking, secretion, and vesicular transport</p>
                     </c>
                     <c ca="center">
                        <p>41</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[O] Posttranslational modification, protein turnover, chaperones</p>
                     </c>
                     <c ca="center">
                        <p>101</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>METABOLISM</p>
                     </c>
                     <c indent="1" ca="center">
                        <p>212</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[C] Energy production and conversion</p>
                     </c>
                     <c ca="center">
                        <p>87</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[G] Carbohydrate transport and metabolism</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[E] Amino acid transport and metabolism</p>
                     </c>
                     <c ca="center">
                        <p>24</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[F] Nucleotide transport and metabolism</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[H] Coenzyme transport and metabolism</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[I] Lipid transport and metabolism</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[P] Inorganic ion transport and metabolism</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[Q] Secondary metabolites biosynthesis, transport and catabolism</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>POORLY CHARACTERIZED</p>
                     </c>
                     <c indent="1" ca="center">
                        <p>158</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[R] General function prediction only</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[S] Function unknown</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>[X] Unnamed protein</p>
                     </c>
                     <c ca="center">
                        <p>66</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Unknown</p>
                     </c>
                     <c ca="center">
                        <p>2191</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Classification of <it>Mesostigma </it>ESTs according to homologous genes in other organisms</p>
            </st>
            <p>EST data represent only a fraction of all genes of an organism. Thus, comparisons of EST data alone cannot be used to describe unique or shared genes of an organism. For embryophytes, chlorophytes and red algae complete genome sequences of at least one organism exist. This makes it possible to find potential orthologous genes if present. Moreover, the surplus of genes of an organism in respect to a complete genome can be detected in EST data. In tBLASTX analyses of the 1315 expressed genes with similarity to known proteins 90.3 % matched proteins from streptophytes, 76.1 % from chlorophytes and 61 % from rhodophytes, respectively. In addition, 46 genes showed similarity to known proteins, which have not been reported from plants or red algae to date. The overlap of <it>Mesostigma </it>genes with different organisms can be visualized in a Venn diagram (Figure <figr fid="F1">1</figr>). For 211 genes, we detected similar proteins only within the streptophyte but not in the chlorophyte or rhodophyte lineages. Conversely, for 62 genes we detected similar proteins only within the chlorophyte but not in the streptophyte or rhodophyte lineages. Surprisingly, we also found 6 genes which showed significant similarity to rhodophyte proteins but for which we could not detect any similar protein sequences within the Viridiplantae. Removal of BLAST hits with significant but low similarity (see Table <tblr tid="T2">2</tblr>) reduced the overall numbers to 972 expressed genes, but gave similar results (Figure <figr fid="F1">1</figr>). A complete list of genes showing only similarity to proteins with known functions present in specific subgroups of organisms can be found in supplemental Table 1 [see <supplr sid="S1">Additional file 1</supplr>]. We will discuss important differences below.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Classification of expressed genes from <it>Mesostigma </it>according to the presence of similar proteins in other organisms in a Venn diagram</p>
               </caption>
               <text>
                  <p><b>Classification of expressed genes from <it>Mesostigma </it>according to the presence of similar proteins in other organisms in a Venn diagram</b>. All non-redundant expressed genes were used as a query in (t)blastx similarity searches with the Swissprot, Genbank, <it>Chlamydomonas</it>, <it>Cyanidioschyzon</it>, <it>Porphyra</it>, <it>Physcomitrella</it>, <it>Arabidopsis </it>and <it>Oryza </it>data sets. The outermost circle represents all <it>Mesostigma </it>expressed genes. The inner circles, which are labeled chlorophyte, streptophyte and rhodophyte, represent genes, which have similarity to chlorophyte, streptophyte or rhodophyte sequences, respectively. The areas depicted are not proportional to the gene numbers and the number of <it>Mesostigma </it>expressed genes in each category is written in each segment. Numbers in brackets indicate the number of expressed genes in a category after removal of low similarity hits (see Table 2 for a definition of low similarity hits).</p>
               </text>
               <graphic file="1471-2229-6-2-1" hint_layout="single"/>
            </fig>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p>Supplemental Table 1</p>
               </text>
               <file name="1471-2229-6-2-S1.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Overall protein similarities between various photoautotrophic organisms</p>
            </st>
            <p>To compare the overall similarity between <it>Mesostigma </it>and various photoautotrophic organisms with completed genomes or large data sets of ESTs, we decided to calculate the average identity of a protein between <it>Mesostigma </it>and the various organisms. To compare <it>Mesostigma </it>genes with the genomes or ESTs from different organisms, we calculated the average identity (AI) between <it>Mesostigma </it>and another organism as the mean value of all pair wise identities of the BLAST-matches for each organism (Table <tblr tid="T4">4</tblr>).</p>
            <tbl id="T4" hint_layout="double">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Comparison of the <it>Mesostigma </it>expressed genes with the genomes and ESTs from various organisms. Average identity (AI) of pair wise comparisons of <it>Mesostigma </it>expressed genes with the indicated organismal data set.</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c ca="left">
                        <p>Data set  (No. of contigs)</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>
                           <it>Chlamydomonas</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Cyanidioschyzon</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Porphyra</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Thalassiosira</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Physcomitrella</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Arabidopsis</it>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <it>Oryza</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>Genome</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>Genome</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>Genome</p>
                     </c>
                     <c ca="center">
                        <p>EST</p>
                     </c>
                     <c ca="center">
                        <p>Genome</p>
                     </c>
                     <c ca="center">
                        <p>Genome</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>All (969)</p>
                     </c>
                     <c ca="center">
                        <p>0.573</p>
                     </c>
                     <c ca="center">
                        <p>0.563</p>
                     </c>
                     <c ca="center">
                        <p>0.522</p>
                     </c>
                     <c ca="center">
                        <p>0.528</p>
                     </c>
                     <c ca="center">
                        <p>0.528</p>
                     </c>
                     <c ca="center">
                        <p>0.590</p>
                     </c>
                     <c ca="center">
                        <p>0.577</p>
                     </c>
                     <c ca="center">
                        <p>0.565</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Constrained (314)</p>
                     </c>
                     <c ca="center">
                        <p>0.648</p>
                     </c>
                     <c ca="center">
                        <p>0.653 (n = 244)<sup>1)</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.557</p>
                     </c>
                     <c ca="center">
                        <p>0.585 (n = 188)<sup>1)</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.569</p>
                     </c>
                     <c ca="center">
                        <p>0.675 (n = 301)<sup>1)</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.679</p>
                     </c>
                     <c ca="center">
                        <p>0.671</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Evolutionary distance D<sup>2)</sup></p>
                     </c>
                     <c ca="center">
                        <p>0.473</p>
                     </c>
                     <c ca="center">
                        <p>0.463</p>
                     </c>
                     <c ca="center">
                        <p>0.658</p>
                     </c>
                     <c ca="center">
                        <p>0.597</p>
                     </c>
                     <c ca="center">
                        <p>0.631</p>
                     </c>
                     <c ca="center">
                        <p>0.425</p>
                     </c>
                     <c ca="center">
                        <p>0.418</p>
                     </c>
                     <c ca="center">
                        <p>0.432</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The total data set contains all <it>Mesostigma </it>expressed genes with significant similarity to proteins from other organisms with known or unknown function (see Table 2). The constrained data set contains only <it>Mesostigma </it>expressed genes with significant similarity to proteins in all completely sequenced eukaryotic autotroph organisms.</p>
                  <p><sup>1) </sup>Number of ESTs showing similarity to <it>Mesostigma </it>expressed genes from the constrained data set in a tBLASTX analysis. <sup>2) </sup>Evolutionary distances were calculated using the constrained data set and the approximation given by Kimura [28]: D = -ln (1 - p - 0.2 p<sup>2</sup>), where p is the fraction of amino acid that differs between the two species.</p>
               </tblfn>
            </tbl>
            <p>The AI between <it>Mesostigma </it>and <it>Chlamydomonas </it>or the embryophytes are very similar. The highest AI value obtained was for <it>Physcomitrella/Mesostigma </it>followed by <it>Arabidopsis/Mesostigma</it>, <it>Chlamydomonas/ Mesostigma </it>and <it>Oryza/Mesostigma</it>. The full data set includes many proteins, which we detected only in some species using <it>Mesostigma </it>expressed genes as a query. Therefore, we constructed a constrained data set (314 expressed genes, including at least 46 nuclear encoded plastidic, 9 nuclear encoded mitochondrial, and 73 cytosolic ribosomal proteins), containing only <it>Mesostigma </it>genes which gave matches with all completed genomes from photoautotrophic eukaryotic organisms (including the diatom <it>Thalassiosira</it>). This constrained data set represents a conserved core set of nuclear encoded expressed proteins from photoautotrophic eukaryote organisms. We calculated AI values for the constrained data set using complete genomes and the available ESTs of <it>Physcomitrella</it>, <it>Porphyra</it>, and <it>Chlamydomonas</it>. The results are included in Table <tblr tid="T4">4</tblr>. We obtained the highest AI-values in the constrained data set for the three embryophytes, followed by <it>Chlamydomonas</it>. The similar AI values for the three different embryophytes suggest that the overall evolutionary rate was very similar for the embryophytes investigated, when compared with <it>Mesostigma </it>(see below).</p>
            <p>To test whether the observed differences are significant a paired students t-test was performed, and the results are shown in Table <tblr tid="T5">5</tblr>. Applying a significance level of 0.0072 [0.05/7 Bonferroni adjustment <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>] the differences in AI between <it>Mesostigma</it>/<it>Chlamydomonas </it>and <it>Mesostigma</it>/embryophytes are highly significant (Table <tblr tid="T5">5</tblr>), whereas the differences in AI among the embryophytes are not significant (Table <tblr tid="T5">5</tblr>). Furthermore, when we varied the numbers of expressed genes used for the calculation of the AI, we observed that when more than 100 ESTs were included the significance of the differences became very stable (Fig. <figr fid="F2">2A</figr>). In addition, to evaluate the consistency of the data set we calculated 8 times the AI for 150 randomly selected expressed genes from the constrained data set. A clear difference between the AI from the various organisms was always observed (Fig. <figr fid="F2">2B</figr> 1 &#8211; 8). The expression level of the expressed genes (as revealed by the number ESTs in a contig) had no effect on the differences between the investigated organisms (Fig. <figr fid="F2">2B</figr>, compare 9 and 10), although highly expressed genes are better conserved (Fig. <figr fid="F2">2B</figr>, 9 and 10).</p>
            <tbl id="T5" hint_layout="double">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Statistical significance of the obtained AI values. A paired students t-test was performed for the constrained data set to test whether the observed differences between the average identity of pair wise comparisons of <it>Mesostigma </it>expressed genes with the indicated organismal data set are significant. Differences are considered significant when p is &lt; 0.0071 (0.05/8 Bonferroni adjustment [22]).</p>
               </caption>
               <tblbdy cols="8">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Variable<sup>1</sup></p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>No. of genes shared</b>
                           <sup>
                              <b>2</b>
                           </sup>
                        </p>
                     </c>
                     <c ca="center">
                        <p>mean<sup>3</sup></p>
                     </c>
                     <c ca="center">
                        <p>standard deviation<sup>4</sup></p>
                     </c>
                     <c ca="center">
                        <p>t-value</p>
                     </c>
                     <c ca="center">
                        <p>Degrees of freedom</p>
                     </c>
                     <c ca="center">
                        <p>&#961;</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="8">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p><it>Mesostigma/Chlamydomonas </it>G <it>Mesostigma/Chlamydomonas </it>E</p>
                     </c>
                     <c ca="center">
                        <p>244</p>
                     </c>
                     <c ca="center">
                        <p>0.652992 0.652992</p>
                     </c>
                     <c ca="center">
                        <p>0.149868 0.151255</p>
                     </c>
                     <c ca="center">
                        <p>-0,03107</p>
                     </c>
                     <c ca="center">
                        <p>243</p>
                     </c>
                     <c ca="center">
                        <p>0.975239</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p><it>Mesostigma/Chlamydomonas </it>G <it>Mesostigma/Physcomitrella E</it></p>
                     </c>
                     <c ca="center">
                        <p>301</p>
                     </c>
                     <c ca="center">
                        <p>0.649934 0.674618</p>
                     </c>
                     <c ca="center">
                        <p>0.149292 0.153641</p>
                     </c>
                     <c ca="center">
                        <p>-3.24578</p>
                     </c>
                     <c ca="center">
                        <p>300</p>
                     </c>
                     <c ca="center">
                        <p>0.001304</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p><it>Mesostigma/Chlamydomonas </it>G <it>Mesostigma/Arabidopsis </it>G</p>
                     </c>
                     <c ca="center">
                        <p>314</p>
                     </c>
                     <c ca="center">
                        <p>0.648057 0.677994</p>
                     </c>
                     <c ca="center">
                        <p>0.148512 0.140940</p>
                     </c>
                     <c ca="center">
                        <p>-4.44025</p>
                     </c>
                     <c ca="center">
                        <p>313</p>
                     </c>
                     <c ca="center">
                        <p>0.000012</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p><it>Mesostigma/Chlamydomonas </it>G <it>Mesostigma./Oryza </it>G</p>
                     </c>
                     <c ca="center">
                        <p>314</p>
                     </c>
                     <c ca="center">
                        <p>0.648057 0.670382</p>
                     </c>
                     <c ca="center">
                        <p>0.148512 0.148384</p>
                     </c>
                     <c ca="center">
                        <p>-3.15371</p>
                     </c>
                     <c ca="center">
                        <p>313</p>
                     </c>
                     <c ca="center">
                        <p>0.001768</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>Mesostigma/Physcomitrella <it>E Mesostigma/Arabidopsis </it>G</p>
                     </c>
                     <c ca="center">
                        <p>302</p>
                     </c>
                     <c ca="center">
                        <p>0.675364 0.681159</p>
                     </c>
                     <c ca="center">
                        <p>0.153933 0.140567</p>
                     </c>
                     <c ca="center">
                        <p>-135783</p>
                     </c>
                     <c ca="center">
                        <p>301</p>
                     </c>
                     <c ca="center">
                        <p>0.175535</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>Mesostigma/Physcomitrella <it>E Mesostigma./Oryza </it>G</p>
                     </c>
                     <c ca="center">
                        <p>302</p>
                     </c>
                     <c ca="center">
                        <p>0.675364 0.673311</p>
                     </c>
                     <c ca="center">
                        <p>0.153933 0.147624</p>
                     </c>
                     <c ca="center">
                        <p>0,43158</p>
                     </c>
                     <c ca="center">
                        <p>301</p>
                     </c>
                     <c ca="center">
                        <p>0.666355</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p><it>Mesostigma/Arabidopsis </it>G <it>Mesostigma./Oryza </it>G</p>
                     </c>
                     <c ca="center">
                        <p>314</p>
                     </c>
                     <c ca="center">
                        <p>0.678730 0.671175</p>
                     </c>
                     <c ca="center">
                        <p>0.141321 0.148813</p>
                     </c>
                     <c ca="center">
                        <p>2.480053</p>
                     </c>
                     <c ca="center">
                        <p>313</p>
                     </c>
                     <c ca="center">
                        <p>0.013660</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>1 </sup>Compared data sets E = ESTs, G = Genome. <sup>2 </sup>No. of genes shared between the compared data sets. <sup>3 </sup>AI recalculated on the basis of the genes shared between the compared datasets. <sup>4 </sup>Standard deviation of <sup>3</sup>.</p>
               </tblfn>
            </tbl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Consistency of the constrained data set used to calculate AI value</p>
               </caption>
               <text>
                  <p><b>Consistency of the constrained data set used to calculate AI values. </b>(A) The figure illustrates the effect of the number of genes included in the AI-values. The significant differences in the AI values are stable when more than 150 genes are included. (B) 150 genes were resampled randomly and the AIs calculated for the indicated organisms (1 &#8211; 8). AI values were calculated for the 150 most strongly (9, as revealed by the number of ESTs in a contig) and weakly (10, only single ESTs) expressed genes.<b/></p>
               </text>
               <graphic file="1471-2229-6-2-2" hint_layout="single"/>
            </fig>
            <p>Two other results are remarkable. First, for the calculation of the AI it is possible to use large EST-data sets instead of genomes. We obtained the same result for <it>Mesostigma</it>/<it>Chlamydomonas </it>genome and for <it>Mesostigma</it>/<it>Chlamydomonas </it>ESTs (AI = 0.653 for both data sets; p = 0,975, Table <tblr tid="T5">5</tblr>, using 244 expressed genes from <it>Mesostigma</it>). Similarly, when <it>Mesostigma</it>/<it>Physcomitrella </it>ESTs were compared with the <it>Mesostigma</it>/<it>Arabidopsis </it>genome and with the <it>Mesostigma</it>/<it>Oryza </it>genome only small differences were observed (AI = 0,675/0,681; 0,675/0,673 respectively, using 302 expressed genes from <it>Mesostigma</it>, Table <tblr tid="T5">5</tblr>). Statistical analysis (paired students t-test) showed that the observed differences are not significant. Furthermore, we note that the genome of the diatom <it>Thalassiosira pseudonana </it>shows a similar AI in respect to <it>Mesostigma </it>as the red algal genome and ESTs (Table <tblr tid="T4">4</tblr>). The difference values of these distantly related genomes represent presumably an upper threshold for reasonable AI value calculations.</p>
         </sec>
         <sec>
            <st>
               <p>Analysis of metabolic pathways</p>
            </st>
            <p>ESTs have been widely used for the identification of metabolic pathways <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. A complete list of all metabolic pathways identified is presented in supplemental Table 2 [see <supplr sid="S2">Additional file 2</supplr>]. Indeed, many ESTs showed similarity to proteins required for photosynthesis (66 expressed genes), nucleotide synthesis (6), nucleotide sugar conversion, the biosynthesis of precursors of scale polysaccharides (6), heme and chlorophyll biosynthesis (6), fatty acid and lipid biosynthesis (9), terpenoid biosynthesis (6), glycolysis (11) and the TCA-cycle including pyruvate dehydrogenase and respiration (12). The biosynthetic pathways for several amino acids were also well represented in our ESTs (21 expressed genes for Ala, Arg, Gly, Ile, Leu, Lys, Pro, Ser, Thr, Trp and Val). However, for several other amino acids (Asn, Asp, Cys, Gln, Glu, His, Met, Phe, Tyr) we did not find a single EST which could be matched to the known biosynthetic pathways.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Supplemental Table 2</p>
               </text>
               <file name="1471-2229-6-2-S2.doc">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>All enzymes except one (triose isomerase) of the Calvin cycle are represented by at least one EST. Interestingly, we found several genes coding for subunits of the plastidic GAPDH. In angiosperms the plastidic GAPDH consists of an A<sub>2</sub>B<sub>2 </sub>heterotetramer <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Compared to GAPDH A, which is present in the plastids of all eukaryotic algae, GAPDH B has a C-terminal extension that contains the two conserved cysteine residues, which are required for regulation by the thioredoxin system. To our knowledge, GAPDH B has only been reported from streptophytes. Two genes of <it>Mesostigma </it>showed significant similarity to GAPDH B from angiosperms. We present an alignment of the C-terminus of <it>Mesostigma </it>GAPDH B with the C-terminus of spinach GAPDH B in Figure <figr fid="F3">3</figr>. The two sequences are very similar and the two cysteines required for regulation by the thioredoxin system are conserved in <it>Mesostigma </it>indicating that the activity of plastidic GAPDH came under the control of the thioredoxin system early during the evolution of streptophytes. We found no evidence for a GAPDH B in <it>Chlamydomonas </it>or other chlorophytes. Therefore, the evolution of a GAPDH B might represent a molecular characteristic (synapomorphy) of the streptophytes.</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Alignment of the deduced amino acid sequence of the putative GAPDHB (Meso2a42g12) gene from <it>Mesostigma </it>with spinach (P12860) GAPDHB</p>
               </caption>
               <text>
                  <p><b>Alignment of the deduced amino acid sequence of the putative GAPDHB (Meso2a42g12) gene from <it>Mesostigma </it>with spinach (P12860) GAPDHB</b>. The conserved cysteine residues are indicated in red letters. Numbers refer to the amino acid position (spinach) or nucleotide position (<it>Mesostigma</it>).</p>
               </text>
               <graphic file="1471-2229-6-2-3" hint_layout="single"/>
            </fig>
            <p>A total of 25 expressed genes encode components of the light-harvesting complex. There are some light-harvesting complex proteins, which <it>Mesostigma </it>shares only with the chlorophytes and red algae (e.g. so called fucoxanthin/chlorophyll a-binding proteins). For others, we detected similar proteins only within embryophytes. However, the lhc proteins form a large superfamily and their phylogenetic analysis is beyond the scope of this study.</p>
            <p>Several genes encode proteins of the photorespiratory C2-cycle (glycolate phosphatase, peroxisomal glycolate oxidase, a component of the glycine decarboxylase enzyme complex, and a peroxisomal serine-glyoxylate transaminase). As in embryophytes, the NADH required for reduction of hydroxy pyruvate is produced by a peroxisomal NADH malate dehydrogenase.</p>
            <p>A glycolate oxidase activity was never detected in chlorophytes by biochemical enzyme assays, but one <it>Chlamydomonas </it>protein is currently annotated as a glycolate oxidase (gene model C_340068, JGI Chlamydomonas reinhardtii v2.0) We therefore performed a phylogenetic analysis for glycolate oxidases and lactate dehydrogenases, which are both members of the same protein superfamily, from embryophytes, <it>Mesostigma</it>, <it>Chlamydomonas</it>, <it>Cyanidioschyzon</it>, <it>Dictyostelium</it>, a few metazoans and some bacteria (Fig. <figr fid="F4">4</figr>). The glycolate oxidases from embryophytes, <it>Mesostigma </it>and <it>Cyanidioschyzon </it>are monophyletic. In contrast, the glycolate oxidase-like sequence from <it>Chlamydomonas </it>clusters with bacterial sequences, which are annotated as lactate dehydrogenase and glycolate oxidases. Therefore, we conclude that, in agreements with the biochemical findings, <it>Chlamydomonas </it>does not contain a plant-type peroxisomal glycolate oxidase.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Phylogenetic tree of glycolate oxidase and glycolate oxidase-like genes</p>
               </caption>
               <text>
                  <p><b>Phylogenetic tree of glycolate oxidase and glycolate oxidase-like genes</b>. The tree shown was derived by Bayesian inference analysis from 402 amino acid positions using a mixed model for amino acid substitutions and a gamma correction for rate variation among sites. The Bayesian inference utilized MRBAYES, Ver. 3.0 * with posterior probabilities derived from 100000 generations and discarding a burnin of 1000. The tree obtained with a parsimony analysis using PHYLIP gave essentially the same topology.</p>
               </text>
               <graphic file="1471-2229-6-2-4" hint_layout="single"/>
            </fig>
            <p>We did not find evidence for a hexokinase and sucrose biosynthesis in interphase cells of <it>Mesostigma</it>. Several ESTs represent plastidic pyruvate kinase, however, only a single EST coded for the cytosolic isoform. Expressed genes for PEP carboxylase and a cytosolic malate dehydrogenase are present, suggesting that malate may be the major substrate for respiration in the mitochondrion of <it>Mesostigma </it>as in many embryophytes. The plastidic pyruvate kinase probably functions in the generation of acetyl-CoA required to sustain fatty acid synthesis in plastids.</p>
            <p>Scales consist mainly of the 2-keto sugar acids 3-deoxy-manno-octulosonic acid (2-keto-3-deoxy-oktonate, kdo), 5OMekdo, 3-deoxy-lyxo-heptulosaric acid, dha) and gal, galA, gul and some minor monosaccharides <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Expressed genes coding for kdo synthesis, and activation of kdo as CMP-kdo are present. The obtained sequence similar to a CMP-sialA transporter might actually be the CMP-kdo transporter necessary for uptake of CMP-kdo into the Golgi apparatus, as kdo and sialA are structural analogs. Interestingly, kdo-synthase and CMP-kdo-transferase are among the most conserved proteins between <it>Mesostigma </it>and the embryophytes. As in embryophytes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, galA is synthesized via the UDP-glc dehydrogenase pathway and the myo-inositol oxygenase pathway. We could not detect the latter enzyme in <it>Chlamydomonas </it>or red algae.</p>
            <p>Our EST-data support the presence of vitamin B12-biosynthesis and the production of a phosphagen phosphoarginine by arginine kinase in <it>Mesostigma</it>.</p>
         </sec>
         <sec>
            <st>
               <p>Evolution of metabolism and cell structure</p>
            </st>
            <p>259 expressed genes from <it>Mesostigma </it>showed similarity to proteins belonging to various metabolic pathways. A pair-wise comparison of these genes with the genome of <it>Chlamydomonas </it>and the genomes and ESTs of the three embryophytes showed that <it>Mesostigma </it>shares more metabolic genes with the embryophytes than with <it>Chlamydomonas</it>, however, the overall AI is slightly higher with <it>Chlamydomonas </it>than with any embryophyte (AI, Table <tblr tid="T6">6</tblr>). Statistical analyses showed that the differences in AIs for the total metabolic enzyme data set are not significant (not shown). However, if we calculate AIs for different functional categories separately, we see that metabolic enzymes of the chloroplasts and mitochondria (photosynthesis except the Calvin cycle enzymes, fatty acid synthesis, synthesis of some amino acids, citric acid cycle, and respiration) were generally more conserved between <it>Mesostigma </it>and <it>Chlamydomonas </it>than between <it>Mesostigma </it>and the embryophytes (Table <tblr tid="T6">6</tblr>). In contrast, proteins of cytosolic pathways (nucleotide metabolism, NDP-sugar metabolism, and glycolysis) in <it>Mesostigma </it>were more similar to embryophyte proteins (Table <tblr tid="T6">6</tblr>),</p>
            <tbl id="T6" hint_layout="double">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Comparison of <it>Mesostigma </it>genes related to metabolic functions with Chlamydomonas and three embryophytes. The average identity (AI) of pair-wise comparisons of <it>Mesostigma </it>expressed genes coding for the indicated metabolic function with the ESTs or genome of the given organisms are presented.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Function<sup>1</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Chlamydomonas</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Physcomitrella</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Arabidopsis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Oryza</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Metabolism (259)</p>
                     </c>
                     <c ca="left">
                        <p>0.613</p>
                     </c>
                     <c ca="left">
                        <p>0.601</p>
                     </c>
                     <c ca="left">
                        <p>0.593</p>
                     </c>
                     <c ca="left">
                        <p>0.594</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Plastidic metabolism (107)<sup>2</sup></p>
                     </c>
                     <c ca="left">
                        <p>0,595</p>
                     </c>
                     <c ca="left">
                        <p>0,572</p>
                     </c>
                     <c ca="left">
                        <p>0,567</p>
                     </c>
                     <c ca="left">
                        <p>0,572</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Mitochondrial metabolism (16)<sup>2</sup></p>
                     </c>
                     <c ca="left">
                        <p>0,647</p>
                     </c>
                     <c ca="left">
                        <p>0,593</p>
                     </c>
                     <c ca="left">
                        <p>0,569</p>
                     </c>
                     <c ca="left">
                        <p>0,578</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>cytosolic metabolism (glycolysis, NDP-sugar metabolism, nucleotide synthesis, 22)<sup>2</sup></p>
                     </c>
                     <c ca="left">
                        <p>0,568</p>
                     </c>
                     <c ca="left">
                        <p>0,639</p>
                     </c>
                     <c ca="left">
                        <p>0,641</p>
                     </c>
                     <c ca="left">
                        <p>0,632</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>1 </sup>Numbers in brackets indicate the number of genes in this category.</p>
               </tblfn>
            </tbl>
            <p>Genes coding for information storage and processing, and cellular processes and signaling (Table <tblr tid="T3">3</tblr>) were overall more conserved between <it>Mesostigma </it>and the embryophytes than between <it>Mesostigma </it>and <it>Chlamydomonas</it>. Exceptions to this rule are proteins of the cytoskeleton (Table <tblr tid="T7">7</tblr>) and proteins involved in protein folding (chaperones, Table <tblr tid="T7">7</tblr>) and plastidic proteases (not shown), which show higher AI values with <it>Chlamydomonas </it>than with the embryophytes. If the cytoskeletal proteins are removed from the data set, the differences between <it>Mesostigma</it>/<it>Chlamydomonas </it>genome and <it>Mesostigma</it>/embryophytes are statistical significant (p = 0.000109 for <it>Mesostigma</it>/<it>Chlamydomonas </it>versus <it>Mesostigma</it>/<it>Physcomitrella</it>; p = 0.000703 for <it>Mesostigma</it>/<it>Chlamydomonas </it>versus <it>Mesostigma</it>/<it>Arabidopsis</it>, p = 0.006937 for <it>Mesostigma</it>/<it>Chlamydomonas </it>versus <it>Mesostigma</it>/<it>Oryza</it>). Remarkably, the three embryophytes behave differently in our analysis. We obtained higher AI values with <it>Physcomitrella </it>regarding the categories protein folding (chaperones), vesicular transport, transcription, and regulation (Table <tblr tid="T7">7</tblr>). In contrast, proteins related to DNA structure, replication, cell cycle and RNA-metabolism were more conserved between <it>Mesostigma </it>and the angiosperms <it>Arabidopsis </it>and <it>Oryza </it>than between <it>Mesostigma </it>and <it>Physcomitrella </it>(Table <tblr tid="T7">7</tblr>).</p>
            <tbl id="T7" hint_layout="double">
               <title>
                  <p>Table 7</p>
               </title>
               <caption>
                  <p>Comparison of <it>Mesostigma </it>genes related to cell structure functions with the genome or ESTs of Chlamydomonas and three embryophytes. The average identity of pair-wise comparisons of <it>Mesostigma </it>expressed genes coding for the indicated cellular functions with the ESTs or genomes of the given organisms are presented.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Function<sup>1</sup></p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Chlamydomonas</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Physcomitrella</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Arabidopsis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <it>Oryza</it>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cell structure (201)</p>
                     </c>
                     <c ca="left">
                        <p>0.629</p>
                     </c>
                     <c ca="left">
                        <p>0.641</p>
                     </c>
                     <c ca="left">
                        <p>0.627</p>
                     </c>
                     <c ca="left">
                        <p>0.618</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Cytoskeleton (7)</p>
                     </c>
                     <c ca="left">
                        <p>0.751</p>
                     </c>
                     <c ca="left">
                        <p>0.729</p>
                     </c>
                     <c ca="left">
                        <p>0.0.731</p>
                     </c>
                     <c ca="left">
                        <p>0.731</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Protein folding/Chaperones (21)</p>
                     </c>
                     <c ca="left">
                        <p>0.685</p>
                     </c>
                     <c ca="left">
                        <p>0.605</p>
                     </c>
                     <c ca="left">
                        <p>0.580</p>
                     </c>
                     <c ca="left">
                        <p>0.581</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Cytosolic protein degradation (22)</p>
                     </c>
                     <c ca="left">
                        <p>0.682</p>
                     </c>
                     <c ca="left">
                        <p>0.733</p>
                     </c>
                     <c ca="left">
                        <p>0.722</p>
                     </c>
                     <c ca="left">
                        <p>0.703</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Vesicular transport (22)</p>
                     </c>
                     <c ca="left">
                        <p>0.633</p>
                     </c>
                     <c ca="left">
                        <p>0.701</p>
                     </c>
                     <c ca="left">
                        <p>0.680</p>
                     </c>
                     <c ca="left">
                        <p>0.673</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Regulation (18)</p>
                     </c>
                     <c ca="left">
                        <p>0.585</p>
                     </c>
                     <c ca="left">
                        <p>0.599</p>
                     </c>
                     <c ca="left">
                        <p>0.548</p>
                     </c>
                     <c ca="left">
                        <p>0.544</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>DNA structure, replication, cell cycle (21)</p>
                     </c>
                     <c ca="left">
                        <p>0.580</p>
                     </c>
                     <c ca="left">
                        <p>0.616</p>
                     </c>
                     <c ca="left">
                        <p>0.655</p>
                     </c>
                     <c ca="left">
                        <p>0.625</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>Transcription (16)</p>
                     </c>
                     <c ca="left">
                        <p>0.684</p>
                     </c>
                     <c ca="left">
                        <p>0.675</p>
                     </c>
                     <c ca="left">
                        <p>0.647</p>
                     </c>
                     <c ca="left">
                        <p>0.623</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>RNA metabolism (23)</p>
                     </c>
                     <c ca="left">
                        <p>0.599</p>
                     </c>
                     <c ca="left">
                        <p>0.619</p>
                     </c>
                     <c ca="left">
                        <p>0.637</p>
                     </c>
                     <c ca="left">
                        <p>0.637</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>1 </sup>Numbers in brackets indicate the number of genes in this category.</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>In this study, we have analyzed about 3000 expressed genes from the scaly green flagellate <it>Mesostigma viride</it>. We compared the expressed genes with the complete genomes from the angiosperms <it>Arabidopsis thaliana </it>and <it>Oryza sativa</it>, the chlorophyte <it>Chlamydomonas reinhardtii</it>, the red alga <it>Cyanidioschyzon merolae </it>and the diatom <it>Thalassiosira pseudonana</it>, as well as the ESTs from the moss <it>Physcomitrella patens</it>, and the red alga <it>Porphyra yezoensis</it>. Altogether, the <it>Mesostigma </it>proteome is more similar to the embryophytes than to <it>Chlamydomonas</it>, although <it>Mesostigma </it>and <it>Chlamydomonas </it>are both flagellate unicells. <it>Mesostigma </it>shares more genes with the embryophytes than with <it>Chlamydomonas</it>, including several enzymes confined to the streptophytes (e.g. GAPDH B, [Cu-Zn] superoxide dismutase), and the average identity of shared proteins is higher between <it>Mesostigma </it>and the embryophytes than between <it>Mesostigma </it>and <it>Chlamydomonas</it>. Therefore, we consider <it>Mesostigma </it>to be a member of the streptophytes, although <it>Mesostigma </it>clearly shares some ancestral characters with chlorophytes. Plastidic (with the exception of the Calvin cycle) and mitochondrial functions e.g. seem to be more conserved between <it>Mesostigma </it>and chlorophytes than between <it>Mesostigma </it>and embryophytes, i.e. these functions are more derived in embryophytes, probably due to adaptation of embryophytes to the terrestrial habitat. In contrast, other cellular functions except for the cytoskeleton are more conserved between <it>Mesostigma </it>and embryophytes than between <it>Mesostigma </it>and <it>Chlamydomonas</it>. Interestingly, in previous phylogenetic analyses plastidic and mitochondrial genes failed to show a clear relationship between <it>Mesostigma </it>and the streptophytes <abbrgrp><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr></abbrgrp>, whereas actin and nuclear-encoded SSU rDNA phylogenies support the notion that <it>Mesostigma </it>is a member of the streptophytes <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. The different evolutionary rates for different cellular functions observed in this study might explain this discrepancy.</p>
         <p>We calculated the average identity (AI) values from automatically generated BLAST output alignments. Automatically derived alignments are prone to errors. However, we believe that our approach is justified for the following reasons: (1) the BLAST alignments cover only the conserved parts of proteins and our calculated AI values indicate that in most alignments more than half of the amino acids are identical enhancing the quality of the automatically produced alignments; (2) although small mistakes may occur, they are insignificant given the high number of amino acids used to calculate the AI. On average the BLAST alignments contained about 150 amino acids and therefore about 45,000 amino acid positions were used in the constrained data set. In large data sets small unbiased errors become irrelevant <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. Our results indicate that at least 100 (better are 150&#8211;200) expressed genes have to be used to obtain statistically significant results. It could be argued that our analysis uses only similarity values and no real evolutionary distances. AI values can be easily converted into evolutionary distances using an approximation given by Kimura <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, with the effect that the differences between the various organisms become larger but no changes occur in the order of relatedness (included in Table <tblr tid="T4">4</tblr>). We conclude that the AI of proteins shared between different organisms represents a reasonable measure of evolutionary relatedness, if sufficiently large data sets are used.</p>
         <p>In the following, we briefly discuss some major differences in coding potential observed between the different photosynthetic eukaryotic organisms.</p>
         <p>11 of 18 proteins included in supplemental Table 1 [see <supplr sid="S1">Additional file 1</supplr>] which are shared only by <it>Mesostigma </it>and <it>Chlamydomonas </it>are associated with flagellar functions such as axonemal dyneins or components of the IFT (intra-flagellar transport) machinery. Most likely, the angiosperms lost these proteins during evolution together with the ability to produce flagellate cells. The absence of these proteins in the ESTs from the moss <it>Physcomitrella</it>, is presumably due to the fact that ESTs from developing spermatozoids are not available.</p>
         <p>Proteins shared by <it>Mesostigma </it>and the embryophytes but not present in chlorophytes perform diverse functions. There are some well known biochemical differences between chlorophytes and streptophytes such as the presence of (Cu-Zn) superoxide dismutase <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp> and glycolate oxidase in streptophytes <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr></abbrgrp> but not in chlorophytes. In addition, streptophytes use the DXP and mevalonate pathways for isoprene biosynthesis whereas chlorophytes posses only the DXP pathway <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. For all these functions, we find molecular support in our expressed gene data set except for the mevalonate pathway of isoprene biosynthesis. Two genes matched two different enzymes of the DXP pathway; however, no matches for the MVA pathway were obtained, although the presence of this pathway has been demonstrated biochemically <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This could be due to the selective expression of one or the other pathway under different environmental conditions.</p>
         <p>Remarkably, our list of proteins uniquely shared by <it>Mesostigma </it>and the embryophytes includes several proteins involved in steroid biosynthesis (e.g. a 3-oxo-5-beta-steroid dehydrogenase and a C-4 sterol oxidase), a homeobox protein of the knox family and proteins of the F-box family. The latter protein family underwent a dramatic expansion in the embryophytes (<it>Arabidopsis </it>has more than 700 members of this family).</p>
         <p>Our expressed protein data set contains sequences similar to a protein involved in vitamin B-12 metabolism (present in rhodophytes and chlorophytes), an arginine kinase and a ARL6 protein, the latter two are absent in chlorophytes, embryophytes and red algae. It has been shown that arginine kinase is part of the ATP regeneration system in cilia of <it>Paramecium </it><abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. <it>Chlamydomonas </it>lacks arginine kinase and recently Pazour et al. <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> showed that enzymes of the late glycolytic pathway are present in the flagella of <it>Chlamydomonas</it>, suggesting that the ATP required for flagellar function is produced by the glycolytic pathway in <it>Chlamydomonas</it>. The ARL6 protein has been implicated in protein translocation at the rER <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, although its exact function is still not known.</p>
         <p>There are some typical embryophyte pathways that we failed to detect in <it>Mesostigma</it>, e.g. sucrose metabolism, hexokinase, and enzymes of cellulose biosynthesis. There are no reports about the presence of sucrose metabolism and hexokinase in green algae in the literature, whereas embryophyte-like Ces genes (catalytical subunit of cellulose synthase) have been reported in the streptophyte alga <it>Mesotaenium </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Although we cannot exclude that <it>Mesostigma </it>lost these genes, we do expect to find theses genes in the genome of <it>Mesostigma</it>.</p>
         <sec>
            <st>
               <p>Evolution of photosynthesis and photorespiration</p>
            </st>
            <p>It is well known that embryophytes and chlorophytes differ in important aspects of photosynthesis and its regulation, and in photorespiration (e.g., presence of GAPDHB, number of enzymes regulated by thioredoxin, glycolate oxidase vs. glycolate dehydrogenase, and presence or absence of (Cu-Zn) superoxide dismutase).</p>
            <p>Table <tblr tid="T8">8</tblr> summarizes the available information on the regulation of plastidic proteins by the thioredoxin system. The number of thioredoxin-regulated proteins has apparently increased during evolution and <it>Mesostigma </it>in this respect most closely resembles the embryophytes. Similarly, the peroxisomes of <it>Mesostigma </it>have been biochemically characterized as "leaf-type peroxisomes" <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> in full agreement with our EST-data. In contrast, chlorophytes lack glycolate oxidase and photorespiration involves only chloroplast and mitochondrial enzymes <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. Interestingly, red algae possess a peroxisomal glycolate oxidase whereas the other enzymes of the photorespiratory cycle are located in the mitochondrion <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Thus, it seems likely that at the onset of streptophyte evolution major changes occurred in the regulation of the Calvin cycle and the subcellular organization of photorespiration. What might have been the driving force for these changes? We note that rhodophytes and chlorophytes both presumably evolved in a marine environment [red algae in a coastal benthic habitat, whereas chlorophytes proliferated as marine phytoplankton <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>]. Streptophyte algae most likely originated in a freshwater/brackish environment. In contrast to their marine counterparts, they had to deal with much higher light intensities and fluctuating environmental conditions such as salinity and temperature. With higher temperature, the rate of photorespiration increases. The observed changes in regulation of the Calvin cycle and photorespiration might be adaptations to this stress. It is possible that these adaptations to a shallow freshwater/brackish environment prepared streptophytes to colonize the terrestrial habitat later during evolution. In this respect we note that in extant chlorophytes activation of carbon concentrating mechanisms (CCM) is the dominant reaction to compensate for increased photorespiratory losses <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. In contrast, streptophytes are able to channel large amounts of glycolate through the photorespiratory cycle <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. According to Badger and Price <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> CCMs did not evolve until 400 million years ago, long after streptophytes had evolved and the colonization of the terrestrial habitat by streptophyte algae took place. Therefore during the palaeozoic era with reduced CO<sub>2</sub>- and increased O<sub>2</sub>-levels <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> streptophyte algae might have had an advantage over chlorophyte algae allowing them to colonize the terrestrial habitat during that time.</p>
            <tbl id="T8" hint_layout="double">
               <title>
                  <p>Table 8</p>
               </title>
               <caption>
                  <p>Regulation of plastidic enzymes by the thioredoxin system. Proteins similar to embryophyte plastidic thioredoxin-regulated proteins were identified in the genomes of <it>Cyanidioschyzon</it>, <it>Chlamydomonas</it>, and the ESTs of <it>Mesostigma </it>using the BLASTP or BLASTX algorithms. A putative thioredoxin-regulated orthologue as revealed by the conserved cysteine residues is indicated with +. An asterisk indicates putative cyanobacterial/plastidic proteins, which do not contain the conserved cysteines required for thioredoxin-regulation. Missing enzymes are indicated with -.</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Cyanobacteria</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Rhodophytes <it>Cyanidioschyzon</it></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Chlorophytes <it>Chlamydomonas</it></b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Mesostigma</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Embryophytes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PRK</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SDPase</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>G6PDH</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>n.d.</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>FBPase</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>*<sup>1)</sup></p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>&#947;-ATPase</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GABDHB</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NADP-MDH</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>+<sup>2)</sup></p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>n.d.</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>+</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Rubisco activase</p>
                     </c>
                     <c ca="center">
                        <p>-<sup>3)</sup></p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>*</p>
                     </c>
                     <c ca="center">
                        <p>(+)<sup>4)</sup></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>n.d. not detected in Mesostigma. <sup>1) </sup>In <it>Galdieria </it>(<it>Cyanidioschyzon</it>) 2 (1) of the 3 conserved cysteines occurring in the Viridiplantae are present [48]. <sup>2) </sup>Chlorophyte NADP-malate dehydrogenase possesses a C- and N-terminal extension like the embryophyte enzyme, however only the C-terminal cysteines of the embryophyte enzyme are conserved [49, 50]. <sup>3) </sup>A few cyanobacteria contain an unusual rubisco activase. Only the central AAA<sup>+ </sup>domain shows similarity to plant rubisco activases, whereas the N and C terminal domain are very different [51]. <sup>4) </sup>Many angiosperms contain two forms of rubisco activase. Only the long form is regulated by the thioredoxin system [52].</p>
               </tblfn>
            </tbl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In summary, our EST analysis shows that <it>Mesostigma </it>shares more genes with the embryophytes than with the chlorophyte <it>Chlamydomonas reinhardtii</it>, although both organisms are flagellate unicells. Thus, it seems likely that many typical biochemical characteristics of streptophytes evolved early during the evolution of streptophytes, i.e. before the transition to land. Alternatively, such characteristics may haven been lost in the chlorophyte lineage or remain to be discovered in other chlorophytes. A decision between these alternatives requires further information on the genomes of other preferentially early branching chlorophytes such as <it>Pyramimonas</it>.</p>
         <p>Our EST-analysis of <it>Mesostigma </it>supports the notion that this organism should be a suitable unicellular model for the last flagellate common ancestor of the streptophytes.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Plant material, RNA preparation and construction of libraries</p>
            </st>
            <p><b>Total RNA </b>was isolated from cultures of <it>Mesostigma viride </it>Lauterborn (strain NIES 476, Tsukuba, Japan) and mRNA isolated using the mTRAP&#8482; Total Kit (Active Motif). 5 &#956;g of mRNA were converted into cDNA using the SuperScript&#8482; Plasmid System (Invitrogen) and the cDNA obtained was fractionated by column chromatography. A large and a small size fraction were cloned into the pSPORT1 vector (Invitrogen).</p>
            <p>Normalized full-length cDNA was prepared by Evrogen JSC (Moscow, Russia). cDNA was prepared from total RNA using the SMART approach <abbrgrp><abbr bid="B41">41</abbr></abbrgrp> normalized using the DSN normalization method <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> and then amplified by PCR. cDNAs were either directly cloned into a pPCR-Script Amp SK(+) Vector (PCR-Script Amp Cloning Kit, Stratagene) or a large size fraction was isolated by column chromatography and then cloned into a pGEM-T Easy vector (Promega). All libraries were transformed into TOP 10 <it>E. coli </it>cells (Invitrogen) by electroporation.</p>
         </sec>
         <sec>
            <st>
               <p>Sequencing, contig assembly and data analysis</p>
            </st>
            <sec>
               <st>
                  <p>Clone preparation and sequencing</p>
               </st>
               <p>Isolated plasmids were sequenced by the cycle sequencing method using an ABI3700 96 capillary sequencer. A minimal contig set was assembled using the phrap assembler and all contigs were manually curated.</p>
            </sec>
            <sec>
               <st>
                  <p>Annotation</p>
               </st>
               <p>Each contig was compared as 3-frame translations to the protein databases Swissprot and genpept using blastx. Furthermore, all contigs were compared using the tBLASTX search algorithm to the genome sequence of <it>Chlamydomonas reinhardtii</it>, <it>Cyanidioschyzon merolae</it>, <it>Thalassiosira pseudonana</it>, <it>Arabidopsis thaliana</it>, <it>Oryza sativa </it>and to the EST databases of <it>Physcomitrella patens </it>and <it>Porphyra yezoensis</it>. The results were compiled to an Excel compatible file. Analyses of COG and KOG categories <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> and Interpro protein domains <abbrgrp><abbr bid="B45">45</abbr></abbrgrp> for the contigs were also performed.</p>
            </sec>
            <sec>
               <st>
                  <p>Analysis of metabolic pathways</p>
               </st>
               <p>Using the metabolic pathways present at the AraCyc website <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, we identified all expressed genes with significant similarity to <it>Arabidopsis </it>genes present in AraCyc. Expressed genes that showed no significant similarity to <it>Arabidopsis </it>genes but to enzymes from other organisms were assigned to a pathway using the MetaCyc database <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Phylogenetic analysis of glycolate oxidase</p>
               </st>
               <p>Thirteen glycolate oxidase/lactate dehydrogenase sequences were obtained from public databases (<it>Cyanidioschyzon merolae </it>[KEGG:CMQ436C]; <it>Chlamydomonas reinhardtii </it>[JGI:C_340068]; <it>Spinacia oleracea </it>[Swiss-Prot:<ext-link ext-link-type="sprot" ext-link-id="P05414">P05414</ext-link>]; <it>Nostoc punctiforme </it>PCC 73102 [Genbank:<ext-link ext-link-type="gen" ext-link-id="ZP_00106740.1">ZP_00106740.1</ext-link>]; <it>Nostoc </it>sp. PCC 7120 [Genbank:<ext-link ext-link-type="gen" ext-link-id="BAB77694.1">BAB77694.1</ext-link>]; <it>Anabaena variabilis </it>ATCC 29413 [Genbank:<ext-link ext-link-type="gen" ext-link-id="ZP_00160276.2">ZP_00160276.2</ext-link>]; <it>Arabidopsis thaliana </it>[Genbank:<ext-link ext-link-type="gen" ext-link-id="CAB78838">CAB78838</ext-link>], <it>Oryza sativa </it>[Genbank:<ext-link ext-link-type="gen" ext-link-id="AAB82143">AAB82143</ext-link>], <it>Nicotiana tabacum </it>[Genbank:<ext-link ext-link-type="gen" ext-link-id="AAC33509">AAC33509</ext-link>], <it>Homo sapiens </it>[Genbank:<ext-link ext-link-type="gen" ext-link-id="CAC34364">CAC34364</ext-link>], <it>Drosophila melanogaster </it>[Genbank:<ext-link ext-link-type="gen" ext-link-id="AAO41411">AAO41411</ext-link>], <it>Dictyostelium discoideum </it>[Genbank:<ext-link ext-link-type="gen" ext-link-id="XP_629946">XP_629946</ext-link>], <it>Lactobacillus johnsonii </it>NCC 533 [Genbank:<ext-link ext-link-type="gen" ext-link-id="NP_965805">NP_965805</ext-link>]). The nearly complete <it>Mesostigma </it>glycolate oxidase sequence was obtained by complete sequencing of EST clone Meso2b12b08. The sequences were aligned using Clustal X. The alignment was checked manually. Phylogenetic analyses were performed using the Phylip (neighbour joining and parsimony method) and MRBAYES software v 3.0 (Bayesian inference).</p>
            </sec>
            <sec>
               <st>
                  <p>Data deposition</p>
               </st>
               <p>Sequence data from this article have been deposited with the EMBL/Genbank data libraries under accession numbers <ext-link ext-link-type="gen" ext-link-id="DN254242">DN254242</ext-link> to <ext-link ext-link-type="gen" ext-link-id="DN264595">DN264595</ext-link>.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AS constructed the libraries and participated in the analysis of the EST-data. GG sequenced the ESTs, participated in the analysis of the EST-data and helped to draft the manuscript. MF participated in the analysis of the EST-data. MM participated in the design of the study, the analysis of the EST-data and helped to draft the manuscript. BB conceived the study, and participated in its design, coordination and analysis of the EST-data, and helped to draft the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the DFG (Be1779/7-1).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular evidence on plant divergence times</p>
            </title>
            <aug>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Thorne</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Wikstrom</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Bremer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Am J Bot</source>
            <pubdate>2004</pubdate>
            <volume>91</volume>
            <fpage>1656</fpage>
            <lpage>1665</lpage>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Early evolution of land plants: Phylogeny, physiology, and ecology of the primary terrestrial radiation</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Crane</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>DiMichele</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Kenrick</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Rowe</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>Speck</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>WE</fnm>
               </au>
            </aug>
            <source>Annu Rev Ecol Syst</source>
            <pubdate>1998</pubdate>
            <volume>29</volume>
            <fpage>263</fpage>
            <lpage>292</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.ecolsys.29.1.263</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <aug>
               <au>
                  <snm>Kenrick</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Crane</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>The origin and early diversification of land plants</source>
            <publisher>Washington, London: Smithsonian Institution Press</publisher>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B4">
            <aug>
               <au>
                  <snm>Graham</snm>
                  <fnm>LE</fnm>
               </au>
            </aug>
            <source>Origin of Land Plants</source>
            <publisher>New York: John Wiley &amp; Sons, Inc</publisher>
            <pubdate>1993</pubdate>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Molecular adaptation and the origin of land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Waters</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>Molecular Phylogenetics and Evolution</source>
            <pubdate>2003</pubdate>
            <volume>29</volume>
            <fpage>456</fpage>
            <lpage>463</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.ympev.2003.07.018</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Summary of green plant phylogeny and classification</p>
            </title>
            <aug>
               <au>
                  <snm>Bremer</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cladistics</source>
            <pubdate>1985</pubdate>
            <volume>1</volume>
            <fpage>369</fpage>
            <lpage>385</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Classification of green algae: A concept based on comparative cytology</p>
            </title>
            <aug>
               <au>
                  <snm>Mattox</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>KD</fnm>
               </au>
            </aug>
            <source>Systematics of the green algae</source>
            <publisher>London: Academic Press</publisher>
            <editor>Irvine DEG, John DM</editor>
            <pubdate>1984</pubdate>
            <fpage>29</fpage>
            <lpage>72</lpage>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The basal position of scaly green flagellates among the green algae (Chlorophyta) is revealed by analyses of nuclear-encoded SSU rRNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Nakayama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Marin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Kranz</snm>
                  <fnm>HD</fnm>
               </au>
               <au>
                  <snm>Surek</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huss</snm>
                  <fnm>VAR</fnm>
               </au>
               <au>
                  <snm>Inouye</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Melkonian</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Protist</source>
            <pubdate>1998</pubdate>
            <volume>149</volume>
            <fpage>367</fpage>
            <lpage>380</lpage>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Charophyte evolution and the origin of land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Huss</snm>
                  <fnm>VAR</fnm>
               </au>
               <au>
                  <snm>Kranz</snm>
                  <fnm>HD</fnm>
               </au>
            </aug>
            <source>Plant Syst Evol</source>
            <pubdate>1997</pubdate>
            <fpage>103</fpage>
            <lpage>114</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The closest living relatives of land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Karol</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Cimino</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2001</pubdate>
            <volume>294</volume>
            <fpage>2351</fpage>
            <lpage>2353</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1065156</pubid>
                  <pubid idtype="pmpid" link="fulltext">11743201</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Actin phylogeny identifies Mesostigma viride as a flagellate ancestor of the land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Berning-Koch</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1998</pubdate>
            <volume>47</volume>
            <fpage>544</fpage>
            <lpage>550</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006410</pubid>
                  <pubid idtype="pmpid" link="fulltext">9797404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Mesostigmatophyceae, a new class of streptophyte green algae revealed by SSU rRNA sequence comparisons</p>
            </title>
            <aug>
               <au>
                  <snm>Marin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Melkonian</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Protist</source>
            <pubdate>1999</pubdate>
            <volume>150</volume>
            <fpage>399</fpage>
            <lpage>417</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10714774</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Phylogeny and Evolution of the Algae</p>
            </title>
            <aug>
               <au>
                  <snm>Melkonian</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Marin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Surek</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Biodiversity and Evolution</source>
            <publisher>Tokyo: The National Science Museum Foundation</publisher>
            <editor>Arai R, Kato M, Doi Y</editor>
            <pubdate>1995</pubdate>
            <fpage>153</fpage>
            <lpage>176</lpage>
         </bibl>
         <bibl id="B14">
            <title>
               <p>The Complete Mitochondrial DNA Sequence of Mesostigma viride Identifies This Green Alga as the Earliest Green Plant Divergence and Predicts a Highly Compact Mitochondrial Genome in the Ancestor of All Green Plants</p>
            </title>
            <aug>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>24</fpage>
            <lpage>38</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11752187</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Ancestral chloroplast genome in Mesostigma viride reveals an early branch of green plant evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Lemieux</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Otis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Turmel</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>403</volume>
            <fpage>649</fpage>
            <lpage>652</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35001059</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688199</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Phylogeny of the genus Coleochaete (Coleochaetales, Charophyta) and related taxa inferred by analysis of the chloroplast gene rbcL</p>
            </title>
            <aug>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Karol</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Cimino</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Sytsma</snm>
                  <fnm>KJ</fnm>
               </au>
            </aug>
            <source>J Phycol</source>
            <pubdate>2002</pubdate>
            <volume>38</volume>
            <fpage>394</fpage>
            <lpage>403</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1046/j.1529-8817.2002.01174.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Life with 25,000 genes</p>
            </title>
            <aug>
               <au>
                  <snm>Poethig</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2001</pubdate>
            <volume>11</volume>
            <fpage>313</fpage>
            <lpage>316</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.180001</pubid>
                  <pubid idtype="pmpid" link="fulltext">11230155</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The plant tree of life: An overview and some points of view</p>
            </title>
            <aug>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Soltis</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Chase</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Am J Bot</source>
            <pubdate>2004</pubdate>
            <volume>91</volume>
            <fpage>1437</fpage>
            <lpage>1445</lpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Green algae and the origin of land plants</p>
            </title>
            <aug>
               <au>
                  <snm>Lewis</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>American Journal of Botany</source>
            <pubdate>2004</pubdate>
            <volume>91</volume>
            <fpage>1535</fpage>
            <lpage>1556</lpage>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Charophyte algae and land plant origins</p>
            </title>
            <aug>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Delwiche</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Karol</snm>
                  <fnm>KG</fnm>
               </au>
            </aug>
            <source>Trends Ecol Evol</source>
            <pubdate>2004</pubdate>
            <volume>19</volume>
            <fpage>661</fpage>
            <lpage>666</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/j.tree.2004.09.013</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>The origin of plants: Body plan changes contributing to a major evolutionary radiation</p>
            </title>
            <aug>
               <au>
                  <snm>Graham</snm>
                  <fnm>LE</fnm>
               </au>
               <au>
                  <snm>Cook</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Busse</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>4535</fpage>
            <lpage>4540</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">34322</pubid>
                  <pubid idtype="pmpid" link="fulltext">10781058</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.9.4535</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <aug>
               <au>
                  <snm>Quinn</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Keough</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Experimental design and data analysis for biologists</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Unraveling plant metabolism by EST analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Ohlrogge</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Benning</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Curr Opin Plant Biol</source>
            <pubdate>2000</pubdate>
            <volume>3</volume>
            <fpage>224</fpage>
            <lpage>228</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10837264</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Subunit Structure of Higher-Plant Glyceraldehyde-3-Phosphate Dehydrogenases (Ec 1.2.1.12 and Ec 1.2.1.13)</p>
            </title>
            <aug>
               <au>
                  <snm>Cerff</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Chambers</snm>
                  <fnm>SE</fnm>
               </au>
            </aug>
            <source>J Biol Chem</source>
            <pubdate>1979</pubdate>
            <volume>254</volume>
            <fpage>6094</fpage>
            <lpage>6098</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">109446</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>2-keto-sugar acids in green flagellates: A chemical marker for prasinophycean scales</p>
            </title>
            <aug>
               <au>
                  <snm>Becker</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kamerling</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Melkonian</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>JPhycol</source>
            <pubdate>1991</pubdate>
            <volume>27</volume>
            <fpage>498</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.0022-3646.1991.00498.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The inositol oxygenase gene family of Arabidopsis is involved in the biosynthesis of nucleotide sugar precursors for cell-wall matrix polysaccharides</p>
            </title>
            <aug>
               <au>
                  <snm>Kanter</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Usadel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Guerineau</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Pauly</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tenhaken</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2005</pubdate>
            <volume>221</volume>
            <fpage>243</fpage>
            <lpage>254</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00425-004-1441-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15660207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phylogenomics and the reconstruction of the tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Delsuc</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature Reviews Genetics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>361</fpage>
            <lpage>375</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg1603</pubid>
                  <pubid idtype="pmpid" link="fulltext">15861208</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <aug>
               <au>
                  <snm>Kimura</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The Neutral Theory of Molecular Evolution</source>
            <publisher>Cambridge: Cambridge University Press</publisher>
            <pubdate>1983</pubdate>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Molecular evolution and structure-function relationships of the superoxide dismutase gene families in angiosperms and their relationship to other eukaryotic and prokaryotic superoxide dismutases</p>
            </title>
            <aug>
               <au>
                  <snm>Fink</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Scandalios</snm>
                  <fnm>JG</fnm>
               </au>
            </aug>
            <source>Arch Biochem Biophys</source>
            <pubdate>2002</pubdate>
            <volume>399</volume>
            <fpage>19</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/abbi.2001.2739</pubid>
                  <pubid idtype="pmpid" link="fulltext">11883900</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Taxonomic Distribution of Copper-Zinc Superoxide-Dismutase in Green-Algae and Its Phylogenetic Importance</p>
            </title>
            <aug>
               <au>
                  <snm>Dejesus</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Tabatabai</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Phycol</source>
            <pubdate>1989</pubdate>
            <volume>25</volume>
            <fpage>767</fpage>
            <lpage>772</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.0022-3646.1989.00767.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Occurrence of Glycolate Dehydrogenase and Glycolate Oxidase in Green Plants &#8211; Evolutionary Survey</p>
            </title>
            <aug>
               <au>
                  <snm>Frederics</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gruber</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Tolbert</snm>
                  <fnm>NE</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1973</pubdate>
            <volume>52</volume>
            <fpage>318</fpage>
            <lpage>323</lpage>
         </bibl>
         <bibl id="B32">
            <title>
               <p>The role of peroxisomes in the integration of metabolism and evolutionary diversity of photosynthetic organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Igamberdiev</snm>
                  <fnm>AU</fnm>
               </au>
               <au>
                  <snm>Lea</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Phytochemistry</source>
            <pubdate>2002</pubdate>
            <volume>60</volume>
            <fpage>651</fpage>
            <lpage>674</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0031-9422(02)00179-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12127583</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Chlorophyta exclusively use the 1-deoxyxylulose 5-phosphate/2- C-methylerythritol 4-phosphate pathway for the biosynthesis of isoprenoids</p>
            </title>
            <aug>
               <au>
                  <snm>Schwender</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gemunden</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lichtenthaler</snm>
                  <fnm>HK</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2001</pubdate>
            <volume>212</volume>
            <fpage>416</fpage>
            <lpage>423</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004250000409</pubid>
                  <pubid idtype="pmpid" link="fulltext">11289606</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>ATP-regenerating system in the cilia of Paramecium caudatum</p>
            </title>
            <aug>
               <au>
                  <snm>Noguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sawadas</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Akazawa</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Journal of Experimental Biology</source>
            <pubdate>2001</pubdate>
            <volume>204</volume>
            <fpage>1063</fpage>
            <lpage>1071</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11222125</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Proteomic analysis of a eukaryotic cilium</p>
            </title>
            <aug>
               <au>
                  <snm>Pazour</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Agrin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Leszyk</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Witman</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>J Cell Biol</source>
            <pubdate>2005</pubdate>
            <volume>170</volume>
            <fpage>103</fpage>
            <lpage>113</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1083/jcb.200504008</pubid>
                  <pubid idtype="pmpid" link="fulltext">15998802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Arf, Arl, Arp and Sar proteins: a family of GTP-binding proteins with a structural device for 'front-back' communication</p>
            </title>
            <aug>
               <au>
                  <snm>Pasqualato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Renault</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cherfils</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>EMBO Rep</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>1035</fpage>
            <lpage>1041</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/embo-reports/kvf221</pubid>
                  <pubid idtype="pmpid" link="fulltext">12429613</pubid>
                  <pubid idtype="pmcid">1307594</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Cellulose synthase (CesA) genes in algae and seedless plants</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Cellulose</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <fpage>419</fpage>
            <lpage>435</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/B:CELL.0000046418.01131.d3</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Glycolate metabolism in green algae</p>
            </title>
            <aug>
               <au>
                  <snm>Stabenau</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Winkler</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Physiol Plant</source>
            <pubdate>2005</pubdate>
            <volume>123</volume>
            <fpage>235</fpage>
            <lpage>245</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1399-3054.2005.00442.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>The evolution of modern eukaryotic phytoplankton</p>
            </title>
            <aug>
               <au>
                  <snm>Falkowski</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Katz</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Knoll</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Quigg</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Raven</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Schofield</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>FJR</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>305</volume>
            <fpage>354</fpage>
            <lpage>360</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1095964</pubid>
                  <pubid idtype="pmpid" link="fulltext">15256663</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>CO2 concentrating mechanisms in cyanobacteria: molecular components, their diversity and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Badger</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>GD</fnm>
               </au>
            </aug>
            <source>J Exp Bot</source>
            <pubdate>2003</pubdate>
            <volume>54</volume>
            <fpage>609</fpage>
            <lpage>622</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/jxb/erg076</pubid>
                  <pubid idtype="pmpid" link="fulltext">12554704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Reverse transcriptase template switching: a SMART approach for full-length cDNA library construction</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>YY</fnm>
               </au>
               <au>
                  <snm>Machleder</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Chenchik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Siebert</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Biotechniques</source>
            <pubdate>2001</pubdate>
            <volume>30</volume>
            <fpage>892</fpage>
            <lpage>897</lpage>
            <xrefbib>
               <pubid idtype="pmpid">11314272</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>A novel method for SNP detection using a new duplex-specific nuclease from crab hepatopancreas</p>
            </title>
            <aug>
               <au>
                  <snm>Shagin</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Rebrikov</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Kozhemyako</snm>
                  <fnm>VB</fnm>
               </au>
               <au>
                  <snm>Altshuler</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Shcheglov</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Zhulidov</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Bogdanova</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Staroverov</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Rasskazov</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Lukyanov</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>1935</fpage>
            <lpage>1942</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">187582</pubid>
                  <pubid idtype="pmpid" link="fulltext">12466298</pubid>
                  <pubid idtype="doi">10.1101/gr.547002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The COG database: an updated version includes eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sverdlov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Vasudevan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Bmc Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-41</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>The COG database: a tool for genome-scale analysis of protein functions and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Galperin</snm>
                  <fnm>MY</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>33</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">102395</pubid>
                  <pubid idtype="pmpid" link="fulltext">10592175</pubid>
                  <pubid idtype="doi">10.1093/nar/28.1.33</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The InterPro database, an integrated documentation resource for protein families, domains and functional sites</p>
            </title>
            <aug>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Attwood</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Biswas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bucher</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Corpet</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Croning</snm>
                  <fnm>MDR</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Falquet</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gouzy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hermjakob</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hulo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jonassen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Kahn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kanapin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Karavidopoulou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Marx</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Mulder</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Oinn</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Pagni</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Servant</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Sigrist</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Zdobnov</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>37</fpage>
            <lpage>40</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29841</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125043</pubid>
                  <pubid idtype="doi">10.1093/nar/29.1.37</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <url>http://www.arabidopsis.org/tools/aracyc/</url>
         </bibl>
         <bibl id="B47">
            <url>http://metacyc.org/META/server.html</url>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Regulation of chloroplast enzyme activities by thioredoxins: activation or relief from inhibition?</p>
            </title>
            <aug>
               <au>
                  <snm>Ruelland</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Miginiac-Maslow</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>1999</pubdate>
            <volume>4</volume>
            <fpage>136</fpage>
            <lpage>141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1360-1385(99)01391-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">10322547</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Light-modulated NADP-malate dehydrogenases from mossfern and green algae: insights into evolution of the enzyme's regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Ocheretina</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Haferkamp</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tellioglu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Scheibe</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2000</pubdate>
            <volume>258</volume>
            <fpage>147</fpage>
            <lpage>154</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0378-1119(00)00409-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">11111052</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>NADP-malate dehydrogenase from unicellular green alga Chlamydomonas reinhardtii. A first step toward redox regulation</p>
            </title>
            <aug>
               <au>
                  <snm>Lemaire</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Quesada</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Merchan</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Corral</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Igeno</snm>
                  <fnm>MI</fnm>
               </au>
               <au>
                  <snm>Keryer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Issakidis-Bourguet</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hirasawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Knaff</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Miginiac-Maslow</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2005</pubdate>
            <volume>137</volume>
            <fpage>514</fpage>
            <lpage>521</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1065352</pubid>
                  <pubid idtype="pmpid" link="fulltext">15579663</pubid>
                  <pubid idtype="doi">10.1104/pp.104.052670</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Rubisco activase &#8211; Rubisco's catalytic chaperone</p>
            </title>
            <aug>
               <au>
                  <snm>Portis</snm>
                  <fnm>AR</fnm>
               </au>
            </aug>
            <source>Photosynth Res</source>
            <pubdate>2003</pubdate>
            <volume>75</volume>
            <fpage>11</fpage>
            <lpage>27</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1022458108678</pubid>
                  <pubid idtype="pmpid">16245090</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Plant thioredoxin systems revisited</p>
            </title>
            <aug>
               <au>
                  <snm>Schurmann</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Jacquot</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Physiol Plant Molec Biol</source>
            <pubdate>2000</pubdate>
            <volume>51</volume>
            <fpage>371</fpage>
            <lpage>400</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1146/annurev.arplant.51.1.371</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
