<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2229-9-99</ui>
   <ji>1471-2229</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>The cellulose synthase superfamily in fully sequenced plants and algae</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Yin</snm>
               <fnm>Yanbin</fnm>
               <insr iid="I1"/>
               <email>yinyb@csbl.bmb.uga.edu</email>
            </au>
            <au id="A2">
               <snm>Huang</snm>
               <fnm>Jinling</fnm>
               <insr iid="I2"/>
               <email>HUANGJ@ecu.edu</email>
            </au>
            <au ca="yes" id="A3">
               <snm>Xu</snm>
               <fnm>Ying</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>xyn@bmb.uga.edu</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Computational System Biology Lab, Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, BioEnergy Science Center, USA</p>
            </ins>
            <ins id="I2">
               <p>Department of Biology, Howell Science Complex, East Carolina University, Greenville, NC 27858, USA</p>
            </ins>
            <ins id="I3">
               <p>College of Computer Science and Technology, Jilin University, Changchun, Jilin, PR China</p>
            </ins>
         </insg>
         <source>BMC Plant Biology</source>
         <issn>1471-2229</issn>
         <pubdate>2009</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>99</fpage>
         <url>http://www.biomedcentral.com/1471-2229/9/99</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1471-2229-9-99</pubid>
               <pubid idtype="pmpid">19646250</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>9</day>
               <month>1</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>31</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>31</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2009</year>
         <collab>Yin et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The cellulose synthase superfamily has been classified into nine cellulose synthase-like (Csl) families and one cellulose synthase (CesA) family. The Csl families have been proposed to be involved in the synthesis of the backbones of hemicelluloses of plant cell walls. With 17 plant and algal genomes fully sequenced, we sought to conduct a genome-wide and systematic investigation of this superfamily through in-depth phylogenetic analyses.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A single-copy gene is found in the six chlorophyte green algae, which is most closely related to the CslA and CslC families that are present in the seven land plants investigated in our analyses. Six proteins from poplar, grape and sorghum form a distinct family (CslJ), providing further support for the conclusions from two recent studies. CslB/E/G/H/J families have evolved significantly more rapidly than their widely distributed relatives, and tend to have intragenomic duplications, in particular in the grape genome.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our data suggest that the CslA and CslC families originated through an ancient gene duplication event in land plants. We speculate that the single-copy Csl gene in green algae may encode a mannan synthase. We confirm that the rest of the Csl families have a different evolutionary origin than CslA and CslC, and have proposed a model for the divergence order among them. Our study provides new insights about the evolution of this important gene family in plants.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>The first plant gene encoding a cellulose synthase catalytic subunit (CesA) was identified in 1996 in cotton based on its sequence similarity to a bacterial CesA <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. In 2000, Richmond and Somerville identified 10 CesA genes and 31 cellulose synthases-like (Csl) genes in Arabidopsis, which were further classified into one CesA family and six Csl families (CslA/B/C/D/E/G) based on phylogenetic analyses <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. Since then, the whole CesA and Csl gene repertoire has been cataloged in fully sequenced plants, including rice <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, poplar <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp> and the moss <it>Physcomitrella patens </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. Additional CesA and Csl genes have also been found in diverse and not fully sequenced land plants such as maize <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, barley <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> and pine <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>; CesAs have been identified in streptophyte green algae such as <it>Mesotaenium caldariorum </it><abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp> and in red alga <it>Porphyra yezoensis </it><abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp> as well. Two additional Csl families (CslF and CslH) were found in these studies; together with the other six Csl families and one CesA family, they comprise the CesA superfamily.</p>
         <p>The CesA superfamily genes are among the most important players involved in the biosynthesis of plant cell walls, which are mainly composed of biopolymers such as celluloses, hemicelluloses, pectins and lignins. Because the Csl genes share sequence similarities with the CesA genes, they are hypothesized to be involved in the biosynthesis of the backbone of various polysaccharide polymers <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, in particular hemicelluloses <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This so-called "CSL hypothesis" has been supported by recent experimental studies, which suggest that the CslA genes encode the mannan synthases <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>, the CslF and CslH genes encode the mixed linkage glucan synthases <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr></abbrgrp>, and the CslC genes are probably involved in the xyloglucan biosynthesis <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Therefore the backbone synthases of all major hemicellulose classes except for xylans are known. However, the functional roles of the other Csl families (CslB/D/E/G) remain unclear.</p>
         <p>The phylogenetic classification and the function of the CesA superfamily were reviewed by Lerouxel <it>et al</it>. in 2006 <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and since then there have been a few updates in terms of the phylogenetic analyses of these important genes. Fincher <it>et al</it>. have found a new Csl family (CslJ) in cereals <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Roberts and Bushoven have mined the <it>P. pattens </it>genomic and EST data and found CesA, CslA, CslC and CslD genes in this lower plant <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>; their phylogenetic analyses revealed that seven <it>P. patens </it>CesA genes form a monophyletic clade by themselves and there are no one-to-one orthologs in the moss corresponding to the Arabidopsis CesA triplet subunits (CesA1/3/6 for the primary cell wall and CesA4/7/8 for the secondary cell wall). Furthermore, comprehensive phylogenetic analyses of the plant CesA superfamily by including CesAs from other organismal groups (e.g., bacteria, fungi and animals) indicated that plant CslA and CslC genes have a different origin than the remaining plant genes <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Evidences have been reported that these remaining genes of the CesA superfamily were anciently acquired from cyanobacteria <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>. It was proposed <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> that the plant CslG genes evolved first, followed by the CslE, CslB, CesA and CslD/F genes. However, a more recent study could not find homologs of the CslG/E/B/H/F genes in <it>P. patens </it><abbrgrp><abbr bid="B6">6</abbr></abbrgrp>, suggesting that these Csl families are narrowly distributed and unlikely to be the earliest evolved.</p>
         <p>To date 17 plant and algal genomes have been fully or nearly fully sequenced, and their gene prediction and annotation are publicly available (Table <tblr tid="T1">1</tblr>). The availability of these genomes and their annotated genes facilitates comparative genomic studies of plants, making it possible to address major plant biology questions <it>in silico </it><abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. We have performed comparative analyses of the CesA superfamily genes in the 17 sequenced plant and algal genomes. Our goals are to define CesA and Csl gene homologs across these genomes and to investigate the evolution of different Csl gene families. We have built a catalog of all the Csl genes and classified them phylogenetically. The gene structure, the evolutionary rate, and the distribution of the Csl families across different genomes are also studied. Throughout this paper, we use Csl genes to denote all cellulose synthases-like genes including CesAs.</p>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Plant and algal genomes used in the present study</p>
            </caption>
            <tblbdy cols="6">
               <r>
                  <c ca="left">
                     <p>Index</p>
                  </c>
                  <c ca="left">
                     <p>Abbr.</p>
                  </c>
                  <c ca="left">
                     <p>Clade</p>
                  </c>
                  <c ca="left">
                     <p>Species</p>
                  </c>
                  <c ca="left">
                     <p>Genome Published/Released</p>
                  </c>
                  <c ca="left">
                     <p>Csl Published?</p>
                  </c>
               </r>
               <r>
                  <c cspan="6">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1</p>
                  </c>
                  <c ca="left">
                     <p>Tp</p>
                  </c>
                  <c ca="left">
                     <p>Diatom</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Thalassiosira pseudonana</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B25">25</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>2</p>
                  </c>
                  <c ca="left">
                     <p>Pht</p>
                  </c>
                  <c ca="left">
                     <p>Diatom</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Phaeodactylum tricornutum</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>JGI</p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>3</p>
                  </c>
                  <c ca="left">
                     <p>Pa</p>
                  </c>
                  <c ca="left">
                     <p>brown tide algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Aureococcus anophagefferens</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>JGI</p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>4</p>
                  </c>
                  <c ca="left">
                     <p>Cm</p>
                  </c>
                  <c ca="left">
                     <p>red algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Cyanidioschyzon merolae</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B26">26</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>5</p>
                  </c>
                  <c ca="left">
                     <p>Mpc</p>
                  </c>
                  <c ca="left">
                     <p>green algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Micromonas pusilla CCMP1545</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B27">27</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>6</p>
                  </c>
                  <c ca="left">
                     <p>Mpr</p>
                  </c>
                  <c ca="left">
                     <p>green algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Micromonas strain RCC299</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B27">27</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>7</p>
                  </c>
                  <c ca="left">
                     <p>Ol</p>
                  </c>
                  <c ca="left">
                     <p>green algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Ostreococcus lucimarinus</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B28">28</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>8</p>
                  </c>
                  <c ca="left">
                     <p>Ot</p>
                  </c>
                  <c ca="left">
                     <p>green algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Ostreococcus tauri</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B29">29</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>9</p>
                  </c>
                  <c ca="left">
                     <p>Cr</p>
                  </c>
                  <c ca="left">
                     <p>green algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Chlamydomonas reinhardtii</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B30">30</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>10</p>
                  </c>
                  <c ca="left">
                     <p>Vc</p>
                  </c>
                  <c ca="left">
                     <p>green algae</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Volvox carteri f. nagariensis</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>JGI</p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>11</p>
                  </c>
                  <c ca="left">
                     <p>Pp</p>
                  </c>
                  <c ca="left">
                     <p>moss</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Physcomitrella patens ssp patens</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B31">31</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B6">6</abbr>
                        </abbrgrp>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>12</p>
                  </c>
                  <c ca="left">
                     <p>Sm</p>
                  </c>
                  <c ca="left">
                     <p>spike moss</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Selaginella moellendorffii</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>JGI</p>
                  </c>
                  <c ca="left">
                     <p>N <abbrgrp><abbr bid="B32">32</abbr></abbrgrp></p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>13</p>
                  </c>
                  <c ca="left">
                     <p>Pt</p>
                  </c>
                  <c ca="left">
                     <p>dicot</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Populus trichocarpa</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B33">33</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B5">5</abbr>
                        </abbrgrp>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>14</p>
                  </c>
                  <c ca="left">
                     <p>At</p>
                  </c>
                  <c ca="left">
                     <p>dicot</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Arabidopsis thaliana</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B34">34</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B2">2</abbr>
                        </abbrgrp>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>15</p>
                  </c>
                  <c ca="left">
                     <p>Vv</p>
                  </c>
                  <c ca="left">
                     <p>dicot</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Vitis vinifera</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B35">35</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>16</p>
                  </c>
                  <c ca="left">
                     <p>Os</p>
                  </c>
                  <c ca="left">
                     <p>monocot</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Oryza sativa</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B36">36</abbr>
                           <abbr bid="B37">37</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B3">3</abbr>
                        </abbrgrp>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>17</p>
                  </c>
                  <c ca="left">
                     <p>Sb</p>
                  </c>
                  <c ca="left">
                     <p>monocot</p>
                  </c>
                  <c ca="left">
                     <p>
                        <it>Sorghum bicolor</it>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <abbrgrp>
                           <abbr bid="B38">38</abbr>
                        </abbrgrp>
                     </p>
                  </c>
                  <c ca="left">
                     <p>N</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Identification of Csl proteins</p>
            </st>
            <p>We analyzed the Csl genes in the 17 genomes based on BLAST <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> and HMMER <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> searches (see Methods for details). We found that all predicted Csl proteins contain either the Pfam Cellulose_synt domain (PF03552, 898 aa long) or the glycosyltransferase family 2 (GT2) domain (PF00535, 149 aa long) but not both. For example, all CslA and CslC proteins in Arabidopsis have the GT2 domain but not the Cellulose_synt domain, while all other Arabidopsis Csl proteins contain only the Cellulose_synt domain, which is consistent with a previous finding that CslA and CslC have a different origin than the other Csl families <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We then grouped all proteins with the GT2 domain into a set denoted as the GT2 dataset, and those with the Cellulose_synt domain into the Cellulose_synt dataset. These two datasets do not share any common proteins.</p>
            <p>By querying the Pfam Cellulose_synt domain, we identified two cyanobacterial proteins from the fully sequenced bacterial genomes. These two proteins were originally identified by Nobles <it>et al</it>. in 2001 <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, and here are used to root the eukaryotic Cellulose_synt phylogeny.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic classification of Csl families</p>
            </st>
            <p>To characterize the identified Csl genes from the 17 genomes, we built phylogenetic trees for each of the two aforementioned datasets, based on the multiple sequence alignments of both the full length proteins and the conserved Pfam domains (see Methods for details). Figures <figr fid="F1">1A</figr> and <figr fid="F1">1B</figr> show the un-rooted maximum likelihood (ML) trees for the Cellulose_synt dataset and the GT2 dataset, respectively. The number and the species information of the genes included in the phylogeny are given in Table <tblr tid="T2">2</tblr>. Neighbor joining (NJ) trees for the two datasets were also constructed and given in the Additional file <supplr sid="S1">1</supplr> [see Additional file <supplr sid="S1">1</supplr>].</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>NJ phylogeny built by using MEGA4</b>. See Legend of Figure <figr fid="F1">1</figr>.</p>
               </text>
               <file name="1471-2229-9-99-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The maximum likelihood (ML) phylogenies of the Csl families</p>
               </caption>
               <text>
                  <p><b>The maximum likelihood (ML) phylogenies of the Csl families</b>. a) 217 plant proteins (211 proteins if alternatively splicing variants from Arabidopsis and rice are excluded) that have the Pfam Cellulose_synt domain were used to construct this tree. b) 88 proteins (83 proteins if alternatively splicing variants are excluded) that have the Pfam GT2 domain were used to construct the tree. Both the full length protein sequences and the conserved Pfam domain regions were used in the phylogeny reconstruction and the corresponding bootstrap values are shown and split by '/'.</p>
               </text>
               <graphic file="1471-2229-9-99-1"/>
            </fig>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Sizes (number of genes in each family) of Csl families in 17 plant and algal genomes</p>
               </caption>
               <tblbdy cols="13">
                  <r>
                     <c ca="center">
                        <p>Abbr. <sup>a)</sup></p>
                     </c>
                     <c ca="center">
                        <p>GenomeSize<sup>b)</sup></p>
                     </c>
                     <c ca="center">
                        <p>Sum.</p>
                     </c>
                     <c ca="center">
                        <p>CesA</p>
                     </c>
                     <c ca="center">
                        <p>CslD</p>
                     </c>
                     <c ca="center">
                        <p>CslF</p>
                     </c>
                     <c ca="center">
                        <p>CslB</p>
                     </c>
                     <c ca="center">
                        <p>CslH</p>
                     </c>
                     <c ca="center">
                        <p>CslE</p>
                     </c>
                     <c ca="center">
                        <p>CslG</p>
                     </c>
                     <c ca="center">
                        <p>CslJ</p>
                     </c>
                     <c ca="center">
                        <p>CslA</p>
                     </c>
                     <c ca="center">
                        <p>CslC</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Tp</p>
                     </c>
                     <c ca="center">
                        <p>11390</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Pht</p>
                     </c>
                     <c ca="center">
                        <p>10025</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Aa</p>
                     </c>
                     <c ca="center">
                        <p>11501</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cm</p>
                     </c>
                     <c ca="center">
                        <p>5014</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Mpc</p>
                     </c>
                     <c ca="center">
                        <p>10475</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Mpr</p>
                     </c>
                     <c ca="center">
                        <p>9815</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Ol</p>
                     </c>
                     <c ca="center">
                        <p>7651</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Ot</p>
                     </c>
                     <c ca="center">
                        <p>7725</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Cr</p>
                     </c>
                     <c ca="center">
                        <p>14598</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Vc</p>
                     </c>
                     <c ca="center">
                        <p>15544</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Pp</p>
                     </c>
                     <c ca="center">
                        <p>35938</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sm</p>
                     </c>
                     <c ca="center">
                        <p>34697</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Pt</p>
                     </c>
                     <c ca="center">
                        <p>58036</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>At</p>
                     </c>
                     <c ca="center">
                        <p>31921</p>
                     </c>
                     <c ca="center">
                        <p>39<sup>c)</sup></p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Vv</p>
                     </c>
                     <c ca="center">
                        <p>30434</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Os</p>
                     </c>
                     <c ca="center">
                        <p>66710</p>
                     </c>
                     <c ca="center">
                        <p>44</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Sb</p>
                     </c>
                     <c ca="center">
                        <p>35899</p>
                     </c>
                     <c ca="center">
                        <p>49</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>-</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="center">
                        <p>Total</p>
                     </c>
                     <c ca="center">
                        <p>294</p>
                     </c>
                     <c ca="center">
                        <p>79</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="13">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c cspan="2" ca="center">
                        <p>All</p>
                     </c>
                     <c ca="center">
                        <p>294</p>
                     </c>
                     <c cspan="8" ca="center">
                        <p>211</p>
                     </c>
                     <c cspan="2" ca="center">
                        <p>83</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a) </sup>See Table 1 for species full names</p>
                  <p><sup>b) </sup>Genome size is measured as the number of protein coding genes in each genome.</p>
                  <p><sup>c) </sup>AtCslA1 was not identified in our hmmsearch (see Methods for details), and thus the number of Csl genes in Arabidopsis is shown to be 39 instead of 40.</p>
               </tblfn>
            </tbl>
            <p>The number of proteins in each Csl family, calculated based on their groupings as shown in Figure <figr fid="F1">1</figr>, is given in Table <tblr tid="T2">2</tblr>. No Csl genes were detected in the three stramenopile algae (two diatoms: <it>Thalassiosira pseudonana</it>, <it>Phaeodactylum tricornutum </it>and one brown tide alga: <it>Aureococcus anophagefferens</it>) and the red alga <it>Cyanidioschyzon merolae</it>. Only one single-copy gene, which is most closely related to CslA/C, was identified in each of the six green algae. Overall, 77 proteins from land plants have the GT2 Pfam domain, which are assigned to either the CslA or the CslC family (Figure <figr fid="F1">1b</figr>). Six additional proteins from green algae also have the GT2 Pfam domain and are the most homologous to both the land plant CslA and CslC genes. The Cellulose_synt Pfam domain is found in 211 proteins, which are assigned to the other Csl families (Figure <figr fid="F1">1a</figr>). Based on their taxonomic distribution and the phylogenies shown in Figure <figr fid="F1">1</figr>, we have divided the 294 identified Csl proteins into three categories. The first category consists of CslA and CslC, the most conserved Csl families in our analyses. These families are found in all seven land plants and all six green algae sampled in this study. The second category contains three phylogenetically related families, CesA, CslD and CslF (Figure <figr fid="F1">1a</figr>). Among these families, CslF is specific to grasses while CesA and CslD are present across all the sampled land plants, including the two lower plants (<it>P. patens </it>of bryophytes and <it>S. moellendorffii </it>of lycophytes in Table <tblr tid="T1">1</tblr>). The last category includes the remaining Csl families, which appear to be confined to the five sampled seed plants (Table <tblr tid="T2">2</tblr>). Such a restricted distribution suggests a likely late origin of these Csl families and possible functional roles specific to seed plants.</p>
         </sec>
         <sec>
            <st>
               <p>CslJ represents a new Csl family that is found in both monocot and dicot plants</p>
            </st>
            <p>Figure <figr fid="F1">1</figr> shows all the nine previously known Csl families <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, namely CesA and CslA/B/C/D/E/F/G/H, each forming a well-supported group. A new Csl family was discovered very recently in cereals and named CslJ <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. Here we found this family not only in sorghum but also in poplar and grape, suggesting that it is another seed plant-specific Csl family in addition to the CslE family. The sorghum sequence (Sb03g047220) was found in both this study and previous papers <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>. According to the phylogeny (Figure <figr fid="F1">1a</figr>; [see Additional file <supplr sid="S1">1</supplr>]), CslJ is closely related to CslG. We have examined if the CslJ family contains functional genes, knowing that neither Arabidopsis nor rice contains this family (Table <tblr tid="T2">2</tblr>). The following evidence suggests that genes of this family are not likely to be pseudogenes. First, our Ka/Ks analysis (see Methods for details) shows that members of this family are under negative selection (median of Ka/Ks = 0.23, Figure <figr fid="F2">2</figr>). Second, all members of this family (Table <tblr tid="T3">3</tblr>) have EST sequences in the NCBI EST database, indicating that these genes are expressed. And third, members of this gene family have also been found in unfinished genomes such as barley, wheat and maize <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Estimation of evolutionary rates of different Csl families</p>
               </caption>
               <text>
                  <p><b>Estimation of evolutionary rates of different Csl families</b>. If the Ka/Ks ratio for a protein is less than one (less amino acid replacements than silent base substitutions), it means the protein is under negative selection; otherwise if Ka/Ks>1, it means the protein is under positive selection. We used the model = 1 implemented in codeml of PAML to compute Ka/Ks, which allows each gene in the tree evolving at its own rate; therefore within each family (tree) a different Ka/Ks value for each gene is obtained (see Methods for details). The distribution of the Ka/Ks values of genes of each Csl family is plotted side by side, showing the minimum, the 25% percentile, the median, the 75% percentile and the maximum values of Ka/Ks. The boxes are drawn with widths proportional to the square-roots of the number of genes in the groups. A notch is drawn on each side of the box towards the median. For panel a) we used Csl genes from all the seven land plants; for b), c) and d) we extracted and plotted genes from only subsets of the seven genomes, namely, the five seed plants, the three dicot plants and the two monocot plants, respectively.</p>
               </text>
               <graphic file="1471-2229-9-99-2"/>
            </fig>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Expression of members of the CslJ family</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>CslJ gene ID</p>
                     </c>
                     <c ca="left">
                        <p>NCBI accession numbers of ESTs</p>
                     </c>
                     <c ca="left">
                        <p>UniGene</p>
                     </c>
                     <c ca="left">
                        <p>Tissue/Organ</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>fgenesh1_pg.C_LG_X000708</p>
                     </c>
                     <c ca="left">
                        <p>DB888819.1, CV257302.1, DB906752.1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Mixture of leaf, bud, stem, root</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>estExt_fgenesh1_pg_v1.C_LG_X0702</p>
                     </c>
                     <c ca="left">
                        <p>DB885869.1, DN497067.1, BU871140.1, AJ772607.1, DN487448.1, AJ770380.1, DB903904.1, AJ772118.1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Dormant bud, mixture of leaf, bud, stem, root</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sb03g047220</p>
                     </c>
                     <c ca="left">
                        <p>CF430961.1, CF431079.1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Nitrogen-deficient seedlings</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GSVIVP00020164001</p>
                     </c>
                     <c ca="left">
                        <p>CF211163.1, EE094868.1, EE097022.1, EC990611.1, CV100631.1, CF983720.1, EE086006.1, EC925887.1, DT021105.1, EC927377.1, CF211254.1, DT010825.1, CF210160.1, CN006709.1, CF515516.1, CF515427.1, CF210083.1, EE093327.1, EE093253.1, CV179236.1</p>
                     </c>
                     <c ca="left">
                        <p>Vvi.14469</p>
                     </c>
                     <c ca="left">
                        <p>Fruit; flower; leaf; mixed; cell culture</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GSVIVP00020168001</p>
                     </c>
                     <c ca="left">
                        <p>FC063595.1, EC948646.1, DT004980.1</p>
                     </c>
                     <c ca="left">
                        <p>Vvi.20726</p>
                     </c>
                     <c ca="left">
                        <p>Flower, leaf and root</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GSVIVP00020169001</p>
                     </c>
                     <c ca="left">
                        <p>EE094198.1, CF983803.1, EE100185.1</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Leaf and berry</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Protein sequences of the CslJ members (first column) were queried to search against the NCBI EST database. With E-value cutoff &lt;= 1e-2, EST matches that are >98% identical to the query protein and are from the same species as the query were collected and listed in the second column. These ESTs matches were manually checked at the NCBI website to find UniGene links (third column) and tissue/organ expression information (fourth column) if they have.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>A green algae-specific family</p>
            </st>
            <p>Six green algal Csl genes (each green alga contains a single-copy of this gene; see Table <tblr tid="T2">2</tblr>) form a well-supported group distinct from the land plant CslA and CslC genes (Figure <figr fid="F1">1b</figr>). Inasmuch this green algal Csl group possibly represents the most homologous genes of the common ancestor of CslA and CslC in all the land plants.</p>
         </sec>
         <sec>
            <st>
               <p>Evolutionary rate of Csl families</p>
            </st>
            <p>The ratio between Ka and Ks (see Methods for details) has been widely used to measure the selection pressure on proteins <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Generally, a lower selection pressure indicates a higher evolutionary rate. Using Ka, Ks and the Ka/Ks ratio as proxies, people have found significant correlations between protein evolutionary rates and numerous features derivable from genome sequences, such as the number of protein-protein interaction partners, gene expression levels, the essentiality of a gene and the number of paralogs of a gene <abbrgrp><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp> (and papers cited therein). In particular, genes that originated relatively recently through gene duplications or by other mechanisms usually evolve more rapidly than the more ancient genes <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. In order to compare the relative evolutionary rates among the Csl families that may have come into being through duplications <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, we have calculated the Ka/Ks ratios (see Methods for details) and conducted rigorous statistical analyses on the computational results.</p>
            <p>The comparisons of the distribution of the Ka/Ks ratios across different Csl families are shown in Figure <figr fid="F2">2a&#8211;d</figr>, from which it is clear that the narrowly distributed Csl families (CslB/H/E/G/J) tend to have higher Ka/Ks ratios than those widely distributed Csl families (CslA/C/D and CesA). This observation is statistically supported by pair-wise Wilcoxon tests (Table <tblr tid="T4">4a&#8211;d</tblr>). Interestingly, this observation remains to be true when we compared the genes of the Csl families across different groupings of the plant genomes, namely across (a) all the seven land plants (Figure <figr fid="F2">2a</figr>), (b) the five seed plants (excluding moss and spike moss from (a); Figure <figr fid="F2">2b</figr>), (c) the three dicot plants (Figure <figr fid="F2">2c</figr>), (d) the two monocot plants (Figure <figr fid="F2">2d</figr>), and (e) each of the seven plants (data not shown). Specifically, we found that the dicot-specific CslB and CslG families have evolved the most rapidly, followed by the monocot-specific CslH, and then by the seed plant-specific CslE and CslJ. In addition, the monocot-specific CslF has evolved significantly more rapidly than CslC and CesA, but not than CslA and CslD (Table <tblr tid="T4">4d</tblr>). Overall, we found that the CslB/H/E/G/J families have evolved more rapidly than the other Csl families, which lends further support for the hypothesis that these families might have diversified relatively recently to acquire new functions specific to the seed plants.</p>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Two sample (pair-wise) nonparametric Wilcoxon test P values (Csl family in the column vs. in the row)</p>
               </caption>
               <tblbdy cols="10">
                  <r>
                     <c ca="left">
                        <p>All plants (a)</p>
                     </c>
                     <c ca="left">
                        <p>CslA</p>
                     </c>
                     <c ca="left">
                        <p>CslC</p>
                     </c>
                     <c ca="left">
                        <p>cesA</p>
                     </c>
                     <c ca="left">
                        <p>CslD</p>
                     </c>
                     <c ca="left">
                        <p>Seed plants (b)</p>
                     </c>
                     <c ca="left">
                        <p>CslA</p>
                     </c>
                     <c ca="left">
                        <p>CslC</p>
                     </c>
                     <c ca="left">
                        <p>cesA</p>
                     </c>
                     <c ca="left">
                        <p>CslD</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslF</p>
                     </c>
                     <c ca="left">
                        <p>0.1584*</p>
                     </c>
                     <c ca="left">
                        <p>2.76e-05</p>
                     </c>
                     <c ca="left">
                        <p>7.44e-07</p>
                     </c>
                     <c ca="left">
                        <p>0.0009</p>
                     </c>
                     <c ca="left">
                        <p>CslF</p>
                     </c>
                     <c ca="left">
                        <p>0.2707*</p>
                     </c>
                     <c ca="left">
                        <p>5.66e-06</p>
                     </c>
                     <c ca="left">
                        <p>1.26e-07</p>
                     </c>
                     <c ca="left">
                        <p>0.0065</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslB</p>
                     </c>
                     <c ca="left">
                        <p>3.36e-05</p>
                     </c>
                     <c ca="left">
                        <p>1.10e-06</p>
                     </c>
                     <c ca="left">
                        <p>5.56e-08</p>
                     </c>
                     <c ca="left">
                        <p>2.05e-06</p>
                     </c>
                     <c ca="left">
                        <p>CslB</p>
                     </c>
                     <c ca="left">
                        <p>3.52e-05</p>
                     </c>
                     <c ca="left">
                        <p>4.93e-10</p>
                     </c>
                     <c ca="left">
                        <p>1.78e-08</p>
                     </c>
                     <c ca="left">
                        <p>3.93e-08</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslH</p>
                     </c>
                     <c ca="left">
                        <p>0.0307</p>
                     </c>
                     <c ca="left">
                        <p>0.0012</p>
                     </c>
                     <c ca="left">
                        <p>0.0006</p>
                     </c>
                     <c ca="left">
                        <p>0.0033</p>
                     </c>
                     <c ca="left">
                        <p>CslH</p>
                     </c>
                     <c ca="left">
                        <p>0.0349</p>
                     </c>
                     <c ca="left">
                        <p>7.06e-05</p>
                     </c>
                     <c ca="left">
                        <p>0.0002</p>
                     </c>
                     <c ca="left">
                        <p>0.0020</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslE</p>
                     </c>
                     <c ca="left">
                        <p>0.0017</p>
                     </c>
                     <c ca="left">
                        <p>8.94e-07</p>
                     </c>
                     <c ca="left">
                        <p>3.64e-08</p>
                     </c>
                     <c ca="left">
                        <p>6.22e-06</p>
                     </c>
                     <c ca="left">
                        <p>CslE</p>
                     </c>
                     <c ca="left">
                        <p>0.0028</p>
                     </c>
                     <c ca="left">
                        <p>7.27e-09</p>
                     </c>
                     <c ca="left">
                        <p>3.43e-09</p>
                     </c>
                     <c ca="left">
                        <p>3.46e-06</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslG</p>
                     </c>
                     <c ca="left">
                        <p>3.97e-05</p>
                     </c>
                     <c ca="left">
                        <p>1.84e-07</p>
                     </c>
                     <c ca="left">
                        <p>2.94e-09</p>
                     </c>
                     <c ca="left">
                        <p>3.34e-07</p>
                     </c>
                     <c ca="left">
                        <p>CslG</p>
                     </c>
                     <c ca="left">
                        <p>5.26e-05</p>
                     </c>
                     <c ca="left">
                        <p>1.07e-10</p>
                     </c>
                     <c ca="left">
                        <p>5.71e-10</p>
                     </c>
                     <c ca="left">
                        <p>1.21e-08</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslJ</p>
                     </c>
                     <c ca="left">
                        <p>0.2187*</p>
                     </c>
                     <c ca="left">
                        <p>0.0261</p>
                     </c>
                     <c ca="left">
                        <p>0.0156</p>
                     </c>
                     <c ca="left">
                        <p>0.0463</p>
                     </c>
                     <c ca="left">
                        <p>CslJ</p>
                     </c>
                     <c ca="left">
                        <p>0.2554*</p>
                     </c>
                     <c ca="left">
                        <p>0.0178</p>
                     </c>
                     <c ca="left">
                        <p>0.0010</p>
                     </c>
                     <c ca="left">
                        <p>0.0805*</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c cspan="10">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dicot plants (c)</p>
                     </c>
                     <c ca="left">
                        <p>CslA</p>
                     </c>
                     <c ca="left">
                        <p>CslC</p>
                     </c>
                     <c ca="left">
                        <p>cesA</p>
                     </c>
                     <c ca="left">
                        <p>CslD</p>
                     </c>
                     <c ca="left">
                        <p>Monocot plants (d)</p>
                     </c>
                     <c ca="left">
                        <p>CslA</p>
                     </c>
                     <c ca="left">
                        <p>CslC</p>
                     </c>
                     <c ca="left">
                        <p>cesA</p>
                     </c>
                     <c ca="left">
                        <p>CslD</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslB</p>
                     </c>
                     <c ca="left">
                        <p>0.0010</p>
                     </c>
                     <c ca="left">
                        <p>4.99e-08</p>
                     </c>
                     <c ca="left">
                        <p>4.97e-08</p>
                     </c>
                     <c ca="left">
                        <p>3.23e-08</p>
                     </c>
                     <c ca="left">
                        <p>CslF</p>
                     </c>
                     <c ca="left">
                        <p>0.3520*</p>
                     </c>
                     <c ca="left">
                        <p>0.0008</p>
                     </c>
                     <c ca="left">
                        <p>8.69e-06</p>
                     </c>
                     <c ca="left">
                        <p>0.0521*</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslE</p>
                     </c>
                     <c ca="left">
                        <p>0.0185</p>
                     </c>
                     <c ca="left">
                        <p>2.04e-06</p>
                     </c>
                     <c ca="left">
                        <p>9.67e-07</p>
                     </c>
                     <c ca="left">
                        <p>0.0001</p>
                     </c>
                     <c ca="left">
                        <p>CslH</p>
                     </c>
                     <c ca="left">
                        <p>0.0399</p>
                     </c>
                     <c ca="left">
                        <p>0.0019</p>
                     </c>
                     <c ca="left">
                        <p>0.0013</p>
                     </c>
                     <c ca="left">
                        <p>0.0063</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslG</p>
                     </c>
                     <c ca="left">
                        <p>0.0017</p>
                     </c>
                     <c ca="left">
                        <p>3.77e-09</p>
                     </c>
                     <c ca="left">
                        <p>2.13e-09</p>
                     </c>
                     <c ca="left">
                        <p>3.18e-08</p>
                     </c>
                     <c ca="left">
                        <p>CslE</p>
                     </c>
                     <c ca="left">
                        <p>0.0386</p>
                     </c>
                     <c ca="left">
                        <p>0.0024</p>
                     </c>
                     <c ca="left">
                        <p>0.0007</p>
                     </c>
                     <c ca="left">
                        <p>0.0112</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CslJ</p>
                     </c>
                     <c ca="left">
                        <p>0.0933*</p>
                     </c>
                     <c ca="left">
                        <p>0.0003</p>
                     </c>
                     <c ca="left">
                        <p>0.0006</p>
                     </c>
                     <c ca="left">
                        <p>0.0123</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>In a hypothesis testing, a P-value is usually calculated to indicate if the null hypothesis is statistically supported. In this case, genes of Csl families in the column are tested against those in the row in terms of their Ka/Ks values. The null hypothesis is that the column is equal to the row, and the alternative hypothesis is that the column is larger than the row. If the P-value is less than 0.05, the null hypothesis is rejected and the alternative hypothesis is statistically significantly supported. This table shows that except for a very few cases (those with asterisks), the columns are always significantly larger than the rows.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Comparative study of individual Csl families</p>
            </st>
            <p>We have compared the Csl genes across different plant genomes by inspecting the phylogeny of each family and analyzing the gene structures. We present here a detailed comparative analysis of the less studied and narrowly distributed CslB/H/E/G/J families. The analyses of the other Csl families are given in the Additional file <supplr sid="S2">2</supplr> [see Additional file <supplr sid="S2">2</supplr>].</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p>Comparative study of CesA, CslA, CslC, CslD and CslF genes.</p>
               </text>
               <file name="1471-2229-9-99-S2.zip">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <sec>
               <st>
                  <p>A) CslE family</p>
               </st>
               <p>The phylogeny and the gene structure of the CslE family are shown in Figure <figr fid="F3">3</figr>. While there is only one CslE gene in Arabidopsis, nine copies of the gene are found in the recently sequenced grape genome (Table <tblr tid="T1">1</tblr>). Six of these grape genes form a monophyletic group, and are located on the chromosome in tandem within a ~45 kb region, which apparently resulted from a tandem duplication event. All these grape genes have Ka/Ks ratios between 0.24~0.33 (under purifying selection) and have EST data in GenBank, suggesting that they are functionally active. The grass genes form a large group on the phylogeny, suggesting that CslE diverged separately after the split of dicots and monocots.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>The subtree of the CslE family and gene structures of the family members</p>
                  </caption>
                  <text>
                     <p><b>The subtree of the CslE family and gene structures of the family members</b>. This ML phylogeny is taken from Figure 1a. The gene structure was plotted using the GSDS server <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. The branch length is scaled, i.e. proportional to the estimated number of molecular change. A scale bar is shown under the tree. The bootstrap values are shown to indicate the confidence level of the grouping. The intron-exon structure is shown on the right. The intron phase indicates the position of the intron within a codon. If it is not located within a codon (or located between two codons), the phase is 0. If it is located within a codon (or split a codon into two exons) and after the first base of the codon, the phase is 1, otherwise the phase is 2.</p>
                  </text>
                  <graphic file="1471-2229-9-99-3"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>B) CslG and CslJ families</p>
               </st>
               <p>These two families are closely related to each other in the phylogeny shown in Figure <figr fid="F1">1</figr>. Genes of the dicot-specific CslG family <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> form mostly genome-specific monophyletic clusters (except for CslG-II that has a grape gene), and have conserved gene structures. This suggests that the CslG family diversified separately in each dicot plant via intragenomic duplications. Similar to CslE, this family is also substantially expanded in the grape genome possibly as a result of tandem gene duplications (Table <tblr tid="T2">2</tblr> and Figure <figr fid="F4">4</figr>). The 14 grape CslG genes have Ka/Ks ratios between 0.20~0.65; nine of them have ESTs in GenBank, and one has more than ten identical ESTs (UniGene entry Vvi.9751). The EST data indicate that this gene is expressed mostly in grape leaves, although further experimental studies are clearly needed to determine if any of these duplicated genes is actually functional.CslJ is possibly a new family according to recent studies <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr></abbrgrp> and our current analyses. Although CslJ is closely related to CslG, these two families have very different gene structures (Figure <figr fid="F4">4</figr>). In addition, unlike other families, the CslJ genes appear to be less conserved in their gene structures.</p>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>The subtree of the CslG/J families and gene structures of the family members</p>
                  </caption>
                  <text>
                     <p><b>The subtree of the CslG/J families and gene structures of the family members</b>. See the legend of Figure 3.</p>
                  </text>
                  <graphic file="1471-2229-9-99-4"/>
               </fig>
            </sec>
            <sec>
               <st>
                  <p>C) CslB and CslH families</p>
               </st>
               <p>These two families are phylogenetically related as shown in Figure <figr fid="F1">1</figr>. CslB is a dicot-specific family while CslH is monocot-specific <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. We have observed that the paralogous genes of these two families tend to form clusters and have similar gene structures (except for Os04g35020.1) (Figure <figr fid="F5">5</figr>), suggesting the possibility of independent genome-specific duplications followed by subsequent sequence divergence.</p>
               <fig id="F5">
                  <title>
                     <p>Figure 5</p>
                  </title>
                  <caption>
                     <p>The subtree of the CslB/H families and gene structures of the family members</p>
                  </caption>
                  <text>
                     <p><b>The subtree of the CslB/H families and gene structures of the family members</b>. See the legend of Figure 3.</p>
                  </text>
                  <graphic file="1471-2229-9-99-5"/>
               </fig>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The cellulose synthase-like gene families are among the most important players involved in the formation of plant cell walls. Some Csl families have been found to be responsible for the synthesis of the backbone of hemicelluloses, while the others are yet to be functionally characterized <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This study focuses on the computational identification and classification of the Csl families in fully or nearly fully sequenced plant and algal genomes. Our phylogenetic analyses have led to the identification of the Csl orthologs across these genomes and the generation of new insights about how they might have evolved in plants and algae.</p>
         <sec>
            <st>
               <p>Limitation in identifying CesAs forming linear terminal complexes (TCs)</p>
            </st>
            <p>Our analyses did not detect any CesA and Csl genes in the three stramenopile algal genomes nor in the red algal genome, but this does not necessarily mean that no stramenopiles or red algae contain these families. In fact, a CesA gene has been identified very recently in an yet sequenced red alga <it>Porphyra yezoensis </it><abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. Actually, the cellulose synthases, which assemble into the so-called terminal complex (TC), have been found across all classes of organisms. TCs can be morphologically classified into rosette TCs or linear TCs <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>, for each of which the component CesAs have rather different domain structures <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. While the rosette TCs have been extensively studied and found in all seed plants, the linear TCs may be the most ancient, given their wider distribution across bacteria, fungi, animals and many classes of algae including stramenopiles <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B50">50</abbr></abbrgrp>.</p>
            <p>Since the focus of this study is on CesAs that form rosette TCs, we have adopted a rather conservative filtration procedure (see Methods for details), which may have excluded some GT2 proteins that are possibly forming linear TCs. For instance, one GT2 gene from <it>P. patens </it>and seven GT2 genes from <it>S. moellendorfii </it>were removed by our filtration procedure (data not shown) but have been found to be similar to the CesA genes of cyanobacteria and the red alga <it>Porphyra yezoensis </it><abbrgrp><abbr bid="B32">32</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>The green algal CslA/C-like genes and their possible functions</p>
            </st>
            <p>Despite all the previous studies about the distribution of CesAs, very little has been done to identify the other Csl families across different organisms. So what is known about these families is fragmented at the best. For example, a sequence fragment from <it>Chlamydomonas </it>was reported to have diverged early from the CslA and CslC families <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. In our study, we found that all the six sampled chlorophyte green algae have a single-copy gene that is the most homologous to the land plant CslA and CslC families. Our phylogenetic analysis indicates that the CslA and the CslC families might have evolved via a gene duplication event that occurred uniquely in land plants after they split from green algae (Figure <figr fid="F1">1b</figr>).</p>
            <p>CslAs and CslCs have been characterized to encode mannan synthases <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp> and xyloglucan synthases <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, respectively. It is tempting to speculate that the mannan synthesis might represent the ancestral function for the single-copy CslA/C-like genes in green algae, since mannan is present in both charophyte algae and chlorophyte algae while xyloglucan is absent from these green algae <abbrgrp><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>The divergence order of Csl families</p>
            </st>
            <p>Recent studies have shown that the plant CesA genes and some other Csl families are likely of cyanobacterial origin <abbrgrp><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>, possibly as a result of intracellular gene transfer from plastids (or cyanobacterial endosymbionts). In a Csl gene phylogeny built by Nobles and Brown, CslG was suggested to be the first Csl family evolved in plants, followed by CslE, CslB, CesA and CslD <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. However, our search of fully sequenced plant and algal genomes has shown that CslG is among the most narrowly distributed Csl families, and is absent from our sampled algae and lower plants (Table <tblr tid="T2">2</tblr>). Intuitively, such a distribution pattern suggests that CslG might not be the earliest plant Csl family.</p>
            <p>When constructing the phylogeny of Csl families (Figure <figr fid="F6">6</figr>), we included two cyanobacterial CesA protein sequences (YP_322086.1 from <it>Anabaena variabilis ATCC 29413 </it>and NP_487797.1 from <it>Nostoc sp. PCC 7120</it>) as the out-group of the plant CesA and Csl genes. These sequences are the only two hits (see the first section of Results) in our hmmsearch with the Pfam Cellulose_synt domain against all fully sequenced prokaryotic genomes (with E-value cutoff &lt; 1.0). These two cyanobacterial sequences are phylogenetically more closely related to the land plant rosette-TC-forming CesAs than to the other bacterial linear-TC-forming CesAs <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, and thus have been proposed as the progenitor of all the rosette-TC-forming CesAs and Csl genes. The rooted Csl phylogeny (Figure <figr fid="F6">6</figr>) suggests a major early split between CesA/CslD/F and the other Csl families (at node I). Clearly neither CslG nor any of the other Csl families could be considered as the earliest in plants, because the cyanobacterial CesAs are not placed closer to any particular plant Csl family than to the others. Instead, the Csl phylogeny suggests that, after the establishment of plastids (or cyanobacterial endosymbionts) in the ancestral plant, the cyanobacterial derived CesA gene might have undergone several rounds of sequence and function divergence (see nodes I to IV in Figure <figr fid="F6">6</figr>). The evolutionary relationship of CslB/H to the other families is less clear from our phylogeny since the grouping of CslB/H with CslE/G/J has only a modest support (61% bootstrap value).</p>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>A phylogeny of the Csl families based on the multiple sequence alignment of 219 full-length protein sequences</p>
               </caption>
               <text>
                  <p><b>A phylogeny of the Csl families based on the multiple sequence alignment of 219 full-length protein sequences</b>. 217 proteins were used in Figure 1a. Two cyanobacterial sequences (YP_322086.1 from <it>Anabaena variabilis ATCC 29413 </it>and NP_487797.1 from <it>Nostoc sp. PCC 7120</it>) were used as the out-group to root the phylogeny.</p>
               </text>
               <graphic file="1471-2229-9-99-6"/>
            </fig>
            <p>The finding that CslB/H/E/G/J split earlier than CslD/F (I <it>vs</it>. III) is surprising, because CslD is found in lower plants (moss and spike moss) whereas CslB/H/E/G/J are not. One plausible explanation is that some ancestral genes of the CslB/H/E/G/J families might have been present in the most recent common ancestor of the land plants, but were lost in moss and spike moss later. To test this hypothesis, we performed a TFASTY <abbrgrp><abbr bid="B53">53</abbr></abbrgrp> search of the Csl proteins against the <it>P. patens </it>genomic sequences and the <it>S. moellendorffii </it>genomic sequences (Table <tblr tid="T2">2</tblr>), respectively, after masking out all predicted genes. Interestingly we found that some short sequence fragments are more similar to CslB/E/G than to the other Csl families, suggesting the possibility that they are the remnants of CslB/H/E/G/J. For example, the <it>P. patens </it>scaffold_21 region from position 2584724 to position 2584398 (in base pair) is 28% identical (51% similar) to the poplar CslB protein eugene3.00141027 from position 7 to position 109 (in amino acid), whose Cellulose_synt pfam domain is from position 9 to position 747; and the <it>S. moellendorffii </it>scaffold_577 region from position 192 to position 741 is 27% identical (52% similar) to the rice CslE protein Os09g30120.1 from position 533 to position 708, whose Cellulose_synt pfam domain is from position 15 to position 737. Similar searches against green algal genomic DNAs also found some homologous regions to the land plant CslB/H/E/G/J genes.</p>
         </sec>
         <sec>
            <st>
               <p>The seed plant-specific CslB/H/E/G/J families</p>
            </st>
            <p>Although the common ancestor of the CslB/H/E/G/J families may have split from the other Csl families during the early evolution of plants, their diversification might have occurred recently. This is supported by the narrow distribution of CslB and CslG in dicots and CslH in monocots. Additionally, we have shown that all these five families have evolved rapidly (Figure <figr fid="F2">2</figr>). Paralogous genes of these families typically form a monophyletic group in our phylogenies and have highly similar gene structures (Figure <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>). These findings suggest that recent intragenomic duplications have played a major role in the rapid sequence and functional diversification of these families <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. A simple TBLASTN <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> search of the Csl proteins against the masked plant genomic DNA sequences (with all annotated gene models masked) found many homologous DNA fragments, indicating that they are likely pseudogenic relics after gene or genome duplications. A detailed analysis of these DNA fragments could possibly lead to a deeper understanding about the evolution of these Csl families.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The cellulose synthase superfamily is identified and phylogenetically analyzed in fully sequenced plant and algal genomes. We conclude that 1) a CslJ family is present in both monocot and dicot plants, confirming two previous reports that it is a new Csl family, and that 2) a green algae-specific Csl family is most homologous to both land plant CslA and CslC families and it is speculated to be a mannan synthase, and that 3) CslG may not be the first evolved Csl family and a new model is proposed in regard to the evolution order of different Csl families.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data sources</p>
            </st>
            <p>We downloaded the genome, proteome, and gene prediction and annotation data for the 17 genomes from various sources (Table <tblr tid="T1">1</tblr>). Specifically, the Arabidopsis data were from The Arabidopsis Information Resource (TAIR version 7.0) <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>, rice data from The Institute for Genomic Research (TIGR version 5.0) <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>, grape data from <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>, red algae data from <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, and data for all the others are from Joint Genome Institute (JGI) as of Dec. 2007. The Arabidopsis and rice proteome data include alternatively splicing variants, while all the other proteomes do not. We included these splicing variants of the two organisms in our analyses, but counted them as one gene in our statistics in Table <tblr tid="T2">2</tblr>. For instance, AtCslA3 has three known splicing variants (AT1G23480.1, AT1G23480.2 and AT1G23480.3); all these three proteins were included in our phylogenetic analyses, although we count them as one single gene in Table <tblr tid="T2">2</tblr>. We specified these variants in our gene structure figures such as Figures <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>. In addition, 597 fully sequenced prokaryotic genomes were downloaded from <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> as of Dec. 2007.</p>
         </sec>
         <sec>
            <st>
               <p>BLAST search</p>
            </st>
            <p>We downloaded known Csl sequences from the Cell Wall Navigator database <abbrgrp><abbr bid="B59">59</abbr><abbr bid="B60">60</abbr></abbrgrp>, including all known Arabidopsis and rice Csl genes as well as sequences from other species in UniProt <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. We took this data set as the initial query to search against the annotated protein sequences of the 17 genomes.</p>
         </sec>
         <sec>
            <st>
               <p>HMMER search</p>
            </st>
            <p>There are two Pfam <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> domain models for the Csl proteins: PF03552 (Cellulose_synt) and PF00535 (Glycos_transf_2 or GT2), both of which were searched in our analyses. In addition, we searched bacterial cellulose catalytic domain model PF03170 (BcsB) but did not find any significant hits (with E-value cutoff &lt; 1.0) in plants and algae. We ran hmmsearch against proteins of the 17 genomes by querying these HMM models in the ls mode (global with respect to query domain and local with respect to hit protein; see details in the manual of HMMER package). We also performed hmmsearch against the 597 prokaryotic genomes.</p>
         </sec>
         <sec>
            <st>
               <p>Selection of homologs</p>
            </st>
            <p>We have processed the search hits obtained from the above BLAST and HMMER searches in order to build an accurate Csl gene catalog for each genome:</p>
            <sec>
               <st>
                  <p>a) Intersection of search results</p>
               </st>
               <p>We removed all the BLAST and hmmsearch hits with E-value higher than 1.0, and kept only the hits returned by both BLAST and hmmsearch; that is, the final hits should be similar to the query Csl genes in the pair-wise sequence comparison and contain either of the two conserved Pfam domains. We used E-value cutoff &lt; 1.0 for both BLAST and hmmsearch because, under this condition, all known Arabidopsis Csl proteins except for AtCslA1/AT4G16590 (therefore in Table <tblr tid="T2">2</tblr> the number of Arabidopsis Csl genes is 39) were successfully retrieved and no false positives were found. In addition, under this condition, the best Arabidopsis homologs of all the identified Csl sequences are known Csl genes.</p>
            </sec>
            <sec>
               <st>
                  <p>b) Further filtration</p>
               </st>
               <p>We searched the candidate Csl proteins against the 17 genomes. For the true Csl genes, we expect to see their top hits in the candidate Csl gene list. For each candidate, we manually inspected its top 10 non-self hits: a candidate was dropped if fewer than eight of the top 10 non-self hits were in the candidate list. A few additional sequences were found to be not Csl genes in the subsequent more rigorous phylogenetic analyses and were removed. The FASTA format sequences of all the finally identified CesA and Csl genes are given in the Additional file <supplr sid="S3">3</supplr> [see Additional file <supplr sid="S3">3</supplr>].</p>
               <suppl id="S3">
                  <title>
                     <p>Additional file 3</p>
                  </title>
                  <text>
                     <p><b>The fasta format sequences of all Csl genes</b>. The description line is in the format: >ID|original ID|Csl family.</p>
                  </text>
                  <file name="1471-2229-9-99-S3.fa">
                     <p>Click here for file</p>
                  </file>
               </suppl>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>Two datasets were prepared for our phylogenetic analysis: protein sequences that contain the PF00535 (GT2) domain and those that have the PF03552 (Cellulose_synt) domain. Multiple protein sequence alignments (MSAs) were performed on both the full length regions and the conserved Pfam domains for the two datasets. MAFFT <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> was used in these alignments using two highly accurate methods: L-INS-i and E-INS-i. L-INS-i is considered to be the most accurate MSA method <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>, and E-INS-i performs well on sequences with large unalignable regions (see manual of MAFFT). The resulting MSAs were manually edited to remove gaps and ambiguously aligned regions. We have also inspected the MSAs for the presence of the DXD and D, D, D, QXXRW motifs that are characteristic of possessive &#946;-glycosyltransferases <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. The original MSAs, the edited MSAs and the resulting phylogenetic trees are all available in the Additional file <supplr sid="S4">4</supplr> [see Additional file <supplr sid="S4">4</supplr>].</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p>The original MSAs, the edited MSAs and the resulting phylogenetic trees.</p>
               </text>
               <file name="1471-2229-9-99-S4.zip">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The ProtTest v1.4 package <abbrgrp><abbr bid="B66">66</abbr></abbrgrp> was run on the computed MSAs to select the best-fit models for phylogenetic analyses. We found the combination of JTT+I+G+F models to be the best one for our phylogeny reconstruction. The maximum likelihood (ML) trees were built using PhyML <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, while neighbor-joining (NJ) trees were built using MEGA4 <abbrgrp><abbr bid="B68">68</abbr></abbrgrp> considering the above models. Specifically, PhyML analyses were conducted using the JTT model, 100 replicates of bootstraps, an estimated proportion of the invariable sites (I), four rate categories, an estimated gamma distribution parameter (G), and optimized starting BIONJ tree. MEGA analyses were conducted using the JTT substitution model, 500 replicates of bootstrap, pair-wise detection of gaps or missing data, gamma distributed rate among sites and the gamma parameter set at 1.0 (G).</p>
         </sec>
         <sec>
            <st>
               <p>Evolutionary rate computation</p>
            </st>
            <p>The evolutionary rate of proteins can be estimated by calculating the evolutionary distances that are often measured by calculating the ratio of the number of nonsynonymous substitutions per nonsynonymous site (Ka) and the number of synonymous substitutions per synonymous site (Ks) <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. For each Csl family, in order to obtain the longest possible alignment, we firstly built an initial multiple sequence alignment on the full length Csl protein sequences, and then manually examined the alignment to remove fragmental sequences that introduced long gaps into the alignment (e.g. for those less well annotated genomes, the predicted protein sequences are often of low quality and fragmented). We then rebuilt the multiple sequence alignment on the remaining sequences and reconstructed an un-rooted ML tree.</p>
            <p>We transformed the amino acid sequence alignment into codon sequence alignment by using pal2nal <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. The coding sequences were obtained from the downloaded genome data. The maximum likelihood estimation of Ks, Ka and Ka/Ks values for each gene family was conducted by running codeml in PAML <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>, using the above tree and the codon alignment as the input. The computation of Ka/Ks ratios for a group of genes based on their phylogeny is conducted under the assumption that each gene evolves at an independent rate. We used this model to compute the Ka/Ks ratio for each gene of each Csl family. Analyses were also performed using the conserved domain regions; the result as shown in Figure <figr fid="F2">2</figr> remains unchanged (data not shown).</p>
         </sec>
         <sec>
            <st>
               <p>Gene structure analysis</p>
            </st>
            <p>The gene structure information was parsed from the GFF file downloaded along with the genome data, and was used as the input for the graphic display at the Gene Structure Display Server of Peking University <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>YY conceived this study, conducted all the analyses and drafted the manuscript. JH participated in data interpretation and revised the manuscript. YX supervised the project and finalized the paper.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors are grateful to the reviewers for very constructive comments. We acknowledge the U.S. Department of Energy (grant # 4000063512) and the National Science Foundation (grant #: NSF/DEB-0830024, NSF/DBI-0354771, NSF/ITR-IIS-0407204, NSF/DBI-0542119, and NSF/CCF0621700) for funding support. The BioEnergy Science Center is supported by the Office of Biological and Environmental Research in the DOE Office of Science. We acknowledge that the sequence data of <it>Selaginella moellendorffii, Volvox carteri f. nagariensis, Aureococcus anophagefferens, and Phaeodactylum tricornutum </it>were produced and released by the U.S. Department of Energy Joint Genome Institute <url>http://www.jgi.doe.gov/</url> before the genomes are published. JH also acknowledges the support of a Research/Creativity Award from the East Carolina University. We thank Dr. Victor Olman for statistics suggestion and the biofuel group of the Computational System Biology Lab (CSBL) and all other CSBL members for helpful discussions and comments.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Higher plants contain homologs of the bacterial celA genes encoding the catalytic subunit of cellulose synthase</p>
            </title>
            <aug>
               <au>
                  <snm>Pear</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Kawagoe</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Schreckengost</snm>
                  <fnm>WE</fnm>
               </au>
               <au>
                  <snm>Delmer</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Stalker</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>1996</pubdate>
            <volume>93</volume>
            <issue>22</issue>
            <fpage>12637</fpage>
            <lpage>12642</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.93.22.12637</pubid>
                  <pubid idtype="pmcid">38045</pubid>
                  <pubid idtype="pmpid">8901635</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The cellulose synthase superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Richmond</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Somerville</snm>
                  <fnm>CR</fnm>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2000</pubdate>
            <volume>124</volume>
            <issue>2</issue>
            <fpage>495</fpage>
            <lpage>498</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.124.2.495</pubid>
                  <pubid idtype="pmcid">1539280</pubid>
                  <pubid idtype="pmpid" link="fulltext">11027699</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Cellulose synthase-like genes of rice</p>
            </title>
            <aug>
               <au>
                  <snm>Hazen</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Scott-Craig</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Walton</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2002</pubdate>
            <volume>128</volume>
            <issue>2</issue>
            <fpage>336</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.010875</pubid>
                  <pubid idtype="pmcid">1540205</pubid>
                  <pubid idtype="pmpid" link="fulltext">11842136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The genome sequence of black cottonwood (Populus trichocarpa) reveals 18 conserved cellulose synthase (CesA) genes</p>
            </title>
            <aug>
               <au>
                  <snm>Djerbi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lindskog</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Arvestad</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sterky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Teeri</snm>
                  <fnm>TT</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2005</pubdate>
            <volume>221</volume>
            <issue>5</issue>
            <fpage>739</fpage>
            <lpage>746</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00425-005-1498-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">15940463</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The cellulose synthase gene superfamily and biochemical functions of xylem-specific cellulose synthase-like genes in Populus trichocarpa</p>
            </title>
            <aug>
               <au>
                  <snm>Suzuki</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>YH</fnm>
               </au>
               <au>
                  <snm>Chiang</snm>
                  <fnm>VL</fnm>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2006</pubdate>
            <volume>142</volume>
            <issue>3</issue>
            <fpage>1233</fpage>
            <lpage>1245</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.106.086678</pubid>
                  <pubid idtype="pmcid">1630762</pubid>
                  <pubid idtype="pmpid" link="fulltext">16950861</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The cellulose synthase (CESA) gene superfamily of the moss Physcomitrella patens</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Bushoven</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Plant molecular biology</source>
            <pubdate>2007</pubdate>
            <volume>63</volume>
            <issue>2</issue>
            <fpage>207</fpage>
            <lpage>219</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11103-006-9083-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">17006591</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Cellulose synthesis in maize: isolation and expression analysis of the cellulose synthase (CesA) gene family</p>
            </title>
            <aug>
               <au>
                  <snm>Appenzeller</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Doblin</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barreiro</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Niu</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Kollipara</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Carrigan</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tomes</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dhugga</snm>
                  <fnm>KS</fnm>
               </au>
            </aug>
            <source>Cellulose</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>3&#8211;4</issue>
            <fpage>287</fpage>
            <lpage>299</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/B:CELL.0000046417.84715.27</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The CesA gene family of barley. Quantitative analysis of transcripts reveals two groups of co-expressed genes</p>
            </title>
            <aug>
               <au>
                  <snm>Burton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Shirley</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Harvey</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Fincher</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2004</pubdate>
            <volume>134</volume>
            <issue>1</issue>
            <fpage>224</fpage>
            <lpage>236</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.103.032904</pubid>
                  <pubid idtype="pmcid">316302</pubid>
                  <pubid idtype="pmpid" link="fulltext">14701917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Three loblolly pine CesA genes expressed in developing xylem are orthologous to secondary cell wall CesA genes of angiosperms</p>
            </title>
            <aug>
               <au>
                  <snm>Nairn</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Haselkorn</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>The New phytologist</source>
            <pubdate>2005</pubdate>
            <volume>166</volume>
            <issue>3</issue>
            <fpage>907</fpage>
            <lpage>915</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1469-8137.2005.01372.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15869651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Cellulose synthase (CesA) genes in the green alga Mesotaenium caldariorum</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Delmer</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Eukaryotic cell</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <issue>6</issue>
            <fpage>847</fpage>
            <lpage>855</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/EC.1.6.847-855.2002</pubid>
                  <pubid idtype="pmcid">138757</pubid>
                  <pubid idtype="pmpid" link="fulltext">12477785</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Cellulose synthase (CesA) genes in algae and seedless plants</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Cellulose</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>3&#8211;4</issue>
            <fpage>419</fpage>
            <lpage>435</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/B:CELL.0000046418.01131.d3</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Cellulose synthase (CesA) genes in the red alga Porphyra Yezoensis Ueda</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
            </aug>
            <source>Journal of phycology</source>
            <pubdate>2007</pubdate>
            <volume>43</volume>
            <issue>Suppl 1</issue>
            <fpage>42</fpage>
            <lpage>42</lpage>
         </bibl>
         <bibl id="B13">
            <title>
               <p>A cellulose synthase (CesA) gene from the red alga Porphyra Yezoensis (Rhodophyta)</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
            </aug>
            <source>Journal of phycology</source>
            <pubdate>2009</pubdate>
            <volume>45</volume>
            <issue>1</issue>
            <fpage>203</fpage>
            <lpage>212</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1529-8817.2008.00626.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Biosynthesis of plant cell wall polysaccharides &#8211; a complex process</p>
            </title>
            <aug>
               <au>
                  <snm>Lerouxel</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Cavalier</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Liepman</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Keegstra</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Current opinion in plant biology</source>
            <pubdate>2006</pubdate>
            <volume>9</volume>
            <issue>6</issue>
            <fpage>621</fpage>
            <lpage>630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pbi.2006.09.009</pubid>
                  <pubid idtype="pmpid" link="fulltext">17011813</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Guar seed beta-mannan synthase is a member of the cellulose synthase super gene family</p>
            </title>
            <aug>
               <au>
                  <snm>Dhugga</snm>
                  <fnm>KS</fnm>
               </au>
               <au>
                  <snm>Barreiro</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Whitten</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stecca</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hazebroek</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Randhawa</snm>
                  <fnm>GS</fnm>
               </au>
               <au>
                  <snm>Dolan</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kinney</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Tomes</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nichols</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2004</pubdate>
            <volume>303</volume>
            <issue>5656</issue>
            <fpage>363</fpage>
            <lpage>366</lpage>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Expression of cellulose synthase-like (Csl) genes in insect cells reveals that CslA family members encode mannan synthases</p>
            </title>
            <aug>
               <au>
                  <snm>Liepman</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Wilkerson</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Keegstra</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>6</issue>
            <fpage>2221</fpage>
            <lpage>2226</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0409179102</pubid>
                  <pubid idtype="pmcid">548565</pubid>
                  <pubid idtype="pmpid" link="fulltext">15647349</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A barley cellulose synthase-like CSLH gene mediates (1,3;1,4)-beta-D-glucan synthesis in transgenic Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Doblin</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Pettolino</snm>
                  <fnm>FA</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Campbell</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Burton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Fincher</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Newbigin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bacic</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2009</pubdate>
            <volume>106</volume>
            <issue>14</issue>
            <fpage>5996</fpage>
            <lpage>6001</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0902019106</pubid>
                  <pubid idtype="pmpid" link="fulltext">19321749</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Cellulose synthase-like CslF genes mediate the synthesis of cell wall (1,3;1,4)-beta-D-glucans</p>
            </title>
            <aug>
               <au>
                  <snm>Burton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Hrmova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Harvey</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Shirley</snm>
                  <fnm>NJ</fnm>
               </au>
               <au>
                  <snm>Stone</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Newbigin</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Bacic</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fincher</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2006</pubdate>
            <volume>311</volume>
            <issue>5769</issue>
            <fpage>1940</fpage>
            <lpage>1942</lpage>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A gene from the cellulose synthase-like C family encodes a beta-1,4 glucan synthase</p>
            </title>
            <aug>
               <au>
                  <snm>Cocuron</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Lerouxel</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Drakakaki</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Alonso</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Liepman</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Keegstra</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Raikhel</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Wilkerson</snm>
                  <fnm>CG</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>20</issue>
            <fpage>8550</fpage>
            <lpage>8555</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0703133104</pubid>
                  <pubid idtype="pmcid">1895987</pubid>
                  <pubid idtype="pmpid" link="fulltext">17488821</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Plant cell wall biosynthesis: genetic, biochemical and functional genomics approaches to the identification of key genes</p>
            </title>
            <aug>
               <au>
                  <snm>Farrokhi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Burton</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Brownfield</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Hrmova</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Bacic</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fincher</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Plant biotechnology journal</source>
            <pubdate>2006</pubdate>
            <volume>4</volume>
            <issue>2</issue>
            <fpage>145</fpage>
            <lpage>167</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1467-7652.2005.00169.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">17177793</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Revolutionary times in our understanding of cell wall biosynthesis and remodeling in the grasses</p>
            </title>
            <aug>
               <au>
                  <snm>Fincher</snm>
                  <fnm>GB</fnm>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2009</pubdate>
            <volume>149</volume>
            <issue>1</issue>
            <fpage>27</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.108.130096</pubid>
                  <pubid idtype="pmpid" link="fulltext">19126692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The pivotal role of cyanobacteria in the evolution of cellulose synthases and cellulose synthase-like proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Nobles</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Cellulose</source>
            <pubdate>2004</pubdate>
            <volume>11</volume>
            <issue>3&#8211;4</issue>
            <fpage>437</fpage>
            <lpage>448</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1023/B:CELL.0000046339.48003.0e</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Cellulose in cyanobacteria. Origin of vascular plant cellulose synthase?</p>
            </title>
            <aug>
               <au>
                  <snm>Nobles</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Romanovicz</snm>
                  <fnm>DK</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>RM</fnm>
                  <suf>Jr</suf>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2001</pubdate>
            <volume>127</volume>
            <issue>2</issue>
            <fpage>529</fpage>
            <lpage>542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1104/pp.010557</pubid>
                  <pubid idtype="pmcid">125088</pubid>
                  <pubid idtype="pmpid" link="fulltext">11598227</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Green genes &#8211; Comparative genomics of the green branch of life</p>
            </title>
            <aug>
               <au>
                  <snm>Bowman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Floyd</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Sakakibara</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Cell</source>
            <pubdate>2007</pubdate>
            <volume>129</volume>
            <issue>2</issue>
            <fpage>229</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cell.2007.04.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">17448980</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Armbrust</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Berges</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bowler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Martinez</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>NH</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Apt</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Bechner</snm>
                  <fnm>M</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2004</pubdate>
            <volume>306</volume>
            <issue>5693</issue>
            <fpage>79</fpage>
            <lpage>86</lpage>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Genome sequence of the ultrasmall unicellular red alga Cyanidioschyzon merolae 10D</p>
            </title>
            <aug>
               <au>
                  <snm>Matsuzaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Misumi</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Shin</snm>
                  <fnm>IT</fnm>
               </au>
               <au>
                  <snm>Maruyama</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Takahara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miyagishima</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Mori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Yagisawa</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Nishida</snm>
                  <fnm>K</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>428</volume>
            <issue>6983</issue>
            <fpage>653</fpage>
            <lpage>657</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02398</pubid>
                  <pubid idtype="pmpid" link="fulltext">15071595</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Green Evolution and Dynamic Adaptations Revealed by Genomes of the Marine Picoeukaryotes Micromonas</p>
            </title>
            <aug>
               <au>
                  <snm>Worden</snm>
                  <fnm>AZ</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Mock</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Simmons</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Aerts</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Cuvelier</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Derelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Everett</snm>
                  <fnm>MV</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2009</pubdate>
            <volume>324</volume>
            <issue>5924</issue>
            <fpage>268</fpage>
            <lpage>272</lpage>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The tiny eukaryote Ostreococcus provides genomic insights into the paradox of plankton speciation</p>
            </title>
            <aug>
               <au>
                  <snm>Palenik</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Grimwood</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aerts</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Salamov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Dupont</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jorgensen</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Derelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2007</pubdate>
            <volume>104</volume>
            <issue>18</issue>
            <fpage>7705</fpage>
            <lpage>7710</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0611046104</pubid>
                  <pubid idtype="pmcid">1863510</pubid>
                  <pubid idtype="pmpid" link="fulltext">17460045</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Genome analysis of the smallest free-living eukaryote Ostreococcus tauri unveils many unique features</p>
            </title>
            <aug>
               <au>
                  <snm>Derelle</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ferraz</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rouze</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Worden</snm>
                  <fnm>AZ</fnm>
               </au>
               <au>
                  <snm>Robbens</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Partensky</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Degroeve</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Echeynie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2006</pubdate>
            <volume>103</volume>
            <issue>31</issue>
            <fpage>11647</fpage>
            <lpage>11652</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0604795103</pubid>
                  <pubid idtype="pmcid">1544224</pubid>
                  <pubid idtype="pmpid" link="fulltext">16868079</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The Chlamydomonas genome reveals the evolution of key animal and plant functions</p>
            </title>
            <aug>
               <au>
                  <snm>Merchant</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Prochnik</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Vallon</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>EH</fnm>
               </au>
               <au>
                  <snm>Karpowicz</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Witman</snm>
                  <fnm>GB</fnm>
               </au>
               <au>
                  <snm>Terry</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Salamov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fritz-Laylin</snm>
                  <fnm>LK</fnm>
               </au>
               <au>
                  <snm>Marechal-Drouard</snm>
                  <fnm>L</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2007</pubdate>
            <volume>318</volume>
            <issue>5848</issue>
            <fpage>245</fpage>
            <lpage>250</lpage>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The Physcomitrella genome reveals evolutionary insights into the conquest of land by plants</p>
            </title>
            <aug>
               <au>
                  <snm>Rensing</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Lang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zimmer</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Terry</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Salamov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shapiro</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nishiyama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Perroud</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Lindquist</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Kamisugi</snm>
                  <fnm>Y</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2008</pubdate>
            <volume>319</volume>
            <issue>5859</issue>
            <fpage>64</fpage>
            <lpage>69</lpage>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Selaginella moellendorffii Csls</p>
            </title>
            <url>http://wiki.genomics.purdue.edu/index.php/Cellulose_synthase_superfamily</url>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The genome of black cottonwood, Populus trichocarpa (Torr. &amp; Gray)</p>
            </title>
            <aug>
               <au>
                  <snm>Tuskan</snm>
                  <fnm>GA</fnm>
               </au>
               <au>
                  <snm>Difazio</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jansson</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Bohlmann</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grigoriev</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hellsten</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ralph</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rombauts</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Salamov</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2006</pubdate>
            <volume>313</volume>
            <issue>5793</issue>
            <fpage>1596</fpage>
            <lpage>1604</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Analysis of the genome sequence of the flowering plant Arabidopsis thaliana</p>
            </title>
            <aug>
               <au>
                  <cnm>ArabidopsisGenomeInitiative</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>408</volume>
            <issue>6814</issue>
            <fpage>796</fpage>
            <lpage>815</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35048692</pubid>
                  <pubid idtype="pmpid" link="fulltext">11130711</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla</p>
            </title>
            <aug>
               <au>
                  <snm>Jaillon</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Aury</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Noel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Policriti</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Clepet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Casagrande</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Choisne</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Aubourg</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vitulo</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Jubin</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>449</volume>
            <issue>7161</issue>
            <fpage>463</fpage>
            <lpage>467</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature06148</pubid>
                  <pubid idtype="pmpid" link="fulltext">17721507</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>A draft sequence of the rice genome (Oryza sativa L. ssp. japonica)</p>
            </title>
            <aug>
               <au>
                  <snm>Goff</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Ricke</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>TH</fnm>
               </au>
               <au>
                  <snm>Presting</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Glazebrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sessions</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oeller</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Varma</snm>
                  <fnm>H</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <issue>5565</issue>
            <fpage>92</fpage>
            <lpage>100</lpage>
         </bibl>
         <bibl id="B37">
            <title>
               <p>A draft sequence of the rice genome (Oryza sativa L. ssp. indica)</p>
            </title>
            <aug>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>GK</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Deng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Dai</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Zhou</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>X</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science (New York, NY)</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <issue>5565</issue>
            <fpage>79</fpage>
            <lpage>92</lpage>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The Sorghum bicolor genome and the diversification of grasses</p>
            </title>
            <aug>
               <au>
                  <snm>Paterson</snm>
                  <fnm>AH</fnm>
               </au>
               <au>
                  <snm>Bowers</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Bruggmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Grimwood</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gundlach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Haberer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Hellsten</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Mitros</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Poliakov</snm>
                  <fnm>A</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2009</pubdate>
            <volume>457</volume>
            <issue>7229</issue>
            <fpage>551</fpage>
            <lpage>556</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07723</pubid>
                  <pubid idtype="pmpid" link="fulltext">19189423</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Gapped BLAST and PSI-BLAST: a new generation of protein database search programs</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Schaffer</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <issue>17</issue>
            <fpage>3389</fpage>
            <lpage>3402</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/25.17.3389</pubid>
                  <pubid idtype="pmcid">146917</pubid>
                  <pubid idtype="pmpid" link="fulltext">9254694</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Pfam: clans, web tools and services</p>
            </title>
            <aug>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schuster-Bockler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lassmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D247</fpage>
            <lpage>251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkj149</pubid>
                  <pubid idtype="pmcid">1347511</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381856</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Statistical methods for detecting molecular adaptation</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Bielawski</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Trends in ecology &amp; evolution (Personal edition)</source>
            <pubdate>2000</pubdate>
            <volume>15</volume>
            <issue>12</issue>
            <fpage>496</fpage>
            <lpage>503</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11114436</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Evolutionary systems biology: links between gene evolution and function</p>
            </title>
            <aug>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
            </aug>
            <source>Current opinion in biotechnology</source>
            <pubdate>2006</pubdate>
            <volume>17</volume>
            <issue>5</issue>
            <fpage>481</fpage>
            <lpage>487</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16962765</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>An integrated view of protein evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Pal</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Papp</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lercher</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nature reviews</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <issue>5</issue>
            <fpage>337</fpage>
            <lpage>348</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16619049</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Unifying measures of gene function and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Carmel</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Proceedings of the Royal Society B-Biological Sciences</source>
            <pubdate>2006</pubdate>
            <volume>273</volume>
            <issue>1593</issue>
            <fpage>1507</fpage>
            <lpage>1515</lpage>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The origin of new genes: glimpses from the young and old</p>
            </title>
            <aug>
               <au>
                  <snm>Long</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Betran</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nature reviews</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>11</issue>
            <fpage>865</fpage>
            <lpage>875</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14634634</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Cellulose biosynthesis in plants: from genes to rosettes</p>
            </title>
            <aug>
               <au>
                  <snm>Doblin</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Kurek</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jacob-Wilk</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Delmer</snm>
                  <fnm>DP</fnm>
               </au>
            </aug>
            <source>Plant &amp; cell physiology</source>
            <pubdate>2002</pubdate>
            <volume>43</volume>
            <issue>12</issue>
            <fpage>1407</fpage>
            <lpage>1420</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12514238</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The cell walls that bind the tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Niklas</snm>
                  <fnm>KJ</fnm>
               </au>
            </aug>
            <source>Bioscience</source>
            <pubdate>2004</pubdate>
            <volume>54</volume>
            <issue>9</issue>
            <fpage>831</fpage>
            <lpage>841</lpage>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Evolution of the cellulose synthase (CesA) Gene Family: Insights from green algae and seedless plants</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>AW</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Cellulose: Molecular and Structural Biology</source>
            <publisher>Springer Netherlands</publisher>
            <editor>Brown RMJ, Saxena IM</editor>
            <pubdate>2007</pubdate>
            <fpage>17</fpage>
            <lpage>34</lpage>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The sites of cellulose synthesis in algae: Diversity and evolution of cellulose-synthesizing enzyme complexes</p>
            </title>
            <aug>
               <au>
                  <snm>Tsekos</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Journal of phycology</source>
            <pubdate>1999</pubdate>
            <volume>35</volume>
            <issue>4</issue>
            <fpage>635</fpage>
            <lpage>655</lpage>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Many paths up the mountain: tracking the evolution of cellulose biosynthesis</p>
            </title>
            <aug>
               <au>
                  <snm>Nobles</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Cellulose: Molecular and Structural Biology</source>
            <publisher>Springer Netherlands</publisher>
            <editor>Brown RMJ, Saxena IM</editor>
            <pubdate>2007</pubdate>
            <fpage>1</fpage>
            <lpage>15</lpage>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Evolution and diversity of green plant cell walls</p>
            </title>
            <aug>
               <au>
                  <snm>Popper</snm>
                  <fnm>ZA</fnm>
               </au>
            </aug>
            <source>Current opinion in plant biology</source>
            <pubdate>2008</pubdate>
            <volume>11</volume>
            <issue>3</issue>
            <fpage>286</fpage>
            <lpage>292</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">18406657</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Primary cell wall composition of bryophytes and charophytes</p>
            </title>
            <aug>
               <au>
                  <snm>Popper</snm>
                  <fnm>ZA</fnm>
               </au>
               <au>
                  <snm>Fry</snm>
                  <fnm>SC</fnm>
               </au>
            </aug>
            <source>Annals of botany</source>
            <pubdate>2003</pubdate>
            <volume>91</volume>
            <issue>1</issue>
            <fpage>1</fpage>
            <lpage>12</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12495914</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Comparison of DNA sequences with protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Pearson</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>1997</pubdate>
            <volume>46</volume>
            <issue>1</issue>
            <fpage>24</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9403055</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>TAIR</p>
            </title>
            <url>ftp://ftp.arabidopsis.org/home/tair/Genes/TAIR7_genome_release</url>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Rice genome</p>
            </title>
            <url>ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/o_sativa/annotation_dbs/pseudomolecules/version_5.0</url>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Grape genome</p>
            </title>
            <url>http://www.genoscope.cns.fr/</url>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Red algal genome</p>
            </title>
            <url>http://merolae.biol.s.u-tokyo.ac.jp/</url>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Bacteria genomes</p>
            </title>
            <url>ftp://ftp.ncbi.nih.gov/genomes/Bacteria</url>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Cell Wall Navigator database</p>
            </title>
            <url>http://bioweb.ucr.edu/Cellwall/family.pl?family_id=6</url>
         </bibl>
         <bibl id="B60">
            <title>
               <p>The Cell Wall Navigator database. A systems-based approach to organism-unrestricted mining of protein families involved in cell wall metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Girke</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lauricha</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Keegstra</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Raikhel</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Plant physiology</source>
            <pubdate>2004</pubdate>
            <volume>136</volume>
            <issue>2</issue>
            <fpage>3003</fpage>
            <lpage>3008</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523362</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489283</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>The universal protein resource (UniProt)</p>
            </title>
            <aug>
               <au>
                  <cnm>UniProt</cnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>2008</pubdate>
            <issue>36 Database</issue>
            <fpage>D190</fpage>
            <lpage>195</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2238893</pubid>
                  <pubid idtype="pmpid" link="fulltext">18045787</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>MAFFT version 5: improvement in accuracy of multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Katoh</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuma</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Toh</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Miyata</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <issue>2</issue>
            <fpage>511</fpage>
            <lpage>518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">548345</pubid>
                  <pubid idtype="pmpid" link="fulltext">15661851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>A statistical score for assessing the quality of multiple sequence alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Ahola</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Aittokallio</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Vihinen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Uusipaikka</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>BMC bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>484</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1687212</pubid>
                  <pubid idtype="pmpid" link="fulltext">17081313</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>The accuracy of several multiple sequence alignment programs for proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Nuin</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Tillier</snm>
                  <fnm>ER</fnm>
               </au>
            </aug>
            <source>BMC bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>471</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1633746</pubid>
                  <pubid idtype="pmpid" link="fulltext">17062146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>Identification of cellulose synthase(s) in higher plants: Sequence analysis of processive beta-glycosyltransferases with the common motif 'D, D, D35Q(R, Q)XRW'</p>
            </title>
            <aug>
               <au>
                  <snm>Saxena</snm>
                  <fnm>IM</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>RM</fnm>
               </au>
            </aug>
            <source>Cellulose</source>
            <pubdate>1997</pubdate>
            <volume>4</volume>
            <issue>1</issue>
            <fpage>33</fpage>
            <lpage>49</lpage>
         </bibl>
         <bibl id="B66">
            <title>
               <p>ProtTest: selection of best-fit models of protein evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Abascal</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Zardoya</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Posada</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Bioinformatics (Oxford, England)</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>9</issue>
            <fpage>2104</fpage>
            <lpage>2105</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15647292</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B67">
            <title>
               <p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Systematic biology</source>
            <pubdate>2003</pubdate>
            <volume>52</volume>
            <issue>5</issue>
            <fpage>696</fpage>
            <lpage>704</lpage>
            <xrefbib>
               <pubid idtype="pmpid">14530136</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0</p>
            </title>
            <aug>
               <au>
                  <snm>Tamura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dudley</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kumar</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Molecular biology and evolution</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <issue>8</issue>
            <fpage>1596</fpage>
            <lpage>1599</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17488738</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B69">
            <title>
               <p>PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments</p>
            </title>
            <aug>
               <au>
                  <snm>Suyama</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Torrents</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nucleic acids research</source>
            <pubdate>2006</pubdate>
            <issue>34 Web Server</issue>
            <fpage>W609</fpage>
            <lpage>612</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1538804</pubid>
                  <pubid idtype="pmpid" link="fulltext">16845082</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B70">
            <title>
               <p>PAML 4: phylogenetic analysis by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Yang</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>Molecular biology and evolution</source>
            <pubdate>2007</pubdate>
            <volume>24</volume>
            <issue>8</issue>
            <fpage>1586</fpage>
            <lpage>1591</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17483113</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B71">
            <title>
               <p>GSDS: a gene structure display server</p>
            </title>
            <aug>
               <au>
                  <snm>Guo</snm>
                  <fnm>AY</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>QH</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Yi chuan = Hereditas/Zhongguo yi chuan xue hui bian ji</source>
            <pubdate>2007</pubdate>
            <volume>29</volume>
            <issue>8</issue>
            <fpage>1023</fpage>
            <lpage>1026</lpage>
            <xrefbib>
               <pubid idtype="pmpid">17681935</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

