<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-9-383</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Characterization of expressed sequence tags from a full-length enriched cDNA library of <it>Cryptomeria japonica </it>male strobili</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Futamura</snm>
               <fnm>Norihiro</fnm>
               <insr iid="I1"/>
               <email>futa@ffpri.affrc.go.jp</email>
            </au>
            <au id="A2">
               <snm>Totoki</snm>
               <fnm>Yasushi</fnm>
               <insr iid="I2"/>
               <email>totoki@gsc.riken.jp</email>
            </au>
            <au id="A3">
               <snm>Toyoda</snm>
               <fnm>Atsushi</fnm>
               <insr iid="I3"/>
               <email>toyoda@gsc.riken.jp</email>
            </au>
            <au id="A4">
               <snm>Igasaki</snm>
               <fnm>Tomohiro</fnm>
               <insr iid="I1"/>
               <email>iga@ffpri.affrc.go.jp</email>
            </au>
            <au id="A5">
               <snm>Nanjo</snm>
               <fnm>Tokihiko</fnm>
               <insr iid="I1"/>
               <email>nanjo@affrc.go.jp</email>
            </au>
            <au id="A6">
               <snm>Seki</snm>
               <fnm>Motoaki</fnm>
               <insr iid="I4"/>
               <email>mseki@psc.riken.jp</email>
            </au>
            <au id="A7">
               <snm>Sakaki</snm>
               <fnm>Yoshiyuki</fnm>
               <insr iid="I2"/>
               <insr iid="I3"/>
               <email>sakaki@gsc.riken.jp</email>
            </au>
            <au id="A8">
               <snm>Mari</snm>
               <fnm>Adriano</fnm>
               <insr iid="I5"/>
               <email>adriano.mari@allergome.org</email>
            </au>
            <au id="A9">
               <snm>Shinozaki</snm>
               <fnm>Kazuo</fnm>
               <insr iid="I4"/>
               <email>sinozaki@rtc.riken.jp</email>
            </au>
            <au id="A10">
               <snm>Shinohara</snm>
               <fnm>Kenji</fnm>
               <insr iid="I1"/>
               <email>kenjis@ffpri.affrc.go.jp</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Department of Molecular and Cell Biology, Forestry and Forest Products Research Institute, Tsukuba, Ibaraki 305-8687, Japan</p>
            </ins>
            <ins id="I2">
               <p>Computational and Experimental Systems Biology Group, Genomic Sciences Center, RIKEN, Yokohama, Kanagawa 230-0045, Japan</p>
            </ins>
            <ins id="I3">
               <p>Genome Core Technology Facilities, Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045, Japan</p>
            </ins>
            <ins id="I4">
               <p>Plant Functional Genomics Research Group, Plant Science Center, RIKEN Yokohama Institute, Yokohama, Kanagawa 230-0045, Japan</p>
            </ins>
            <ins id="I5">
               <p>Allergy Data Laboratories sc, Via Malipiero 28, 04100 Latina, Italy</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2008</pubdate>
         <volume>9</volume>
         <issue>1</issue>
         <fpage>383</fpage>
         <url>http://www.biomedcentral.com/1471-2164/9/383</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18691438</pubid>
               <pubid idtype="doi">10.1186/1471-2164-9-383</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>28</day>
               <month>1</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>11</day>
               <month>8</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>11</day>
               <month>8</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Futamura et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p><it>Cryptomeria japonica </it>D. Don is one of the most commercially important conifers in Japan. However, the allergic disease caused by its pollen is a severe public health problem in Japan. Since large-scale analysis of expressed sequence tags (ESTs) in the male strobili of <it>C. japonica </it>should help us to clarify the overall expression of genes during the process of pollen development, we constructed a full-length enriched cDNA library that was derived from male strobili at various developmental stages.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We obtained 36,011 expressed sequence tags (ESTs) from either one or both ends of 19,437 clones derived from the cDNA library of <it>C. japonica </it>male strobili at various developmental stages. The 19,437 cDNA clones corresponded to 10,463 transcripts. Approximately 80% of the transcripts resembled ESTs from <it>Pinus </it>and <it>Picea</it>, while approximately 75% had homologs in <it>Arabidopsis</it>. An analysis of homologies between ESTs from <it>C. japonica </it>male strobili and known pollen allergens in the Allergome Database revealed that products of 180 transcripts exhibited significant homology. Approximately 2% of the transcripts appeared to encode transcription factors. We identified twelve genes for MADS-box proteins among these transcription factors. The twelve MADS-box genes were classified as <it>DEF/GLO/GGM13-, AG-, AGL6-, TM3- </it>and <it>TM8</it>-like MIKC<sup>C </sup>genes and type I MADS-box genes.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Our full-length enriched cDNA library derived from <it>C. japonica </it>male strobili provides information on expression of genes during the development of male reproductive organs. We provided potential allergens in <it>C. japonica</it>. We also provided new information about transcription factors including MADS-box genes expressed in male strobili of <it>C. japonica</it>. Large-scale gene discovery using full-length cDNAs is a valuable tool for studies of gymnosperm species.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Molecular studies of angiosperm model plants have identified large numbers of genes that are expressed during stamen and pollen development. Genetic analyses have revealed the roles of some of these genes in the specification of stamen identity, the regulation of anther cell division and the differentiation of anthers, the control of male meiosis, the development of pollen, and anther dehiscence <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Pollen development in gymnosperms and angiosperms involves several similar developmental and physiological processes <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr></abbrgrp>. Homologs to floral transcription factors in angiosperms have been isolated from gymnosperm strobili and characterized. For example, the B-class MADS-box genes, which in angiosperms determine petal and stamen identities, and C-class genes that control the identities of stamens and carpels have been identified in conifers <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The Norway spruce gene <it>DAL1</it>, belongs to the <it>AGL6 </it>subfamily and <it>DAL10</it>, belongs to gymnosperm-specific subgroup were identified as other MADS-box genes in conifers <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr></abbrgrp>. The flowering time gene, <it>SOC1 </it>and <it>LEAFY</it>, and A class gene <it>APETALA2 </it>have also been identified <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr></abbrgrp>. However, available information about transcripts that are expressed in the male reproductive structures of the gymnosperms is still limited <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>The gymnosperm <it>Cryptomeria </it>is a monoecious plant that is distributed throughout Japan and in some parts of China. <it>Cryptomeria japonica </it>D. Don is widely grown in Japan because of its high productivity and utility. This species comprises 18% of the forests and covers 12% of the landmass of Japan. However, allergic reactions to its pollen have become a severe public health problem in Japan. A recent nationwide epidemiological survey found that at least 13% of the Japanese population suffers from pollinosis due to pollen of <it>C. japonica </it><abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
         <p>The male strobili of <it>C. japonica </it>develop in axils of small branches near the tips of these branches. The primordia of male strobili are initiated from June to August and first become visible from July to September under natural conditions <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. The development of strobili can also be initiated by treatment with gibberellic acid (GA<sub>3</sub>). The promotion of flower formation by exogenous GA<sub>3 </sub>occurs even in one-year-old seedlings in spite of the fact that formation of strobili usually requires around 20 years after germination under natural conditions <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Meiosis of microsporocytes begins in the middle of October, and microspores are formed from late October to late November. Differentiation of generative cells and tube cells occurs in December. The staminate strobili then remain in an arrested state of development until the following March when pollen grains are released. Each male strobilus is oval, pale yellow, and close to 5 mm in length and 2 mm in diameter, and each consists of microsporophylls attached to a main axis. Three to five rounded microsporangia develop on the lower surface of each microsporophyll. Each microsporangium contains 3,000 or more pollen grains, and as many as 400,000 pollen grains may be produced in a single strobilus <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
         <p>Large-scale analysis of expressed sequence tags (ESTs) should help us to clarify the overall expression of genes in the male strobili of <it>C. japonica</it>. Some ESTs have already been derived from male strobili and from the pollen of <it>C. japonica </it><abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. However, the numbers of ESTs were relatively small, namely, 739 from male strobili and 3,655 from pollen, and most isolated cDNAs were not full-length. In the present study, we constructed a full-length enriched cDNA library using RNA derived from male strobili at various developmental stages, and we obtained more than 30,000 ESTs. We performed sequence-similarity searches using TBLASTX and BLASTX to compare the ESTs from <it>C. japonica </it>male strobili to sequences in UniProt, amino acid sequences from <it>Arabidopsis </it>and rice (<it>Oryza sativa </it>L.), and EST sequences from poplar, spruce and pine. We also identified ESTs that encoded MADS-box genes, which play a variety of important developmental roles in plants. In this report, we discuss both the utility of full-length cDNAs for gene discovery and the characterization of MADS-box genes in <it>C. japonica</it>.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Construction and quality check of a full-length enriched cDNA library</p>
            </st>
            <p>We constructed a full-length enriched cDNA library from male strobili of <it>C. japonica </it>at various stages of development. The mean length of the insert DNA derived from 91 randomly selected clones was approximately 1.6 kbp. We picked up 19,968 clones (clone IDs: CMFL001_A1 through CMFL052_N24) at random and sequenced them from their 5' and 3' ends. In total, 39,936 sequences were obtained and processed for removal of vector sequences, low-quality data and contamination by genomic DNA from <it>E. coli</it>. We retained 36,011 ESTs, which encompassed 18,843 sequences of 5' ends and 17,168 sequences of 3' ends derived from 19,437 clones. Each EST consisted of at least 100 contiguous nucleotides with a PHRED score greater than 20 (Table <tblr tid="T1">1</tblr>; Additional file <supplr sid="S1">1</supplr>). The average sequenced length of the ESTs was 517 bp. All sequences have been deposited in the dbEST division of DDBJ/EMBL/GenBank. The accession number of each EST is shown in Additional file <supplr sid="S1">1</supplr>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p><b>Annotation and classification of all ESTs from male strobili of <it>Cryptomeria japonica</it></b>. Annotation and classification of all ESTs from male strobili of <it>Cryptomeria japonica</it>. This file shows that the list of DDBJ accession numbers in dbEST, the result of BLASTX against Uniprot TrEMBL, AtCDS and rice cDNA clones, and the functional classification using COG database for ESTs derived from the full-length enriched cDNA library of <it>C. japonica </it>male strobili.</p>
               </text>
               <file name="1471-2164-9-383-S1.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Summary of characteristics of the full-length cDNA library from male strobili of C. japonica</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Total sequences</p>
                     </c>
                     <c ca="right">
                        <p>36,011</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of 5'sequences</p>
                     </c>
                     <c ca="right">
                        <p>18,843</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of 3' sequences</p>
                     </c>
                     <c ca="right">
                        <p>17,168</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of cDNA clones</p>
                     </c>
                     <c ca="right">
                        <p>19,437</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of contigs<sup>a</sup></p>
                     </c>
                     <c ca="right">
                        <p>7,686</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of singlets<sup>a</sup></p>
                     </c>
                     <c ca="right">
                        <p>15,972</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of unique transcripts<sup>b</sup></p>
                     </c>
                     <c ca="right">
                        <p>10,463</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of unique transcripts corresponded to</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>1 clone</p>
                     </c>
                     <c ca="right">
                        <p>6,320</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>2 clones</p>
                     </c>
                     <c ca="right">
                        <p>2,078</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>3&#8211;5 clones</p>
                     </c>
                     <c ca="right">
                        <p>1,686</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>6&#8211;10 clones</p>
                     </c>
                     <c ca="right">
                        <p>353</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>11&#8211;20 clones</p>
                     </c>
                     <c ca="right">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c indent="1" ca="left">
                        <p>>21 clones</p>
                     </c>
                     <c ca="right">
                        <p>5</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>ESTs were assembled using the PHRAP program.</p>
                  <p><sup>b </sup>Contigs and singlets were grouped using the BLASTN program and then both 5'- and 3'- end sequences derived from the respective same clones were grouped together.</p>
               </tblfn>
            </tbl>
            <p>We analyzed the full-length cDNA clones that included sites for the initiation of transcription of mRNA. An additional G at the 5' end is observed in many full-length cDNAs that are obtained by the biotinylated CAP trapper method, which we used in this study <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>. Among the 5' sequence in our cDNA library derived from <it>C. japonica </it>male strobili, 16,977 (90%) had G at the 5' end (Table <tblr tid="T2">2</tblr>). The high percentage of cDNA clones with G at the 5' terminus suggests that the relative level of full-length cDNA clones in our library was high. The second nucleotide showed a strong bias towards purines (89% A or G; Table <tblr tid="T2">2</tblr>). This observation is consistent with the fact that transcription usually starts at a purine in <it>Arabidopsis </it>and rice <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. Subsequent positions did not show such an extreme bias towards purines. Our results suggest that most full-length cDNAs had one additional G at their 5' termini.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Base composition at each position in the 5'-end sequences</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="center">
                        <p>Position</p>
                     </c>
                     <c ca="center">
                        <p>A</p>
                     </c>
                     <c ca="center">
                        <p>G</p>
                     </c>
                     <c ca="center">
                        <p>C</p>
                     </c>
                     <c ca="center">
                        <p>T</p>
                     </c>
                     <c ca="center">
                        <p>%purine (A+G)</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>505 (2.7%)</p>
                     </c>
                     <c ca="center">
                        <p>16977 (90%)</p>
                     </c>
                     <c ca="center">
                        <p>350 (1.9%)</p>
                     </c>
                     <c ca="center">
                        <p>1011 (5.4%)</p>
                     </c>
                     <c ca="center">
                        <p>93</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>2</p>
                     </c>
                     <c ca="center">
                        <p>8185 (43%)</p>
                     </c>
                     <c ca="center">
                        <p>8613 (46%)</p>
                     </c>
                     <c ca="center">
                        <p>1066 (5.7%)</p>
                     </c>
                     <c ca="center">
                        <p>979 (5.2%)</p>
                     </c>
                     <c ca="center">
                        <p>89</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>3</p>
                     </c>
                     <c ca="center">
                        <p>5146 (27%)</p>
                     </c>
                     <c ca="center">
                        <p>5237 (28%)</p>
                     </c>
                     <c ca="center">
                        <p>2978 (16%)</p>
                     </c>
                     <c ca="center">
                        <p>5482 (29%)</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>5408 (29%)</p>
                     </c>
                     <c ca="center">
                        <p>5645 (30%)</p>
                     </c>
                     <c ca="center">
                        <p>3545 (19%)</p>
                     </c>
                     <c ca="center">
                        <p>4249 (23%)</p>
                     </c>
                     <c ca="center">
                        <p>59</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>5517 (29%)</p>
                     </c>
                     <c ca="center">
                        <p>5455 (29%)</p>
                     </c>
                     <c ca="center">
                        <p>3225 (17%)</p>
                     </c>
                     <c ca="center">
                        <p>4646 (25%)</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>5456 (29%)</p>
                     </c>
                     <c ca="center">
                        <p>3879 (21%)</p>
                     </c>
                     <c ca="center">
                        <p>3849 (20%)</p>
                     </c>
                     <c ca="center">
                        <p>5659 (30%)</p>
                     </c>
                     <c ca="center">
                        <p>50</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>The occurrence of each base was counted at each position.</p>
               </tblfn>
            </tbl>
            <p>To confirm the addition of G at the 5' end of full-length cDNA clones, we compared the 5'-terminal sequences of cDNA clones that encoded Cry j 2 with genomic sequences, including the promoter and coding region of <it>Cry j 2</it>, that we determined in a previous study <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. We found seven clones that encoded Cry j 2 among the 5'-end sequences of 18,843 clones. All seven of them were 5'-extended cDNAs, as compared with three Cry j 2 cDNAs that we generated previously <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. Five of the seven had one G, one clone had two Gs and one clone had TG at its 5' terminus; these nucleotides were not present in the corresponding region of <it>Cry j 2 </it>genes (data not shown). The comparison between these seven cDNAs and <it>Cry j 2 </it>genes indicated that full-length cDNAs had one or two additional nucleotides at their 5' termini, with most clones having a single G.</p>
            <p>We performed a BLASTX comparison of amino acids encoded by 5'-terminal sequences of 18,843 cDNA clones and proteins from <it>Arabidopsis</it>. Our analysis revealed that 9,208 ESTs (48.7%) exhibited strong homology to <it>Arabidopsis </it>proteins (<it>E</it>-value &lt; 1e-20). Among these ESTs, the starting positions in the alignment of 7,480 clones (81.2%) were upstream of the initiation codon of the corresponding protein. We also performed a BLASTN comparison with nucleotide sequences of protein-coding regions in <it>Arabidopsis</it>. We found that 2,131 ESTs (11.5%) derived from 5'-terminal sequences of cDNAs exhibited strong homology (<it>E</it>-value &lt; 1e-10) to protein-coding regions in the <it>Arabidopsis </it>genome. The starting positions in the alignment of 1,755 ESTs (82.4%) were upstream of the corresponding coding region of <it>Arabidopsis</it>. These results suggest that most of our cDNA clones are full-length cDNAs.</p>
         </sec>
         <sec>
            <st>
               <p>Classification of ESTs</p>
            </st>
            <p>The assembly of ESTs can be expected to generate an overestimate of the actual number of genes represented since failure of ESTs to assemble can result from alternate splicing, differences in usage of polyadenylation sites, sequence polymorphism, and sequencing errors. Levels of redundancy after EST assembly have been estimated to range from 20% to 22% in previous studies of EST collections <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. To reduce redundancy, we compared all consensus sequences (7,686 tentative contigs and 15,972 singletons) using BLASTN after assembly with PHRAP and then we grouped both 5'- and 3'-end sequences derived from the same respective clones together. Finally, our analysis indicated that ESTs derived from 19,437 clones could be grouped into 10,463 clusters as unique transcripts (Table <tblr tid="T1">1</tblr>; Additional file <supplr sid="S1">1</supplr>). The largest cluster contained 35 clones, but only 26 transcripts (0.25%) corresponded to more than 10 clones. We found that 6,320 of the 10,463 transcripts (60.4%) corresponded to only one clone and 2,078 clusters (19.9%) corresponded to two clones. The normalization step appeared to reduce the redundancy of our ESTs. Mitochondrial and chloroplast RNA sequences were not filtered, but they contributed one and three clones to the data set of 19,437 cDNA clones, respectively. No contamination by ribosomal RNA was detected in the data set. Figure <figr fid="F1">1</figr> shows the functional classification of the putative proteins encoded by our ESTs, which was based on assignments in the COG database <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. Of the putative proteins derived from individual transcripts, 7,339 (70.1%) were assigned to 26 putative functional categories by BLASTX (E-value &lt; 1e-5; Additional file <supplr sid="S2">2</supplr>).</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p><b>Classification and annotation of unique transcripts from male strobili of <it>Cryptomeria japonica</it></b>. Classification and annotation of unique transcripts from male strobili of <it>Cryptomeria japonica</it>. This file shows that the COG classification and Pfam families of unique transcripts from male strobili of <it>C. japonica</it>.</p>
               </text>
               <file name="1471-2164-9-383-S2.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Functional classification and relative levels, as percentages of unique transcripts in the pool, of ESTs derived from male strobili of <it>C. japonica</it></p>
               </caption>
               <text>
                  <p><b>Functional classification and relative levels, as percentages of unique transcripts in the pool, of ESTs derived from male strobili of <it>C. japonica</it></b>. We assigned 7,369 clusters by reference to databases of KOGs, TWOGs and LSEs using a BLAST-based algorithm (<it>E</it>-value &#8804; 10<sup>-5</sup>). Designations of functional categories: A, RNA processing and modification; B, chromatin structure and dynamics; C, energy production and conversion; D, cell cycle control and mitosis; E, amino acid transport and metabolism; F, nucleotide transport and metabolism; G, carbohydrate transport and metabolism; H, coenzyme transport and metabolism; I, lipid transport and metabolism; J, translation, ribosomal structure and biogenesis; K, transcription; L, replication and repair; M, cell wall/membrane/envelope biogenesis; N, cell motility; O, post-translational modification, protein turnover and chaperone functions; P, inorganic ion transport and metabolism; Q, secondary metabolites biosynthesis, transport and catabolism; T, signal transduction; U, intracellular trafficking, secretion, and vesicular transport; V, defense mechanisms; Y, nuclear structure; Z, cytoskeleton; R, prediction of general function only; S, function unknown; and X, unassigned.</p>
               </text>
               <graphic file="1471-2164-9-383-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Sequence comparisons with other species</p>
            </st>
            <p>We found that 8,059 (77.0%) of the 10,463 transcripts encoded peptides that were significantly similar to those in the UniProt database, 7,840 (74.9%) were similar to proteins in <it>Arabidopsis</it>, and 7,455 (71.3%) were similar to proteins in <it>japonica </it>rice at a BLASTX <it>E-</it>value of 1e-5 (Figure <figr fid="F2">2</figr>). When we compared our ESTs with sequences of ESTs from <it>Pinus, Picea </it>and <it>Populus </it>by TBLASTX, we found that 8,433 (80.6%), 8,400 (80.3%) and 7,763 (74.2%) transcripts included similar ESTs, respectively, at <it>E-</it>values of 1e-5. A larger overlap was found in the case of coniferous species (Figure <figr fid="F2">2</figr>). We also compared the 10,463 transcripts examined in the present study with ESTs that had been previously isolated from <it>C. japonica</it>, which included 10,328 ESTs derived from cambium and surrounding tissues, 3,677 ESTs from pollen, 3,222 ESTs from male strobili and 1,889 ESTs from other organs <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B26">26</abbr></abbrgrp>. Slightly more than half (5,792; 55.4%) of our transcripts did not exhibit any similarity (BLASTN <it>E</it>value &lt; 1e-10) to ESTs that had been previously isolated from <it>C. japonica</it>. Thus, the ESTs described in the present report contain novel information about protein-coding regions of the <it>C. japonica </it>genome.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Sequence similarities</p>
               </caption>
               <text>
                  <p><b>Sequence similarities</b>. Numbers of transcript sequences of transcripts from <it>C. japonica </it>male strobili similar to sequences in the Uniprot, pine, spruce, poplar, <it>Arabidopsis</it>, and rice databases according to BLASTX <it>E-</it>value cutoff values.</p>
               </text>
               <graphic file="1471-2164-9-383-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Functional analysis</p>
            </st>
            <p>We analyzed conserved domains using Pfam <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> as a database to predict the function of products of <it>C. japonica </it>transcripts. Overall, we found that 5,304 (50.7%) of the 10,463 transcripts encoded proteins similar to members of 1,664 Pfam protein families (E-value &lt; 1e-10). Among these Pfam families, 147 families are denoted "DUF, Domain of Unknown Function" and 26 families are denoted "UPF, Uncharacterized Protein Family". In total, products of 4,965 (47.5%) of the transcripts from <it>C. japonica </it>male strobili were similar to members of 1,481 Pfam families when DUFs and UPFs were excluded. We found that 4,850 (46.3%) transcripts belonged to only one Pfam family, 433 (4.1%) transcripts belonged to two families, 17 transcripts (0.16%) belonged to three families, and two transcripts (0.02%) belonged to four families (Additional file <supplr sid="S2">2</supplr>). The most abundant protein families in male strobili of <it>C. japonica </it>corresponded to those most strongly represented in the genome of <it>Arabidopsis </it>(Table <tblr tid="T3">3</tblr>). Similar earlier analyses indicated that these domains occur frequently in sugarcane and white spruce also <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B28">28</abbr></abbrgrp>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Occurrence of the 25 most common Pfam domains in the predicted proteins of unique transcripts from male strobili of <it>C. japonica</it></p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>Description of Pfam domain</p>
                     </c>
                     <c ca="left">
                        <p>Pfam accession</p>
                     </c>
                     <c ca="center">
                        <p>Number of <it>C. japonica </it>transcripts<sup>a</sup></p>
                     </c>
                     <c ca="center">
                        <p>Number of genes in <it>A. thaliana </it>genome<sup>b</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein kinase domain</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00069
                        </p>
                     </c>
                     <c ca="center">
                        <p>132</p>
                     </c>
                     <c ca="center">
                        <p>789</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cytochrome P450</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00067
                        </p>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                     <c ca="center">
                        <p>242</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RNA recognition motif.</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>(a.k.a. RRM, RBD, or RNP domain)</p>
                     </c>
                     <c ca="left">
                        <p>
                         PF00076
                        </p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                     <c ca="center">
                        <p>172</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>NAD-dependent epimerase/dehydratase family</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF01370
                        </p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                     <c ca="center">
                        <p>52</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Zinc finger; C3HC4 type (RING finger)</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00097
                        </p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GDSL-like lipase/acylhydrolase</p>
                     </c>
                     <c ca="left">
                        <p>
                        PF00657
                        </p>
                     </c>
                     <c ca="center">
                        <p>44</p>
                     </c>
                     <c ca="center">
                        <p>107</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Myb-like DNA-binding domain</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00249
                        </p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                     <c ca="center">
                        <p>53</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WD domain; G-beta repeat</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00400
                        </p>
                     </c>
                     <c ca="center">
                        <p>41</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ras family</p>
                     </c>
                     <c ca="left">
                        <p>
                          PF00071
                        </p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>73</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>UDP-glucuronosyl and UDP-glycosyl transferase</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00201
                        </p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>86</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Short-chain dehydrogenase</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00106
                        </p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Alpha/beta hydrolase fold</p>
                     </c>
                     <c ca="left">
                        <p>
                          PF00561
                        </p>
                     </c>
                     <c ca="center">
                        <p>34</p>
                     </c>
                     <c ca="center">
                        <p>37</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sugar (and other) transporter</p>
                     </c>
                     <c ca="left">
                        <p>
                           PF00083
                        </p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2OG-Fe(II) oxygenase superfamily</p>
                     </c>
                     <c ca="left">
                        <p>PF03171</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>105</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Core histone H2A/H2B/H3/H4</p>
                     </c>
                     <c ca="left">
                        <p>PF00125</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Peroxidase</p>
                     </c>
                     <c ca="left">
                        <p>PF00141</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>82</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>DnaJ domain</p>
                     </c>
                     <c ca="left">
                        <p>PF00226</p>
                     </c>
                     <c ca="center">
                        <p>32</p>
                     </c>
                     <c ca="center">
                        <p>88</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Eukaryotic aspartyl protease</p>
                     </c>
                     <c ca="left">
                        <p>PF00026</p>
                     </c>
                     <c ca="center">
                        <p>31</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Aldo/keto reductase family</p>
                     </c>
                     <c ca="left">
                        <p>PF00248</p>
                     </c>
                     <c ca="center">
                        <p>30</p>
                     </c>
                     <c ca="center">
                        <p>21</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Transferase family</p>
                     </c>
                     <c ca="left">
                        <p>PF02458</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Alcohol dehydrogenase, GroES-like domain</p>
                     </c>
                     <c ca="left">
                        <p>PF08240</p>
                     </c>
                     <c ca="center">
                        <p>28</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Mitochondrial carrier protein</p>
                     </c>
                     <c ca="left">
                        <p>PF00153</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>55</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Ubiquitin-conjugating enzyme</p>
                     </c>
                     <c ca="left">
                        <p>PF00179</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>41</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Zinc-binding dehydrogenase</p>
                     </c>
                     <c ca="left">
                        <p>PF00107</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>39</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AMP-binding enzyme</p>
                     </c>
                     <c ca="left">
                        <p>PF00501</p>
                     </c>
                     <c ca="center">
                        <p>26</p>
                     </c>
                     <c ca="center">
                        <p>45</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>Protein families were identified by BLASTX searches with an E-value &lt; 1e-10 in the Pfam database.</p>
                  <p><sup>b </sup>Protein families were assigned on the basis of data in TAIR.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Similarity of products of ESTs to stamen- or male gametophyte-specific proteins of <it>Arabidopsis</it></p>
            </st>
            <p>We compared our deduced proteins with stamen and male gametophyte-specific proteins in <it>Arabidopsis</it>. Referring to previous studies of <it>Arabidopsis</it>, we identified amino acid sequences encoded by 1,145 stamen-specific transcripts and by 1,274 male gametophyte-specific transcripts in the <it>Arabidopsis </it>database <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. We found that 754 (65.9%) stamen-specific transcripts and 617 (48.4%) male gametophyte-specific transcripts of <it>Arabidopsis </it>resembled 509 and 429 transcripts, respectively, from male strobili of <it>C. japonica</it>, at a TBLASTX <it>E-</it>value of 1e-10 (Additional files <supplr sid="S3">3</supplr> and <supplr sid="S4">4</supplr>). Table <tblr tid="T4">4</tblr> shows the Pfam domains that are found in stamen- and male gametophyte-specific genes of <it>Arabidopsis </it>and the numbers of transcripts that correspond to the families of genes in male strobili of <it>C. japonica</it>. Our results suggest that homologs of stamen- and pollen-specific transcripts of <it>Arabidopsis </it>are expressed in the male strobili of <it>C. japonica</it>.</p>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p><b>List of stamen-specific transcripts from <it>Arabidopsis </it>and corresponding ESTs from male strobili of <it>Cryptomeria japonica</it></b>. List of stamen-specific transcripts from <it>Arabidopsis </it>and corresponding ESTs from male strobili of <it>Cryptomeria japonica </it>Description of data: This file shows that the list of stamen-specific transcripts from <it>Arabidopsis </it>and their homologs derived from ESTs from male strobili of <it>C. japonica</it>.</p>
               </text>
               <file name="1471-2164-9-383-S3.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S4">
               <title>
                  <p>Additional File 4</p>
               </title>
               <text>
                  <p><b>List of male gametophyte-specific transcripts from <it>Arabidopsis </it>and corresponding ESTs from male strobili of <it>Cryptomeria japonica</it></b>. List of male gametophyte-specific transcripts from <it>Arabidopsis </it>and corresponding ESTs from male strobili of <it>Cryptomeria japonica</it>. This file shows that the list of male gametophyte-specific transcripts from <it>Arabidopsis </it>and their homologs derived from ESTs from male strobili of <it>C. japonica</it>.</p>
               </text>
               <file name="1471-2164-9-383-S4.xls">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <tbl id="T4">
               <title>
                  <p>Table 4</p>
               </title>
               <caption>
                  <p>Pfam domains found in transcripts of both <it>A. thaliana </it>stamen- or male gametophyte-specific genes and in transcripts from male strobili of <it>C. japonica</it></p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>Description of Pfam domain</p>
                     </c>
                     <c ca="center">
                        <p>Pfam accession number</p>
                     </c>
                     <c ca="center">
                        <p>Number of transcripts in <it>C. japonica </it>male strobili</p>
                     </c>
                     <c ca="center">
                        <p>Number of <it>A. thaliana </it>stamen- specific transcripts<sup>a</sup></p>
                     </c>
                     <c ca="center">
                        <p>Number of <it>A. thaliana </it>male gametophyte-specific transcripts<sup>b</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein kinase domain</p>
                     </c>
                     <c ca="center">
                        <p>PF00069</p>
                     </c>
                     <c ca="center">
                        <p>132</p>
                     </c>
                     <c ca="center">
                        <p>35</p>
                     </c>
                     <c ca="center">
                        <p>39</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plant invertase/pectin methylesterase inhibitor</p>
                     </c>
                     <c ca="center">
                        <p>PF04043</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>22</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Protein tyrosine kinase</p>
                     </c>
                     <c ca="center">
                        <p>PF07714</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GDSL-like lipase/acylhydrolase</p>
                     </c>
                     <c ca="center">
                        <p>PF00657</p>
                     </c>
                     <c ca="center">
                        <p>44</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pectinesterase</p>
                     </c>
                     <c ca="center">
                        <p>PF01095</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>15</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ABC transporter</p>
                     </c>
                     <c ca="center">
                        <p>PF00005</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Cytochrome P450</p>
                     </c>
                     <c ca="center">
                        <p>PF00067</p>
                     </c>
                     <c ca="center">
                        <p>120</p>
                     </c>
                     <c ca="center">
                        <p>13</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Glycosyl hydrolase family 28</p>
                     </c>
                     <c ca="center">
                        <p>PF00295</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sodium/hydrogen exchanger family</p>
                     </c>
                     <c ca="center">
                        <p>PF00999</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ABC-2 type transporter</p>
                     </c>
                     <c ca="center">
                        <p>PF01061</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Oleosin</p>
                     </c>
                     <c ca="center">
                        <p>PF01277</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>No apical meristem (NAM) protein</p>
                     </c>
                     <c ca="center">
                        <p>PF02365</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                     <c ca="center">
                        <p>9</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>RNA recognition motif.</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>(a.k.a. RRM, RBD, or RNP domain)</p>
                     </c>
                     <c ca="center">
                        <p>PF00076</p>
                     </c>
                     <c ca="center">
                        <p>78</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Sugar (and other) transporter</p>
                     </c>
                     <c ca="center">
                        <p>PF00083</p>
                     </c>
                     <c ca="center">
                        <p>33</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Calcineurin-like phosphoesterase</p>
                     </c>
                     <c ca="center">
                        <p>PF00149</p>
                     </c>
                     <c ca="center">
                        <p>25</p>
                     </c>
                     <c ca="center">
                        <p>8</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>E1-E2 ATPase</p>
                     </c>
                     <c ca="center">
                        <p>PF00122</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Multicopper oxidase</p>
                     </c>
                     <c ca="center">
                        <p>PF00394</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Haloacid dehalogenase-like hydrolase</p>
                     </c>
                     <c ca="center">
                        <p>PF00702</p>
                     </c>
                     <c ca="center">
                        <p>19</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Multicopper oxidase</p>
                     </c>
                     <c ca="center">
                        <p>PF07731</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Multicopper oxidase</p>
                     </c>
                     <c ca="center">
                        <p>PF07732</p>
                     </c>
                     <c ca="center">
                        <p>17</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Glycosyl hydrolase family 1</p>
                     </c>
                     <c ca="center">
                        <p>PF00232</p>
                     </c>
                     <c ca="center">
                        <p>20</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pectate lyase</p>
                     </c>
                     <c ca="center">
                        <p>PF00544</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>FAD-binding domain</p>
                     </c>
                     <c ca="center">
                        <p>PF01565</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Galactose-binding lectin domain</p>
                     </c>
                     <c ca="center">
                        <p>PF02140</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>MtN3/saliva family</p>
                     </c>
                     <c ca="center">
                        <p>PF03083</p>
                     </c>
                     <c ca="center">
                        <p>14</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>1,145 transcripts were picked up on the basis of previous report by Wellmer et al. (2004) <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
                  <p><sup>b </sup>1,274 transcripts were picked up on the basis of previous report by Honys and Twell (2004) <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Similarity of the deduced proteins to pollen allergens</p>
            </st>
            <p>We compared the peptides encoded by all the ESTs with known plant allergens in Allergome <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. Table <tblr tid="T5">5</tblr> shows a list of pollen allergens that are similar to the deduced proteins encoded by ESTs derived from <it>C. japonica </it>male strobili. Proteins encoded by 180 transcripts (1.7%) exhibited partial sequence homology to plant allergens, which included known allergens of <it>C. japonica</it>. This result suggests the possibility that unidentified allergens in <it>C. japonica </it>might exist in our EST collection.</p>
            <tbl id="T5">
               <title>
                  <p>Table 5</p>
               </title>
               <caption>
                  <p>Products of ESTs that resemble pollen allergens</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c ca="left">
                        <p>Accession No.<sup>a</sup></p>
                     </c>
                     <c ca="left">
                        <p>Allergen</p>
                     </c>
                     <c ca="left">
                        <p>Species</p>
                     </c>
                     <c ca="left">
                        <p>Putative product</p>
                     </c>
                     <c ca="left">
                        <p>E-value<sup>b</sup></p>
                     </c>
                     <c ca="center">
                        <p>No.<sup>c</sup></p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY894724</p>
                     </c>
                     <c ca="left">
                        <p>Cry j 1</p>
                     </c>
                     <c ca="left">
                        <p><it>Cryptomeria japonica </it>(Sugi)</p>
                     </c>
                     <c ca="left">
                        <p>Pectate lyase</p>
                     </c>
                     <c ca="left">
                        <p>1E-127</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY895894</p>
                     </c>
                     <c ca="left">
                        <p>Cry j 2</p>
                     </c>
                     <c ca="left">
                        <p><it>Cryptomeria japonica </it>(Sugi)</p>
                     </c>
                     <c ca="left">
                        <p>Polymethylgalacturonase</p>
                     </c>
                     <c ca="left">
                        <p>1E-119</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY891770</p>
                     </c>
                     <c ca="left">
                        <p>Cry j 3.8</p>
                     </c>
                     <c ca="left">
                        <p><it>Cryptomeria japonica </it>(Sugi)</p>
                     </c>
                     <c ca="left">
                        <p>PR-5 protein</p>
                     </c>
                     <c ca="left">
                        <p>3E-98</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY896705</p>
                     </c>
                     <c ca="left">
                        <p>CJP-4</p>
                     </c>
                     <c ca="left">
                        <p><it>Cryptomeria japonica </it>(Sugi)</p>
                     </c>
                     <c ca="left">
                        <p>Class IV chitinase</p>
                     </c>
                     <c ca="left">
                        <p>1E-112</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY888350</p>
                     </c>
                     <c ca="left">
                        <p>CJP-6</p>
                     </c>
                     <c ca="left">
                        <p><it>Cryptomeria japonica </it>(Sugi)</p>
                     </c>
                     <c ca="left">
                        <p>Isoflavone reductase family</p>
                     </c>
                     <c ca="left">
                        <p>4E-87</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY894232</p>
                     </c>
                     <c ca="left">
                        <p>Jun o 4</p>
                     </c>
                     <c ca="left">
                        <p><it>Juniperus oxycedrus </it>L. (Prickly juniper)</p>
                     </c>
                     <c ca="left">
                        <p>Calcium-binding protein</p>
                     </c>
                     <c ca="left">
                        <p>2E-67</p>
                     </c>
                     <c ca="center">
                        <p>16</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY912188</p>
                     </c>
                     <c ca="left">
                        <p>Amb a 3</p>
                     </c>
                     <c ca="left">
                        <p><it>Ambrosia artemisiifolia </it>L. (Ragweed)</p>
                     </c>
                     <c ca="left">
                        <p>Plastocyanin-like protein</p>
                     </c>
                     <c ca="left">
                        <p>2E-8</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY881070</p>
                     </c>
                     <c ca="left">
                        <p>Cat r 1</p>
                     </c>
                     <c ca="left">
                        <p><it>Catharanthus roseus </it>(L.) G. Don (Madagascar periwinkle)</p>
                     </c>
                     <c ca="left">
                        <p>Cyclophilin</p>
                     </c>
                     <c ca="left">
                        <p>9E-79</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY882008</p>
                     </c>
                     <c ca="left">
                        <p>Che a 1</p>
                     </c>
                     <c ca="left">
                        <p><it>Chenopodium album </it>L. (Lamb's-quarters)</p>
                     </c>
                     <c ca="left">
                        <p>Trypsin inhibitor</p>
                     </c>
                     <c ca="left">
                        <p>3E-28</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY911759</p>
                     </c>
                     <c ca="left">
                        <p>Cor a 1.04</p>
                     </c>
                     <c ca="left">
                        <p><it>Corylus avellana </it>L. (Hazel)</p>
                     </c>
                     <c ca="left">
                        <p>PR-10 protein</p>
                     </c>
                     <c ca="left">
                        <p>2E-14</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY896550</p>
                     </c>
                     <c ca="left">
                        <p>Cor a 10</p>
                     </c>
                     <c ca="left">
                        <p><it>Corylus avellana </it>L. (Hazel)</p>
                     </c>
                     <c ca="left">
                        <p>Luminal-binding protein</p>
                     </c>
                     <c ca="left">
                        <p>1E-107</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY882008</p>
                     </c>
                     <c ca="left">
                        <p>Cro s 1</p>
                     </c>
                     <c ca="left">
                        <p><it>Crocus sativus </it>L. (Saffron)</p>
                     </c>
                     <c ca="left">
                        <p>LAT52 protein</p>
                     </c>
                     <c ca="left">
                        <p>3E-13</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY883628</p>
                     </c>
                     <c ca="left">
                        <p>Cyn d 22</p>
                     </c>
                     <c ca="left">
                        <p><it>Cynodon dactylon </it>(L.) Pers. (Bermuda grass)</p>
                     </c>
                     <c ca="left">
                        <p>Enolase</p>
                     </c>
                     <c ca="left">
                        <p>9E-25</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY893554</p>
                     </c>
                     <c ca="left">
                        <p>Cyn d 24</p>
                     </c>
                     <c ca="left">
                        <p><it>Cynodon dactylon </it>(L.) Pers. (Bermuda grass)</p>
                     </c>
                     <c ca="left">
                        <p>PR-1 protein</p>
                     </c>
                     <c ca="left">
                        <p>2E-33</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY895449</p>
                     </c>
                     <c ca="left">
                        <p>Hum j Profilin</p>
                     </c>
                     <c ca="left">
                        <p><it>Humulus japonicus </it>Siebold &amp; Zucc. (Japanese hop)</p>
                     </c>
                     <c ca="left">
                        <p>Profilin</p>
                     </c>
                     <c ca="left">
                        <p>6E-63</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY899168</p>
                     </c>
                     <c ca="left">
                        <p>Hum j 1</p>
                     </c>
                     <c ca="left">
                        <p><it>Humulus japonicus </it>Siebold &amp; Zucc. (Japanese hop)</p>
                     </c>
                     <c ca="left">
                        <p>Uncharacterized protein</p>
                     </c>
                     <c ca="left">
                        <p>1E-10</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY892250</p>
                     </c>
                     <c ca="left">
                        <p>Lol p 1</p>
                     </c>
                     <c ca="left">
                        <p><it>Lolium perenne </it>L. (perennial ryegrass)</p>
                     </c>
                     <c ca="left">
                        <p>Expansin</p>
                     </c>
                     <c ca="left">
                        <p>6E-14</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY896420</p>
                     </c>
                     <c ca="left">
                        <p>Ole e 5</p>
                     </c>
                     <c ca="left">
                        <p><it>Olea europaea </it>L. (Olive)</p>
                     </c>
                     <c ca="left">
                        <p>Superoxide dimutase</p>
                     </c>
                     <c ca="left">
                        <p>7E-66</p>
                     </c>
                     <c ca="center">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY894301</p>
                     </c>
                     <c ca="left">
                        <p>Ole e 9</p>
                     </c>
                     <c ca="left">
                        <p><it>Olea europaea </it>L. (Olive)</p>
                     </c>
                     <c ca="left">
                        <p>&#946;-1,3-glucanase</p>
                     </c>
                     <c ca="left">
                        <p>2E-46</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY905708</p>
                     </c>
                     <c ca="left">
                        <p>Ole e 10</p>
                     </c>
                     <c ca="left">
                        <p><it>Olea europaea </it>L. (Olive)</p>
                     </c>
                     <c ca="left">
                        <p>Glycosyl hydrolase</p>
                     </c>
                     <c ca="left">
                        <p>4E-23</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY889873</p>
                     </c>
                     <c ca="left">
                        <p>Sal k 1.03</p>
                     </c>
                     <c ca="left">
                        <p><it>Salsola kali </it>L. (Russian-thistle)</p>
                     </c>
                     <c ca="left">
                        <p>pectin esterase</p>
                     </c>
                     <c ca="left">
                        <p>2E-35</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>BY911213</p>
                     </c>
                     <c ca="left">
                        <p>Sal k 2</p>
                     </c>
                     <c ca="left">
                        <p><it>Salsola kali </it>L. (Russian-thistle)</p>
                     </c>
                     <c ca="left">
                        <p>Protein kinase</p>
                     </c>
                     <c ca="left">
                        <p>6E-47</p>
                     </c>
                     <c ca="center">
                        <p>53</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p><sup>a </sup>The accession number of an EST that exhibited strongest similarity.</p>
                  <p><sup>b </sup>The <it>E-</it>value of an EST that exhibited strongest similarity.</p>
                  <p><sup>c </sup>Total number of transcripts whose products were similar to the respective allergens (BLASTX score > 50 and <it>E-</it>value &lt;10<sup>-7</sup>).</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Families of putative transcription factors</p>
            </st>
            <p>Transcription factors play important roles in the formation of floral tissues throughout organogenesis. To identify transcription factors, we first identified 37 Pfam domains that encode transcription factors by comparing data in the AtTFDB (<it>Arabidopsis thaliana </it>transcription factor database) <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> with the domain annotations of <it>Arabidopsis </it>proteins. As a result of a search for sequence similarity between ESTs from <it>C. japonica </it>male strobili and the 37 identified Pfam domains, we identified a total of 207 (2.0%) unique transcripts that encoded putative transcription factors, which could be assigned to 29 protein families (Table <tblr tid="T6">6</tblr>). Transcripts whose products resembled the C3HC4 zinc finger domain, MYB-like transcription factor, and AP2 were abundant in the male strobili of <it>C. japonica</it>.</p>
            <tbl id="T6">
               <title>
                  <p>Table 6</p>
               </title>
               <caption>
                  <p>Identification of transcripts encoding putative transcription factors in male strobilus of <it>C. japonica</it></p>
               </caption>
               <tblbdy cols="3">
                  <r>
                     <c ca="left">
                        <p>Description of Pfam domains</p>
                     </c>
                     <c ca="left">
                        <p>Pfam accession</p>
                     </c>
                     <c ca="center">
                        <p>Number of <it>C. japonica </it>male strobilus transcripts</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="3">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Zinc finger, C3HC4 type (RING finger)</p>
                     </c>
                     <c ca="left">
                        <p>PF00097</p>
                     </c>
                     <c ca="center">
                        <p>48</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Myb-like DNA-binding domain</p>
                     </c>
                     <c ca="left">
                        <p>PF00249</p>
                     </c>
                     <c ca="center">
                        <p>43</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>AP2 domain</p>
                     </c>
                     <c ca="left">
                        <p>PF00847</p>
                     </c>
                     <c ca="center">
                        <p>18</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>No apical meristem (NAM) protein; NAC domain</p>
                     </c>
                     <c ca="left">
                        <p>PF02365</p>
                     </c>
                     <c ca="center">
                        <p>12</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Homeobox domain</p>
                     </c>
                     <c ca="left">
                        <p>PF00046</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>PHD-finger</p>
                     </c>
                     <c ca="left">
                        <p>PF00628</p>
                     </c>
                     <c ca="center">
                        <p>11</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SRF-type transcription factor; MADS box</p>
                     </c>
                     <c ca="left">
                        <p>PF00319</p>
                     </c>
                     <c ca="center">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Histone-like transcription factor (CBF/NF-Y) and archaeal histone</p>
                     </c>
                     <c ca="left">
                        <p>PF00808</p>
                     </c>
                     <c ca="center">
                        <p>7</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Helix-loop-helix DNA-binding domain</p>
                     </c>
                     <c ca="left">
                        <p>PF00010</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>HSF-type DNA-binding</p>
                     </c>
                     <c ca="left">
                        <p>PF00447</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B-box zinc finger</p>
                     </c>
                     <c ca="left">
                        <p>PF00643</p>
                     </c>
                     <c ca="center">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Dof domain, zinc finger</p>
                     </c>
                     <c ca="left">
                        <p>PF02701</p>
                     </c>
                     <c ca="center">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>WRKY DNA-binding domain</p>
                     </c>
                     <c ca="left">
                        <p>PF03106</p>
                     </c>
                     <c ca="center">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Response regulator receiver domain</p>
                     </c>
                     <c ca="left">
                        <p>PF00072</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>bZIP transcription factor</p>
                     </c>
                     <c ca="left">
                        <p>PF00170</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GATA zinc finger</p>
                     </c>
                     <c ca="left">
                        <p>PF00320</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>B3 DNA binding domain; ABI3/VP1 transcription factor</p>
                     </c>
                     <c ca="left">
                        <p>PF02362</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>SBP domain</p>
                     </c>
                     <c ca="left">
                        <p>PF03110</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GRAS family transcription factor</p>
                     </c>
                     <c ca="left">
                        <p>PF03514</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ZF-HD protein dimerisation region</p>
                     </c>
                     <c ca="left">
                        <p>PF04770</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CCT motif; CO-like protein</p>
                     </c>
                     <c ca="left">
                        <p>PF06203</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Auxin response factor</p>
                     </c>
                     <c ca="left">
                        <p>PF06507</p>
                     </c>
                     <c ca="center">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>ARID/BRIGHT DNA binding domain</p>
                     </c>
                     <c ca="left">
                        <p>PF01388</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CCAAT-binding transcription factor (CBF-B/NF-YA) subunit B</p>
                     </c>
                     <c ca="left">
                        <p>PF02045</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>TCP family transcription factor</p>
                     </c>
                     <c ca="left">
                        <p>PF03634</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>CG-1 domain; CAMTA protein</p>
                     </c>
                     <c ca="left">
                        <p>PF03859</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>YABBY protein</p>
                     </c>
                     <c ca="left">
                        <p>PF04690</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Plant protein of unknown function; BZR1/LAT61 family</p>
                     </c>
                     <c ca="left">
                        <p>PF05687</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Whirly transcription factor</p>
                     </c>
                     <c ca="left">
                        <p>PF08536</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Identification and phylogenetic analysis of MADS-box genes</p>
            </st>
            <p>Among the putative transcription factors, ten transcripts encoded MADS-box proteins (Table <tblr tid="T6">6</tblr>). In addition, we found another two transcripts whose products were similar to MADS-box proteins with <it>E-</it>values &lt; 1e-8. We determined the entire sequence of representative cDNA derived from each transcript (accession numbers <ext-link ext-link-type="gen" ext-link-id="AB359027">AB359027</ext-link> through <ext-link ext-link-type="gen" ext-link-id="AB359038">AB359038</ext-link>). Figure <figr fid="F3">3</figr> shows the phylogenetic tree constructed from an alignment of the sequences of MADS, I-, and K-domains. One cDNA clone (CMFL009_N22) did not encode a K-domain and it was classified as the cDNA of a type I MADS-box gene. Six of twelve <it>C. japonica </it>MADS-box genes were classified as <it>DEF/GLO/GGM13</it>-like genes and two genes were categorized as <it>TM8</it>-like genes. The other three genes were categorized as <it>AG-, AGL6</it>-, and <it>TM3</it>-like genes, respectively (Figure <figr fid="F3">3</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>A phylogenetic tree based on MADS, I- and K-domains of deduced proteins</p>
               </caption>
               <text>
                  <p><b>A phylogenetic tree based on MADS, I- and K-domains of deduced proteins</b>. The local bootstrap probability is shown on branches where available. This tree is an unrooted tree. MIKC<sup>C</sup>-type genes are divided into 13 subfamilies, including gymnosperm-specific <it>DAL10</it>-like subfamily, on the basis of this tree. The genus from which each transcript was isolated is indicated after the deduced protein name. Deduced proteins from <it>C. japonica </it>that were identified in the present study are indicated in red. Deduced proteins from gymnosperms other than <it>C. japonica </it>are indicated in blue, and deduced proteins from basal angiosperms are indicated in green.</p>
               </text>
               <graphic file="1471-2164-9-383-3"/>
            </fig>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>The genomes of gymnosperm species are usually large and replete with highly repetitive sequences. The haploid DNA content of <it>C. japonica </it>was estimated to correspond to 11 pg of DNA <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>. This value is about half of the mean value for Pinaceae species <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>, but about eighty times that of the <it>Arabidopsis </it>genome. EST analysis is a low-cost approach to characterization of the coding component of the genome. ESTs derived from several tissues of <it>C. japonica </it>were analyzed previously <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr><abbr bid="B35">35</abbr></abbrgrp>. However, most of the previously analyzed ESTs were not full-length. Full-length cDNAs are essential for the determination of sites of initiation of transcription and the functional analysis of genes <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. In the present study, we constructed a full-length enriched cDNA library from <it>C. japonica </it>male strobili at various stages of development. To our knowledge, this is the first report of a full-length enriched cDNA library from a gymnosperm. In fact, the information of 6,464 Sitka spruce full-length cDNA sequences were registered in GenBank (accession numbers <ext-link ext-link-type="gen" ext-link-id="EF081469">EF081469</ext-link> to <ext-link ext-link-type="gen" ext-link-id="EF087932">EF087932</ext-link>), but it has not been published. These full-length cDNAs should be a valuable tool for gene discovery and analysis in gymnosperms. We found that 90% of our cDNA clones included G at the 5' ends (Table <tblr tid="T2">2</tblr>). The G at the 5' end of full-length cDNA can be explained by the addition of C to the 3' end of full-length first-strand cDNA in a non-template-directed manner by the reverse transcriptase <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Our observation of bias towards G at the 5' end of cDNAs is consistent with the addition of C to the 3' end of full-length cDNAs. The sequences of 5' ends imply that our cDNA library had been enriched for full-length cDNA clones. We also found that the second nucleotide of most cDNAs at their 5' termini was a purine nucleotide, which is common at sites of initiation of transcription in <it>Arabidopsis </it>and rice <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. This result suggests that a purine nucleotide is common at sites of initiation of transcription in <it>C. japonica </it>as well as <it>Arabidopsis </it>and rice. However, further studies about the comparison between genome sequences and the full-length cDNAs are needed to identify the sequence characteristics around the transcription start site in <it>C. japonica</it>.</p>
         <p>We isolated 36,011 ESTs from either one or both ends of 19,437 cDNA clones. These ESTs were clustered into 23,658 consensus sequences (7,686 tentative contigs and 15,972 singletons) by PHRAP. The redundancy is extremely low compared with assemblies of ESTs derived from other conifers. In previous studies, the assembly of 49,101 ESTs derived from 16 cDNA libraries from different tissues of white spruce resulted in 16,578 consensus sequences <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>, and that of 59,797 ESTs from wood-forming tissues of loblolly pine represented 20,377 consensus sequences <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>. The normalization step in the generation of our cDNA library contributed the high rate of gene discovery. We obtained 10,463 transcripts after a round-robin BLASTN search and clustering of 5'- and 3'-end sequences derived from the same respective clones. We found that 1,317 transcripts (12.6%) did not exhibit any similarity in terms of predicted amino acid sequences to those encoded by <it>Arabidopsis </it>and rice genomes or to those derived from <it>Pinus, Picea </it>and <it>Populus </it>ESTs using BLASTX and TBLASTX (<it>E-</it>value > 1e-5). These results indicate that the ESTs generated in the present study have added to the known complement of gymnosperm transcripts.</p>
         <p>We classified and annotated the transcripts isolated in this study (Fig. <figr fid="F1">1</figr>; Table <tblr tid="T3">3</tblr>; Additional file <supplr sid="S2">2</supplr>). In our functional classification, categories with no concrete assignment, such as "prediction of general function only", "function unknown" and "unassigned", accounted for a large fraction of transcripts (Fig. <figr fid="F1">1</figr>). Among assigned pollen transcripts of <it>C. japonica</it>, close to 31% of the predicted proteins are involved in the synthesis and modification of proteins, which are categorized as J and O <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. However, the percentage in those categories was only approximately 14% in the present study. The difference might be due to the large population of transcripts in the present study and/or to the specific characteristics of the male strobili of <it>C. japonica</it>. Table <tblr tid="T3">3</tblr> shows the functional annotation of the most abundant families identified in this study. Protein kinase, cytochrome P450, and the RNA recognition motif were the most frequent domains. These domains are also present at relatively high levels in deduced products of the genome of <it>Arabidopsis</it>. We also found that the numbers of transcripts that encoded certain protein families or domains, such as NAD-dependent epimerase/dehydratase, the C3HC4 type zinc finger, the WD domain, aspartyl proteases and aldo/keto reductases, were larger than those that encoded the corresponding protein families or domains in the <it>Arabidopsis </it>genome. In general, the degree of complexity of multigene families seems to be correlated with the size of the plant genome. Southern hybridization patterns and comparative sequence analysis of conserved ortholog sets suggested that genomes of gymnosperms include complex families of genes <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp>. Our results suggest the increased complexity of gene families in <it>C. japonica </it>as compared to <it>Arabidopsis</it>. Since the number of cDNA clones in the present study was insufficient to allow us to prove this hypothesis and since the complete sequence of each clone has not been determined, further studies are needed to clarify the complexity of gene families in <it>C. japonica</it>.</p>
         <p>We also searched potential candidates for novel allergens in <it>C. japonica</it>. In previous study, we analyzed similarity of the deduced proteins encoded by ESTs derived from <it>C. japonica </it>pollen and known plant allergens, and found cDNA clones that encoded proteins similar to six types of pollen allergens, eight types of food allergens and three types of latex allergens <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. In the present study, we found 180 transcripts that encoded proteins similar to 22 pollen allergens including all of five known allergens in <it>C. japonica </it>(Table <tblr tid="T5">5</tblr>). The major allergens in <it>C. japonica </it>are known to have similarity to pollen allergens of other plants <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. The existence of numerous allergens in <it>C. japonica </it>pollen has been suggested but only a few antigens have been identified. These newly deduced proteins in <it>C. japonica </it>are potential candidates for novel plant allergens.</p>
         <p>In this study, we identified 207 transcripts that encoded Pfam domains of transcription factors, including those encoded by MADS-box genes. In plants, MADS-box genes are involved in various aspects of vegetative and reproductive growth, including the morphogenesis of flowers. Plant MADS-box genes are classified into types I and II <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Most of plant type II genes have three more domains than type I, namely, an intervening (I) domain, a keratin-like (K) domain, and a C-terminal (C) domain. The plant type II genes are divided into two types based on the intron structure, namely, MIKC<sup>C</sup>- and MIKC*-type genes <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. The MIKC<sup>C</sup>-type genes can be subdivided into several well-defined gene clades, known as 'subfamilies' <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. We found twelve MADS-box genes derived from transcripts from male strobili of <it>C. japonica</it>, one of which was a type I gene while the other eleven were MIKC<sup>C</sup>-type genes. The MIKC<sup>C</sup>-type genes in <it>C. japonica </it>could be subdivided into five subfamilies: <it>DEF/GLO/GGM13-, TM8-, AG-, AGL6-</it>, and <it>TM3</it>-like genes. To our knowledge, this is the first report to identify type I genes and <it>TM8</it>-like genes in a gymnosperm. Functional characterization of type I genes in plants has been very limited to date. The type I gene in <it>C. japonica </it>(CMFL_009_N22) was most similar to <it>AGL80</it>, which is required for development of the central cell of the female gametophyte <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>. The functions of type I genes in reproductive organs in <it>C. japonica </it>remain to be determined.</p>
         <p>The MADS-box gene family has been subjected to a model of birth-and death evolution <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. <it>TM8</it>-like genes have been found in rosids (Cucurbitaceae) and asterids (Solanaceae) but not in <it>Arabidopsis </it><abbrgrp><abbr bid="B43">43</abbr></abbrgrp>. A recent review noted that <it>TM8</it>-like genes exist in the basalmost angiosperms (Amborellaceae) and in magnoliids (Lauraceae) <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. We found evidence for at least one <it>TM8</it>-like gene in <it>Cryptomeria </it>(Cupressaceae sensu lato). None has been reported in Pinaceae to date, to our knowledge. These results suggest that <it>TM8</it>-like genes were established in the common ancestor of angiosperms and gymnosperms and that they have been lost relatively recently in some lineages. It seems likely that the rate of birth-and-death evolution of <it>TM8</it>-like genes has been high.</p>
         <p>We identified six <it>DEF/GLO/GGM13-</it>like genes in this study. <it>DEF</it>- and <it>GLO</it>-like genes are known as B-class MADS genes, which are key regulators of petal and stamen specification in model species of angiosperms, such as <it>Arabidopsis thaliana, Antirrhinum majus</it>, and <it>Petunia hybrida. GGM13-</it>like genes are the sister lineage of the B-class genes and are, hence, also known as Bsister (Bs) genes <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. Bs genes are expressed mainly in female reproductive organs and a member of the Bs gene family is involved in development of the seed coat <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>. B-class and Bs genes were treated as one clade in the present study. Three genes (CMFL_009_E12, CMFL_023_G10, CMFL_052_L02) seem to be ancestral <it>DEF</it>- and <it>GLO</it>-like genes, while the other three genes (CMFL_007_I11, CMFL_018_D08, CMFL_046_H20) seem to constitute a sister clade of B-class and Bs genes. Several gene duplications appear to have occurred in the lineage to <it>Cryptomeria </it>after it diverged from other conifers.</p>
         <p>The phylogenetic analysis of MADS-box genes in the present study has implications for the evolution of gymnosperms. The phylogenetic tree of <it>TM3</it>-like genes indicates that the Gnetales and the Pinaceae are nested as sister groups and that <it>Cryptomeria </it>is a sister of the Gnetales plus Pinaceae clade. A similar result was obtained for <it>AG</it>-like genes, but the bootstrap support was not very strong. Our data support those in prior multigene phylogenetic studies that suggested that all conifers, with the exception of Pinaceae, are a sister clade of the Gnetales plus Pinaceae clade <abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>. The evolution and divergence of the MADS-box family of genes has been studied extensively. However, most of the studied MADS-box genes in gymnosperms were derived from <it>Gnetum, Pinus</it>, and <it>Picea </it><abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr></abbrgrp>. The identification and characterization of MADS-box genes in <it>Cryptomeria </it>and other gymnosperms should help us to understand the evolution of the structure of MADS-box genes and their roles in reproductive development.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We established a full-length enriched cDNA library using RNA derived from male strobili at various developmental stages, and we obtained 36,011 ESTs that was grouped into 10,463 clusters as unique transcripts. These full-length cDNAs provide information on expression of genes during the development of male reproductive organs. Our full-length enriched cDNA library will be useful for large-scale gene discovery for studies of gymnosperm species.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Plant Materials</p>
            </st>
            <p>Male strobili of <it>C. japonica </it>were collected at seven-days intervals from mid August to mid November of 2003 and 2004. All the samples were immediately frozen in liquid nitrogen and stored at -80&#176;C until use. RNA was extracted from each set of strobili, and a total RNA mixture derived from all the samples was used for construction of a full-length enriched cDNA library.</p>
         </sec>
         <sec>
            <st>
               <p>RNA isolation</p>
            </st>
            <p>Total RNA was isolated by a previously described method <abbrgrp><abbr bid="B52">52</abbr></abbrgrp> with slight modifications, using the SV Total RNA Isolating System (Promega, Madison WI). Frozen male strobili were powdered with Multi-beads shocker<sup>&#174; </sup>(Yasui Kikai, Osaka, Japan). Then we added 7.5 ml of a solution of 100 mM Tris-HCl (pH 9.5), 20 mM EDTA, 1.4 M NaCl, 2% (w/v) polyvinylpyrrolidinone, 5% (w/v) &#946;-mercaptoethanol and 0.5 mg/ml spermidine per gram of powdered male strobili with through mixing. After incubation of the mixture at 65&#176;C for 5 min, RNA was extracted twice with an equal volume of a mixture of chloroform and isoamyl alcohol (24:1, v/v) and centrifugation at 15,000 &#215; <it>g </it>for 20 min. One-fourth volume of 10 M LiCl was added to the final aqueous phase. Total RNA was allowed to precipitate for 2 h and collected by centrifugation at 15,000 &#215; <it>g </it>for 30 min. The pelleted RNA was dissolved in SV RNA Lysis Buffer (Promega) and purified with the SV Total RNA Isolating System according to the protocol provided by the manufacturer. Poly(A)<sup>+ </sup>RNA was prepared with a &#956;MACS&#8482; mRNA Isolation Kit (Miltenyi Biotec, Bergisch Gladbach, Germany) according to the protocol provided by the manufacturer.</p>
         </sec>
         <sec>
            <st>
               <p>Construction of a full-length enriched cDNA library and DNA sequencing</p>
            </st>
            <p>A full-length enriched cDNA library was generated from poly(A)<sup>+ </sup>RNA by the biotinylated CAP trapper method using trehalose-thermoactivated reverse transcriptase as described previously <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B53">53</abbr></abbrgrp>. The resultant cDNAs were normalized, inserted into &#955;-FLC-III vectors and packaged <abbrgrp><abbr bid="B54">54</abbr><abbr bid="B55">55</abbr></abbrgrp>. The packaged DNA produced approximately 1.1 &#215; 10<sup>6 </sup>plaques and the original phage library was amplified once. Part of the one-round-amplified phage library was excised <it>in vivo </it>and resultant plasmids were used to transform <it>E. coli </it>DH10B. Colonies of bacteria were picked with picking machines (Q-bot; Genetix, New Milton, UK) and transferred to 384-microwell plates. DNA templates corresponding to cDNA inserts were prepared from glycerol stocks by the rolling circle amplification (RCA) method with a TempliPhi&#8482; DNA Amplification Kit (GE Healthcare Bio-Sciences Corp., Piscataway, NJ, USA). Products of RCA were sequenced with BigDye<sup>&#174; </sup>Terminator v3.1 Cycle Sequencing Kits (Applied Biosystems, Foster City, CA) and automatic sequencers (ABI 3700; Applied Biosystems). The M13-21 primer (5'-TGT AAA ACG ACG GCC AGT-3') was used for forward sequencing and the 1233 primer (5'-AGC GGA TAA CAA TTT CAC ACA GGA-3') was used for reverse sequencing. Sequences were quality-trimmed by reference to the high-quality (PHRED 20 or better) contiguous region, as determined with PHRED software <abbrgrp><abbr bid="B56">56</abbr></abbrgrp> and then vector and poly(A) regions were removed. Sequences of less than 100 nucleotides after trimming were discarded. Contaminating regions of <it>E. coli </it>genome were identified with BLASTN <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, and ESTs with an <it>E-</it>value &lt; 1e-10 were excluded. The remaining sequences were used for further analysis.</p>
            <p>We searched for the nucleotide and amino acid sequences in TAIR (The Arabidopsis Information Resources; genome release version 6.0) <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> homologous to nucleotide and predicted amino acid sequences encoded by the 5'-end sequences of ESTs using the BLASTN and BLASTX program. We also searched for sequence homologies between the 5'-end sequences of ESTs and the cDNA and promoter region of <it>Cry j 2 </it>using the BLASTN program. Assessment of possible contamination by chloroplast and mitochondrial genomes and by ribosomal RNAs was performed with the BLASTN program (<it>E</it>-value &lt; 1e-10). We used the chloroplast and mitochondrial genomes of <it>Arabidopsis thaliana </it>and genes for ribosomal RNAs from <it>A. thaliana, Populus </it>species, <it>Platanus occidentalis </it>and <it>Zea mays </it>as query sequences.</p>
         </sec>
         <sec>
            <st>
               <p>Grouping of ESTs</p>
            </st>
            <p>Related cDNA sequences from both 5' and 3' ends were grouped as contig sequences with the PHRAP program according to the following criteria: -penalty -5; -minmatch 300; -minscore 300; and -trim_qual 20 <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>. After grouping ESTs as contigs, we used the BLASTN program to compare contig sequences. When contig sequences overlapped by more than 200 nucleotides and were more than 99% homologous in the overlapping regions and, in addition, when the overlap began within 5 bp from the ends of the contig sequences, we grouped the contig sequences together. We performed BLASTN searches separately for the EST sequences of between 100 and 200 bp and the resultant contig sequences. When an EST sequence and a contig were more than 98% homologous and the length of non-overlapping sequence in the EST sequence was less than 10 bp and, in addition, the overlap began within 5 bp from the end of the EST sequence or the contig, we grouped the EST sequence and the contig together. Finally both 5'- and 3'-end sequences derived from the same clone were grouped together.</p>
         </sec>
         <sec>
            <st>
               <p>Functional classification and annotation of ESTs</p>
            </st>
            <p>We categorized ESTs on the basis of the putative functions of encoded products using a database of clusters of orthologous groups from seven eukaryotic genomes (COG) <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. We used databases of proteins from more than three species (KOGs), proteins from two species (TWOGs) and lineage-specific expansion groups (LSEs). We compared the results of BLASTX analysis of each sequence in each cluster and adopted the functional category with the highest score.</p>
            <p>We performed similarity searches with the BLASTX and TBLASTX programs. We used RefSeq of <it>Arabidopsis </it>from the National Center for Biotechnology Information <abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, the database of the Knowledge-based Oryza Molecular Biological Encyclopedia <abbrgrp><abbr bid="B61">61</abbr></abbrgrp> and UniProtKB/TrEMBL Release 33.3 <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> as protein databases, and we used <it>Pinus </it>Gene Index Release 6.0, Spruce Gene Index Release 2.0, and Poplar Gene Index Release 3.0 from the TIGR Gene Indices <abbrgrp><abbr bid="B63">63</abbr></abbrgrp> as EST databases. Sequences of stamen- and pollen-specific transcripts of <it>Arabidopsis </it>were obtained from previous studies <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>. Similarities to ESTs that had previously been derived from <it>C. japonica </it>were determined with the BLASTN program. Similarities to known plant allergens were determined by BLASTX comparison with proteins in Allergome database <url>http://www.allergome.org</url><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. We annotated protein families using a list of Pfam domain sequence (Pfam-A.fasta; release 21.0) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> using the BLASTX program and an <it>E-</it>value &lt; 1e-10. We obtained domain annotations of <it>Arabidopsis thaliana </it>proteins from the TAIR website <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and selected Pfam families with an <it>E-</it>value &lt; 1e-10.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>To reconstruct a phylogenetic tree of MADS-box genes, we obtained amino acid sequences from EMBL/DDBJ/GenBank DNA databases and aligned them using the Clustal W program <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. The alignments of sequences of MADS, intervening (I-) and keratin-like (K-) domains were edited manually using the MacClade program <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. Maximum likelihood distances were calculated with the NJdist and ProtML programs in MOLPHY <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. A neighbor-joining (NJ) tree was obtained with NJdist, and was based on ML distances in the JTT model <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>. This tree was used as the starting tree for a local rearrangement search with the ProtML program <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. The local bootstrap probability of each branch was estimated by the resampling-of-estimated-log-likelihood (RELL) method <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>NF designed this study, prepared plant materials and RNA, managed the construction of a full-length enriched cDNA library, performed similarity searches and phylogenetic analysis, and drafted the manuscript. YT and AT did the sequencing work, the gene clustering, the gene annotation and submitted the data to DDBJ. AM provided the sequences and data on allergens derived from the Allergome Database. TI, TN, MS, YS, KaS and KeS participated in the design and coordination of the study.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The authors are grateful to the members of Sequence Technology Team, RIKEN genomic Sciences Center for technical support. This research was supported in part by a Grant-in-Aid "Research Project for Utilizing Advanced Technologies in Agriculture, Forestry and Fisheries" and in part by a Grant-in-Aid for Scientific Research (No. 18780123) from the Ministry of Education, Culture, Sports, Science and Technology of Japan, and in part by a research grant (No. 200607) from the Forestry and Forest Products Research Institute.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular genetic analyses of microsporogenesis and microgametogenesis in flowering plants</p>
            </title>
            <aug>
               <au>
                  <snm>Ma</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Annu Rev Plant Biol</source>
            <pubdate>2005</pubdate>
            <volume>56</volume>
            <fpage>393</fpage>
            <lpage>434</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.arplant.55.031903.141717</pubid>
                  <pubid idtype="pmpid" link="fulltext">15862102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>The tapetum: Its form, function, and possible phylogeny in <it>Embryophyta</it></p>
            </title>
            <aug>
               <au>
                  <snm>Pacini</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Franchi</snm>
                  <fnm>GG</fnm>
               </au>
               <au>
                  <snm>Hesse</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Plant Syst Evol</source>
            <pubdate>1985</pubdate>
            <volume>149</volume>
            <fpage>155</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF00983304</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <aug>
               <au>
                  <snm>Stanley</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Linskens</snm>
                  <fnm>HF</fnm>
               </au>
            </aug>
            <source>Pollen: biology, biochemistry, management</source>
            <publisher>Berlin: Springer-Verlag</publisher>
            <pubdate>1974</pubdate>
         </bibl>
         <bibl id="B4">
            <title>
               <p>MADS-box genes active in developing pollen cones of Norway spruce (<it>Picea abies</it>) are homologous to the B-class floral homeotic genes in angiosperms</p>
            </title>
            <aug>
               <au>
                  <snm>Sundstr&#246;m</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Carlsbecker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Svensson</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Svenson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Johanson</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Thei&#223;en</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Dev Genet</source>
            <pubdate>1999</pubdate>
            <volume>25</volume>
            <fpage>253</fpage>
            <lpage>266</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1520-6408(1999)25:3&lt;253::AID-DVG8>3.0.CO;2-P</pubid>
                  <pubid idtype="pmpid" link="fulltext">10528266</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Conifer reproductive development involves B-type MADS-box genes with distinct and different activities in male organ primordia</p>
            </title>
            <aug>
               <au>
                  <snm>Sundstr&#246;m</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2002</pubdate>
            <volume>31</volume>
            <fpage>161</fpage>
            <lpage>169</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313X.2002.01343.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">12121446</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Characterization of an AGAMOUS homologue from the conifer black spruce (Picea mariana) that produces floral homeotic conversions when expressed in Arabidopsis</p>
            </title>
            <aug>
               <au>
                  <snm>Rutledge</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Regan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nicolas</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Fobert</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>C&#244;t&#233;</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bosnich</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kauffeldt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sunohara</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>S&#233;guin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>625</fpage>
            <lpage>634</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313x.1998.00250.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9778845</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>A <it>DEF/GLO</it>-like MADS-box gene from a gymnosperm: <it>Pinus radiata </it>contains an ortholog of angiosperm B class floral homeotic genes</p>
            </title>
            <aug>
               <au>
                  <snm>Mouradov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hamdorf</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Teasdale</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Winter</snm>
                  <fnm>K-U</fnm>
               </au>
               <au>
                  <snm>Thei&#223;en</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Dev Genet</source>
            <pubdate>1999</pubdate>
            <volume>25</volume>
            <fpage>245</fpage>
            <lpage>252</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/(SICI)1520-6408(1999)25:3&lt;245::AID-DVG7>3.0.CO;2-N</pubid>
                  <pubid idtype="pmpid" link="fulltext">10528265</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The DAL10 gene from Norway spruce (<it>Picea abies</it>) belongs to a potentially gymnosperm-specific subclass of MADS-box genes and is specifically active in seed cones and pollen cones</p>
            </title>
            <aug>
               <au>
                  <snm>Carlsbecker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sundstr&#246;m</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tandre</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Englund</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kvarnheden</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Johanson</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Evol Dev</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>551</fpage>
            <lpage>561</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1525-142X.2003.03060.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14984037</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>The MADS-box gene <it>DAL1 </it>is a potential mediator of the juvenile-to-adult transition in Norway spruce (<it>Picea abies</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Carlsbecker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Tandre</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johanson</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Englund</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>2004</pubdate>
            <volume>40</volume>
            <fpage>546</fpage>
            <lpage>557</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-313X.2004.02226.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">15500470</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Conifer homologues to genes that control floral development in angiosperms</p>
            </title>
            <aug>
               <au>
                  <snm>Tandre</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Sund&#229;s</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Engstr&#246;m</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>1995</pubdate>
            <volume>27</volume>
            <fpage>69</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/BF00019179</pubid>
                  <pubid idtype="pmpid">7865797</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p><it>APETALA2 </it>like genes from <it>Picea abies </it>show functional similarities to their Arabidopsis homologues</p>
            </title>
            <aug>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Carlsbecker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sund&#229;s-Larsson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vahala</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>2007</pubdate>
            <volume>225</volume>
            <fpage>589</fpage>
            <lpage>602</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00425-006-0374-1</pubid>
                  <pubid idtype="pmpid" link="fulltext">16953432</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p><it>NEEDLY</it>, a <it>Pinus radiata </it>ortholog of <it>FLORICAULA/LEAFY </it>genes, expressed in both reproductive and vegetative meristems</p>
            </title>
            <aug>
               <au>
                  <snm>Mouradov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Glassick</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hamdorf</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fowler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Marla</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Teasdale</snm>
                  <fnm>RD</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>6537</fpage>
            <lpage>6542</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">27861</pubid>
                  <pubid idtype="pmpid" link="fulltext">9601002</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.11.6537</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p><it>PRFLL </it>&#8211; a <it>Pinus radiata </it>homologue of <it>FLORICAULA </it>and <it>LEAFY </it>is expressed in buds containing vegetative shoot and undifferentiated male cone primordia</p>
            </title>
            <aug>
               <au>
                  <snm>Mellerowicz</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Horgan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Walden</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Planta</source>
            <pubdate>1998</pubdate>
            <volume>206</volume>
            <fpage>619</fpage>
            <lpage>629</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s004250050440</pubid>
                  <pubid idtype="pmpid" link="fulltext">9821691</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Genes expressed in <it>Pinus radiata </it>male cones include homologs to anther-specific and pathogenesis response genes</p>
            </title>
            <aug>
               <au>
                  <snm>Walden</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Walter</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>RC</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>1999</pubdate>
            <volume>121</volume>
            <fpage>1103</fpage>
            <lpage>1116</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">59478</pubid>
                  <pubid idtype="pmpid" link="fulltext">10594098</pubid>
                  <pubid idtype="doi">10.1104/pp.121.4.1103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Epidemiology of Japanese cedar pollinosis throughout Japan</p>
            </title>
            <aug>
               <au>
                  <snm>Okuda</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Ann Allergy Asthma Immunol</source>
            <pubdate>2003</pubdate>
            <volume>91</volume>
            <fpage>288</fpage>
            <lpage>296</lpage>
            <xrefbib>
               <pubid idtype="pmpid">14533662</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>
                  <it>Cryptomeria japonica</it>
               </p>
            </title>
            <aug>
               <au>
                  <snm>Nagao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pharis</snm>
                  <fnm>RP</fnm>
               </au>
            </aug>
            <source>CRC Handbook of Flowering</source>
            <publisher>Boca Raton, FL: CRC Press</publisher>
            <editor>Halevy AH</editor>
            <pubdate>1989</pubdate>
            <fpage>247</fpage>
            <lpage>269</lpage>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Detection of time and initiating factors on male flower differentiation of Japanese cedar (<it>Cryptomeria japonica</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Jin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Nakanishi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Jpn J Palynol</source>
            <pubdate>2004</pubdate>
            <volume>50</volume>
            <fpage>23</fpage>
            <lpage>29</lpage>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Analysis of expressed sequence tags derived from developing seed and pollen cones of <it>Cryptomeria japonica</it></p>
            </title>
            <aug>
               <au>
                  <snm>Ujino-Ihara</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Taguchi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yoshimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsumura</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Plant Biol</source>
            <pubdate>2003</pubdate>
            <volume>5</volume>
            <fpage>600</fpage>
            <lpage>607</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1055/s-2003-44690</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Analysis of expressed sequence tags from <it>Cryptomeria japonica </it>pollen reveals novel pollen-specific transcripts</p>
            </title>
            <aug>
               <au>
                  <snm>Futamura</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ujino-Ihara</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nishiguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kanamori</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yoshimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sakaguchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shinohara</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Tree Physiol</source>
            <pubdate>2006</pubdate>
            <volume>26</volume>
            <fpage>1517</fpage>
            <lpage>1528</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17169891</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>High-efficiency cloning of <it>Arabidopsis </it>full-length cDNA by biotinylated CAP trapper</p>
            </title>
            <aug>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Nishiyama</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Plant J</source>
            <pubdate>1998</pubdate>
            <volume>15</volume>
            <fpage>707</fpage>
            <lpage>720</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-313x.1998.00237.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">9778851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Identification of plant promoter constituents by analysis of local distribution of short sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Yamamoto</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Ichida</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Matsui</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Obokata</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sakurai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Satou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Abe</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>67</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1832190</pubid>
                  <pubid idtype="pmpid" link="fulltext">17346352</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-8-67</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Characterization of genes for a pollen allergen, Cry j 2, of <it>Cryptomeria japonica</it></p>
            </title>
            <aug>
               <au>
                  <snm>Futamura</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kusunoki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Mukai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shinohara</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Int Arch Allergy Immunol</source>
            <pubdate>2007</pubdate>
            <volume>143</volume>
            <fpage>59</fpage>
            <lpage>68</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1159/000098225</pubid>
                  <pubid idtype="pmpid" link="fulltext">17192717</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Analysis and functional annotation of an expressed sequence tag collection for tropical crop sugarcane</p>
            </title>
            <aug>
               <au>
                  <snm>Vettore</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>da Silva</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Kemper</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Souza</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>da Silva</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>MIT</fnm>
               </au>
               <au>
                  <snm>Henrique-Silva</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Giglioti</snm>
                  <fnm>&#201;A</fnm>
               </au>
               <au>
                  <snm>Lemos</snm>
                  <fnm>MVF</fnm>
               </au>
               <au>
                  <snm>Coutinho</snm>
                  <fnm>LL</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Carrer</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fran&#231;a</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Bacci</snm>
                  <fnm>M</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>MHS</fnm>
               </au>
               <au>
                  <snm>Gomes</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nunes</snm>
                  <fnm>LR</fnm>
               </au>
               <au>
                  <snm>Camargo</snm>
                  <fnm>LEA</fnm>
               </au>
               <au>
                  <snm>Siqueira</snm>
                  <fnm>WJ</fnm>
               </au>
               <au>
                  <snm>Van Sluys</snm>
                  <fnm>M-A</fnm>
               </au>
               <au>
                  <snm>Thiemann</snm>
                  <fnm>OH</fnm>
               </au>
               <au>
                  <snm>Kuramae</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Santelli</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Marino</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Targon</snm>
                  <fnm>MLPN</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Silveira</snm>
                  <fnm>HCS</fnm>
               </au>
               <au>
                  <snm>Marini</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Lemos</snm>
                  <fnm>EGM</fnm>
               </au>
               <au>
                  <snm>Monteiro-Vitorello</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Tambor</snm>
                  <fnm>JHM</fnm>
               </au>
               <au>
                  <snm>Carraro</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Roberto</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Martins</snm>
                  <fnm>VG</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>de Oliveira</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Truffi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Colombo</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Rossi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>de Araujo</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Sculaccio</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Angella</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lima</snm>
                  <fnm>MMA</fnm>
               </au>
               <au>
                  <snm>de Rosa</snm>
                  <fnm>VE</fnm>
                  <suf>Jr</suf>
               </au>
               <au>
                  <snm>Siviero</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Coscrato</snm>
                  <fnm>VE</fnm>
               </au>
               <au>
                  <snm>Machado</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Grivet</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Di Mauro</snm>
                  <fnm>SMZ</fnm>
               </au>
               <au>
                  <snm>Nobrega</snm>
                  <fnm>FG</fnm>
               </au>
               <au>
                  <snm>Menck</snm>
                  <fnm>CFM</fnm>
               </au>
               <au>
                  <snm>Braga</snm>
                  <fnm>MDV</fnm>
               </au>
               <au>
                  <snm>Telles</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Cara</snm>
                  <fnm>FAA</fnm>
               </au>
               <au>
                  <snm>Pedrosa</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Meidanis</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Arruda</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>2725</fpage>
            <lpage>2735</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403815</pubid>
                  <pubid idtype="pmpid" link="fulltext">14613979</pubid>
                  <pubid idtype="doi">10.1101/gr.1532103</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Functional annotation of a full-length mouse cDNA collection</p>
            </title>
            <aug>
               <au>
                  <cnm>The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium</cnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2001</pubdate>
            <volume>409</volume>
            <fpage>685</fpage>
            <lpage>690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35055500</pubid>
                  <pubid idtype="pmpid" link="fulltext">11217851</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>The COG database: an updated version includes eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Rao</snm>
                  <fnm>BS</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sverdlov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Vasudevan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-41</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Comparative analysis of expressed sequence tags of conifers and angiosperms reveals sequences specifically conserved in conifers</p>
            </title>
            <aug>
               <au>
                  <snm>Ujino-Ihara</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kanamori</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Yamane</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Taguchi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Namiki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mukai</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yoshimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsumura</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2005</pubdate>
            <volume>59</volume>
            <fpage>895</fpage>
            <lpage>907</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s11103-005-2080-y</pubid>
                  <pubid idtype="pmpid" link="fulltext">16307365</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>The Pfam protein families database</p>
            </title>
            <aug>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coin</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Griffiths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>ELL</fnm>
               </au>
               <au>
                  <snm>Studholme</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Yeats</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>D138</fpage>
            <lpage>D141</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308855</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681378</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh121</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Generation, annotation, analysis and database integration of 16,500 white spruce EST clusters</p>
            </title>
            <aug>
               <au>
                  <snm>Pavy</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Paule</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Parsons</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Crow</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Morency</snm>
                  <fnm>M-J</fnm>
               </au>
               <au>
                  <snm>Cooke</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Noumen</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Guillet-Claude</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Butterfield</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stott</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kirkpatrick</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Siddiqui</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Holt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Seguin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Retzel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bousquet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>MacKay</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>144</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1277824</pubid>
                  <pubid idtype="pmpid" link="fulltext">16236172</pubid>
                  <pubid idtype="doi">10.1186/1471-2164-6-144</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Genome-wide analysis of spatial gene expression in Arabidopsis flowers</p>
            </title>
            <aug>
               <au>
                  <snm>Wellmer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Riechmann</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Alves-Ferreira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Meyerowitz</snm>
                  <fnm>EM</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2004</pubdate>
            <volume>16</volume>
            <fpage>1314</fpage>
            <lpage>1326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">423218</pubid>
                  <pubid idtype="pmpid" link="fulltext">15100403</pubid>
                  <pubid idtype="doi">10.1105/tpc.021741</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Transcriptome analysis of haploid male gametophyte development in <it>Arabidopsis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Honys</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Twell</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <fpage>R85</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545776</pubid>
                  <pubid idtype="pmpid" link="fulltext">15535861</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-11-r85</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>Bioinformatics applied to allergy: Allergen databases, from collecting sequence information to data integration. The Allergome platform as a model</p>
            </title>
            <aug>
               <au>
                  <snm>Mari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Scala</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Palazzo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ridolfi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Zennaro</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Carabella</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Cell Immunol</source>
            <pubdate>2006</pubdate>
            <volume>244</volume>
            <fpage>97</fpage>
            <lpage>100</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cellimm.2007.02.012</pubid>
                  <pubid idtype="pmpid" link="fulltext">17434469</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>AGRIS and AtRegNet. A platform to link cis-regulatory elements and transcription factors into regulatory networks</p>
            </title>
            <aug>
               <au>
                  <snm>Palaniswamy</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sun</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lamb</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Davuluri</snm>
                  <fnm>RV</fnm>
               </au>
               <au>
                  <snm>Grotewold</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Plant Physiol</source>
            <pubdate>2006</pubdate>
            <volume>140</volume>
            <fpage>818</fpage>
            <lpage>829</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1400579</pubid>
                  <pubid idtype="pmpid" link="fulltext">16524982</pubid>
                  <pubid idtype="doi">10.1104/pp.105.072280</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Flow cytometric determination of genome size in the Taxodiaceae, Cupressaceae <it>sensu stricto </it>and Sciadopityaceae</p>
            </title>
            <aug>
               <au>
                  <snm>Hizume</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kondo</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Ishizuka</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Cytologia</source>
            <pubdate>2001</pubdate>
            <volume>66</volume>
            <fpage>307</fpage>
            <lpage>311</lpage>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Nuclear DNA amounts in gymnosperms</p>
            </title>
            <aug>
               <au>
                  <snm>Murray</snm>
                  <fnm>BG</fnm>
               </au>
            </aug>
            <source>Ann Bot</source>
            <pubdate>1998</pubdate>
            <volume>82</volume>
            <fpage>3</fpage>
            <lpage>15</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1006/anbo.1998.0764</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Expression analysis of ESTs derived from the inner bark of <it>Cryptomeria japonica</it></p>
            </title>
            <aug>
               <au>
                  <snm>Ujino-Ihara</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Yoshimura</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ugawa</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yoshimaru</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nagasaka</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tsumura</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>43</volume>
            <fpage>451</fpage>
            <lpage>457</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1006492103063</pubid>
                  <pubid idtype="pmpid" link="fulltext">11052197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Functional annotation of a full-length <it>Arabidopsis </it>cDNA collection</p>
            </title>
            <aug>
               <au>
                  <snm>Seki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Narusaka</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kamiya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ishida</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Satou</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sakurai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Enju</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Akiyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Oono</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Muramatsu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kawai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ishii</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Arakawa</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shinagawa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shinozaki</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2002</pubdate>
            <volume>296</volume>
            <fpage>141</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1071006</pubid>
                  <pubid idtype="pmpid" link="fulltext">11910074</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>CapSelect: a highly sensitive method for 5' CAP-dependent enrichment of full-length cDNA in PCR-mediated analysis of mRNAs</p>
            </title>
            <aug>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>WM</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>MW</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>1999</pubdate>
            <volume>27</volume>
            <fpage>e31</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">148683</pubid>
                  <pubid idtype="pmpid" link="fulltext">10518626</pubid>
                  <pubid idtype="doi">10.1093/nar/27.21.e31</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Apparent homology of expressed genes from wood-forming tissues of loblolly pine (<it>Pinus taeda </it>L.) with <it>Arabidopsis thaliana</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kirst</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Baucom</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ulrich</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Staggs</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Paule</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Retzel</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Whetten</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sederoff</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <fpage>7383</fpage>
            <lpage>7388</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165884</pubid>
                  <pubid idtype="pmpid" link="fulltext">12771380</pubid>
                  <pubid idtype="doi">10.1073/pnas.1132171100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Complex gene families in pine genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Kinlaw</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Neale</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>1996</pubdate>
            <volume>2</volume>
            <fpage>356</fpage>
            <lpage>359</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S1360-1385(97)84624-9</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Conserved ortholog sets in forest trees</p>
            </title>
            <aug>
               <au>
                  <snm>Krutovsky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Elsik</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Matvienko</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kozik</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Neale</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Tree Genet Genomes</source>
            <pubdate>2006</pubdate>
            <volume>3</volume>
            <fpage>61</fpage>
            <lpage>70</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s11295-006-0052-2</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Type I MADS-box genes have experienced faster birth-and-death evolution than type II MADS-box genes in angiosperms</p>
            </title>
            <aug>
               <au>
                  <snm>Nam</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>An</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nei</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>1910</fpage>
            <lpage>1915</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">357026</pubid>
                  <pubid idtype="pmpid" link="fulltext">14764899</pubid>
                  <pubid idtype="doi">10.1073/pnas.0308430100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Two ancient classes of MIKC-type MADS-box genes are present in the moss <it>Physcomitrella patens</it></p>
            </title>
            <aug>
               <au>
                  <snm>Henschel</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kofuji</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hasebe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>M&#252;nster</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Thei&#223;en</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <fpage>801</fpage>
            <lpage>814</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12032236</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>The major clades of MADS-box genes and their role in the development and evolution of flowering plants</p>
            </title>
            <aug>
               <au>
                  <snm>Becker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thei&#223;en</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>2003</pubdate>
            <volume>29</volume>
            <fpage>464</fpage>
            <lpage>489</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1055-7903(03)00207-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">14615187</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>AGL80 is required for central cell and endosperm development in <it>Arabidopsis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Portereiko</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Lloyd</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Steffen</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Punwani</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Otsuga</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Drews</snm>
                  <fnm>GN</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2006</pubdate>
            <volume>18</volume>
            <fpage>1862</fpage>
            <lpage>1872</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1533969</pubid>
                  <pubid idtype="pmpid" link="fulltext">16798889</pubid>
                  <pubid idtype="doi">10.1105/tpc.106.040824</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>The floral genome: an evolutionary history of gene duplication and shifting patterns of gene expression</p>
            </title>
            <aug>
               <au>
                  <snm>Soltis</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Frohlich</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Soltis</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Albert</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Oppenheimer</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Altman</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>dePamphilis</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Leebens-Mack</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Trends Plant Sci</source>
            <pubdate>2007</pubdate>
            <volume>12</volume>
            <fpage>358</fpage>
            <lpage>367</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tplants.2007.06.012</pubid>
                  <pubid idtype="pmpid" link="fulltext">17658290</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>A novel MADS-box gene subfamily with a sister-group relationship to class B floral homeotic genes</p>
            </title>
            <aug>
               <au>
                  <snm>Becker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kaufmann</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Freialdenhoven</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vincent</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>M-A</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Theissen</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Mol Genet Genomics</source>
            <pubdate>2002</pubdate>
            <volume>266</volume>
            <fpage>942</fpage>
            <lpage>950</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00438-001-0615-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">11862488</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>The <it>TRANSPARENT TESTA16 </it>locus encodes the ARABIDOPSIS BSISTER MADS domain protein and is required for proper development and pigmentation of the seed coat</p>
            </title>
            <aug>
               <au>
                  <snm>Nesi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Debeaujon</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Jond</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Jenkins</snm>
                  <fnm>GI</fnm>
               </au>
               <au>
                  <snm>Caboche</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lepiniec</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Plant Cell</source>
            <pubdate>2002</pubdate>
            <volume>14</volume>
            <fpage>2463</fpage>
            <lpage>2479</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">151229</pubid>
                  <pubid idtype="pmpid" link="fulltext">12368498</pubid>
                  <pubid idtype="doi">10.1105/tpc.004127</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Phylogenetic signal in nucleotide data from seed plants: implications for resolving the seed plant tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Burleigh</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Mathews</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Am J Bot</source>
            <pubdate>2004</pubdate>
            <volume>91</volume>
            <fpage>1599</fpage>
            <lpage>1613</lpage>
            <xrefbib>
               <pubid idtype="doi">10.3732/ajb.91.10.1599</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Seed plant phylogeny inferred from all three plant genomes: Monophyly of extant gymnosperms and origin of Gnetales from conifers</p>
            </title>
            <aug>
               <au>
                  <snm>Chaw</snm>
                  <fnm>S-M</fnm>
               </au>
               <au>
                  <snm>Parkinson</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Cheng</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Vincent</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Palmer</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>PNAS</source>
            <pubdate>2000</pubdate>
            <volume>97</volume>
            <fpage>4086</fpage>
            <lpage>4091</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18157</pubid>
                  <pubid idtype="pmpid" link="fulltext">10760277</pubid>
                  <pubid idtype="doi">10.1073/pnas.97.8.4086</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Evolution and divergence of the MADS-box gene family based on genome-wide expression analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Kofuji</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Sumikawa</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yamasaki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kondo</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ueda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ito</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hasebe</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2003</pubdate>
            <volume>20</volume>
            <fpage>1963</fpage>
            <lpage>1977</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msg216</pubid>
                  <pubid idtype="pmpid" link="fulltext">12949148</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>A short history of MADS-box genes in plants</p>
            </title>
            <aug>
               <au>
                  <snm>Theissen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Becker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rosa</snm>
                  <fnm>AD</fnm>
               </au>
               <au>
                  <snm>Kanno</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kim</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>M&#252;nster</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Winter</snm>
                  <fnm>K-U</fnm>
               </au>
               <au>
                  <snm>Saedler</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>42</volume>
            <fpage>115</fpage>
            <lpage>149</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/A:1006332105728</pubid>
                  <pubid idtype="pmpid" link="fulltext">10688133</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>A simple and efficient method for isolating RNA from pine trees</p>
            </title>
            <aug>
               <au>
                  <snm>Chang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Puryear</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cairney</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol Rep</source>
            <pubdate>1993</pubdate>
            <volume>11</volume>
            <fpage>113</fpage>
            <lpage>116</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/BF02670468</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Thermostabilization and thermoactivation of thermolabile enzymes by trehalose and its application for the synthesis of full length cDNA</p>
            </title>
            <aug>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Nishiyama</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Westover</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nagaoka</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sasaki</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Okazaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Muramatsu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>520</fpage>
            <lpage>524</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18452</pubid>
                  <pubid idtype="pmpid" link="fulltext">9435224</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.2.520</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Normalization and subtraction of cap-trapper-selected cDNAs to prepare full-length cDNA libraries for rapid discovery of new genes</p>
            </title>
            <aug>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hayatsu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Sugahara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Konno</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Okazaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Muramatsu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2000</pubdate>
            <volume>10</volume>
            <fpage>1617</fpage>
            <lpage>1630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">310980</pubid>
                  <pubid idtype="pmpid" link="fulltext">11042159</pubid>
                  <pubid idtype="doi">10.1101/gr.145100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Balanced-size and long-size cloning of full-length, cap-trapped cDNAs into vectors of the novel &#955;-FLC family allows enhanced gene discovery rate and functional analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Carninci</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hayatsu</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Shiraki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hirozane</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Watahiki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Shibata</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Konno</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Muramatsu</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hayashizaki</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2001</pubdate>
            <volume>77</volume>
            <fpage>79</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/geno.2001.6601</pubid>
                  <pubid idtype="pmpid" link="fulltext">11543636</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Base-calling of automated sequencer traces using <it>phred</it>. I. Accuracy assessment</p>
            </title>
            <aug>
               <au>
                  <snm>Ewing</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wendl</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>175</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9521921</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Meyers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B58">
            <title>
               <p>TAIR: a resource for integrated <it>Arabidopsis </it>data</p>
            </title>
            <aug>
               <au>
                  <snm>Garcia-Hernandez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Berardini</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Crist</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Doyle</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Huala</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Knee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lambrecht</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Mueller</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Mundodi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Reiser</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Scholl</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tacklind</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Weems</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Yoo</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Funct Integr Genomics</source>
            <pubdate>2002</pubdate>
            <volume>2</volume>
            <fpage>239</fpage>
            <lpage>253</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s10142-002-0077-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">12444417</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p><it>Consed: </it>a graphical tool for sequence finishing</p>
            </title>
            <aug>
               <au>
                  <snm>Gordon</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Abajian</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>195</fpage>
            <lpage>202</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9521923</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D501</fpage>
            <lpage>504</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">539979</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608248</pubid>
                  <pubid idtype="doi">10.1093/nar/gki025</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Collection, mapping, and annotation of over 28,000 cDNA clones from <it>japonica </it>rice</p>
            </title>
            <aug>
               <au>
                  <cnm>The Rice Full-Length cDNA Consortium</cnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>301</volume>
            <fpage>376</fpage>
            <lpage>379</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1081288</pubid>
                  <pubid idtype="pmpid" link="fulltext">12869764</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>The Universal Protein Resource (UniProt)</p>
            </title>
            <aug>
               <au>
                  <snm>Bairoch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Boeckmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ferro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gasteiger</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Magrane</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Natale</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>O'Donovan</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Redaschi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Yeh</snm>
                  <fnm>L-SL</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D154</fpage>
            <lpage>D159</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540024</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608167</pubid>
                  <pubid idtype="doi">10.1093/nar/gki070</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>The TIGR Gene Indices: clustering and assembling EST and known genes and integration with eukaryotic genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Lee</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Tsai</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sunkara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Karamycheva</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pertea</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sultana</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Antonescu</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Chan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cheung</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Quackenbush</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2005</pubdate>
            <volume>33</volume>
            <fpage>D71</fpage>
            <lpage>D74</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540018</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608288</pubid>
                  <pubid idtype="doi">10.1093/nar/gki064</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid" link="fulltext">7984417</pubid>
                  <pubid idtype="doi">10.1093/nar/22.22.4673</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>MacClade 4: Analysis of phylogeny and character evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Maddison</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Maddison</snm>
                  <fnm>WP</fnm>
               </au>
            </aug>
            <publisher>Sinauer Associates, Inc</publisher>
            <pubdate>2000</pubdate>
         </bibl>
         <bibl id="B66">
            <title>
               <p>MOLPHY version 2.3: programs for molecular phylogenetics based on maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Adachi</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Comput Sci Monogr</source>
            <pubdate>1996</pubdate>
            <volume>28</volume>
            <fpage>1</fpage>
            <lpage>150</lpage>
         </bibl>
         <bibl id="B67">
            <title>
               <p>The rapid generation of mutation data matrices from protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Comput Appl Biosci</source>
            <pubdate>1992</pubdate>
            <volume>8</volume>
            <fpage>275</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1633570</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B68">
            <title>
               <p>Accuracies of the simple methods for estimating the bootstrap probability of a maximum-likelihood tree</p>
            </title>
            <aug>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kishino</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1994</pubdate>
            <volume>11</volume>
            <fpage>142</fpage>
            <lpage>145</lpage>
         </bibl>
      </refgrp>
   </bm>
</art>
