<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2180-6-7</ui>
   <ji>1471-2180</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Generation, annotation, and analysis of an extensive <it>Aspergillus niger </it>EST collection</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Semova</snm>
               <fnm>Natalia</fnm>
               <insr iid="I1"/>
               <email>nsemova@gene.concordia.ca</email>
            </au>
            <au id="A2">
               <snm>Storms</snm>
               <fnm>Reginald</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>storms@vax2.concordia.ca</email>
            </au>
            <au id="A3">
               <snm>John</snm>
               <fnm>Tricia</fnm>
               <insr iid="I1"/>
               <email>tjohn@gene.concordia.ca</email>
            </au>
            <au id="A4">
               <snm>Gaudet</snm>
               <fnm>Pascale</fnm>
               <insr iid="I1"/>
               <insr iid="I4"/>
               <email>pgaudet@northwestern.edu</email>
            </au>
            <au id="A5">
               <snm>Ulycznyj</snm>
               <fnm>Peter</fnm>
               <insr iid="I1"/>
               <email>peteru@vax2.concordia.ca</email>
            </au>
            <au id="A6">
               <snm>Min</snm>
               <mnm>Jia</mnm>
               <fnm>Xiang</fnm>
               <insr iid="I1"/>
               <email>jack@gene.concordia.ca</email>
            </au>
            <au id="A7">
               <snm>Sun</snm>
               <fnm>Jian</fnm>
               <insr iid="I1"/>
               <email>jian@gene.concordia.ca</email>
            </au>
            <au id="A8">
               <snm>Butler</snm>
               <fnm>Greg</fnm>
               <insr iid="I1"/>
               <insr iid="I3"/>
               <email>gregb@encs.concordia.ca</email>
            </au>
            <au id="A9" ca="yes">
               <snm>Tsang</snm>
               <fnm>Adrian</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>tsang@gene.concordia.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Centre for Structural and Functional Genomics, Concordia University, Montreal, Canada</p>
            </ins>
            <ins id="I2">
               <p>Department of Biology, Concordia University, Montreal, Canada</p>
            </ins>
            <ins id="I3">
               <p>Department of Computer Science and Software Engineering, Concordia University, Montreal, Canada</p>
            </ins>
            <ins id="I4">
               <p>Northwestern University, 676 N. St. Clair Street, Chicago, IL 60611</p>
            </ins>
         </insg>
         <source>BMC Microbiology</source>
         <issn>1471-2180</issn>
         <pubdate>2006</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>7</fpage>
         <url>http://www.biomedcentral.com/1471-2180/6/7</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16457709</pubid>
               <pubid idtype="doi">10.1186/1471-2180-6-7</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>03</day>
               <month>10</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>02</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>02</day>
               <month>2</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Semova et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p><it>Aspergillus niger</it>, a saprophyte commonly found on decaying vegetation, is widely used and studied for industrial purposes. Despite its place as one of the most important organisms for commercial applications, the lack of available information about its genetic makeup limits research with this filamentous fungus.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>We present here the analysis of 12,820 expressed sequence tags (ESTs) generated from <it>A. niger </it>cultured under seven different growth conditions. These ESTs identify about 5,108 genes of which 44.5% code for proteins sharing similarity (E &#8804; 1e <sup>-5</sup>) with GenBank entries of known function, 38% code for proteins that only share similarity with GenBank entries of unknown function and 17.5% encode proteins that do not have a GenBank homolog. Using the Gene Ontology hierarchy, we present a first classification of the <it>A. niger </it>proteins encoded by these genes and compare its protein repertoire with other well-studied fungal species. We have established a searchable web-based database that includes the EST and derived contig sequences and their annotation. Details about this project and access to the annotated <it>A. niger </it>database are available.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This EST collection and its annotation provide a significant resource for fundamental and applied research with <it>A. niger</it>. The gene set identified in this manuscript will be highly useful in the annotation of the genome sequence of <it>A. niger</it>, the genes described in the manuscript, especially those encoding hydrolytic enzymes will provide a valuable source for researchers interested in enzyme properties and applications.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Members of the genus <it>Aspergillus</it>, including <it>Aspergillus niger</it>, are distributed worldwide and are commonly present on decaying plant debris. These saprophytes degrade the complex molecules in plant cell materials by secreting an extensive assortment of hydrolytic enzymes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Since <it>A. niger </it>grows on organic matter over a wide range of temperature, 6&#8211;47&#176;C, and pH, 1.4&#8211;9.8 <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, this fungus produces enzymes that are active in diverse environmental conditions. Indeed, many enzymes produced by this fungus have already found application in the food, beverage, textile, agriculture, and paper and pulp industries <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B3">3</abbr></abbrgrp>. <it>A. niger </it>is also widely used in the manufacture of organic acids including citric, gluconic and fumaric acids <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. Importantly, citric acid and many enzymes produced in <it>A. niger </it>have received 'generally regarded as safe' or GRAS status by the United States Food and Drug Administration (FDA), and can therefore, be safely used for agro-food applications <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p><it>Aspergillus niger</it>, with its long history of use for various industrial applications and the ability to efficiently produce native proteins, is an attractive host for the production of heterologous proteins <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>. The commercial production of heterologous proteins using <it>A. niger </it>started when Genencor International (San Francisco) produced bovine chymosin in <it>A. niger </it><abbrgrp><abbr bid="B7">7</abbr></abbrgrp> and received US FDA approval for its application in cheese making. <it>A. niger </it>has subsequently been used as an expression host to produce commercially viable levels of many heterologous proteins, including; human cytokine interleukin -6 (<it>IL-6</it>) <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, <it>Phanerochaete chrysosporium </it>manganese peroxidase (<it>MnP</it>) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, barley alpha-amylase <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, porcine pancreatic prophospholipase A2 (<it>proPLA2</it>) <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, and correctly assembled human immunoglobulins <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p><it>Aspergillus niger </it>is presently one of the most important organisms used in biotechnology. Reflecting this, there are 784 genomic DNA and mRNA sequence entries representing 379 unique genes available in GenBank databases (July 20, 2005 release). The identification of additional genes will enhance further efforts to increase the industrial utility of this organism. Analysis of EST sequences provides a cost-effective approach for gene discovery. Furthermore, EST-derived sequences facilitate genome sequence annotation through the identification of transcription unit boundaries, exon-intron junctions, and genes that lack sequence similarity with previously discovered genes. For these reasons, we initiated an <it>A. niger </it>EST-based gene discovery program. Using normalization methods to enrich for cDNA templates representing weakly expressed genes we identified 5,108 unique genes of which 44.5% encode proteins with significant similarity to GenBank entries that have at least a tentatively assigned function. Using the Gene Ontology hierarchy <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, we present a classification of the proteins encoded by these <it>A. niger </it>genes and compare its protein repertoire with other well-studied fungal species. Our annotated <it>A. niger </it>EST collection is available at our website <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Library normalization and subtraction</p>
            </st>
            <p>A major challenge confronting EST-based gene discovery programs is differential mRNA abundance. Usually, a few hundred highly and moderately expressed genes produce more than half of the cellular mRNA molecules, whereas several thousand genes account for the remaining mRNA mass <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Sequencing randomly selected clones from standard cDNA libraries therefore inefficiently identifies rare transcripts, owing to the repeated occurrence of moderately and highly abundant cDNA species. We employed virtual subtraction and direct subtraction to enhance the number of unique genes identified. The virtual subtraction method <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> classifies cDNA clones according to the abundance of the mRNAs they represent (Figure <figr fid="F1">1A</figr>). The direct subtraction method removes previously identified cDNA clones from the gene discovery pipeline. We initiated this EST-based gene discovery program by sequencing 2,000 randomly selected clones. Next, we sequenced 2,304 of the low intensity clones identified by virtual subtraction. Finally, we sequenced 10,738 clones that gave very low hybridization signals when subjected to both virtual and direct subtraction.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Virtual normalization and direct subtraction</p>
               </caption>
               <text>
                  <p><b>Virtual normalization and direct subtraction</b>. A) Relative signal intensity of clones, determined as a ratio of the signal intensity of each individual clone versus the maximum signal intensity present on the array. Signal intensities for the colonies derived from a single 384 well microplate are displayed as a function of relative colony signal strength. B) Proportion of unique ESTs obtained at various stages of the gene discovery process. 1: Sequencing of the first 192 randomly selected clones, 2: Sequencing of the last 192 clones of the 1920 randomly selected clones. 3: Sequencing of the first 192 clones obtained by virtual normalization. 4: Sequencing of the last 192 clones from the 2,304 clones obtained by virtual subtraction. 5: Sequencing of the first 192 clones selected after virtual normalization and direct subtraction, first round. 6: Sequencing of the last 192 clones selected after both virtual normalization and direct subtraction, first round. 7: Sequencing of the first 192 clones selected after virtual normalization and direct subtraction, second round. 8: Sequencing of the last 192 clones from the 10,738 clones selected after virtual normalization and direct subtraction, second round.</p>
               </text>
               <graphic file="1471-2180-6-7-1"/>
            </fig>
            <p>Figure <figr fid="F1">1B</figr> presents the gene discovery rates obtained while sequencing the randomly selected clones, the clones selected following virtual subtraction, and the clones selected following virtual and direct subtraction. We obtained 5,202 singleton and contig sequences after processing 12,820 high quality EST sequences (Table <tblr tid="T1">1</tblr>). This means that we identified roughly one gene for every 2.5 EST sequences. This result compares favorably with the results obtained by some other large-scale EST projects of lower eukaryotes. For instance, a <it>Neurospora crassa </it>project produced 20,019 ESTs and identified 1,431 genes <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> for a gene discovery rate of one gene for every 14 EST sequences, and a <it>Dictyostelium discoideum </it>gene discovery project that generated 26,954 ESTs identified 5,381 unigenes for a gene discovery rate of one gene for every 5 EST sequences <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p><it>A. niger </it>EST summary</p>
               </caption>
               <tblbdy cols="2">
                  <r>
                     <c ca="left">
                        <p>Total templates processed</p>
                     </c>
                     <c ca="center">
                        <p>15,052</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Total EST sequences obtained</p>
                     </c>
                     <c ca="center">
                        <p>12,820</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Average insert size (bp)</p>
                     </c>
                     <c ca="center">
                        <p>1,470</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Average length of high quality sequence per EST sequence obtained (bp)</p>
                     </c>
                     <c ca="center">
                        <p>551</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Average contig size (bp)</p>
                     </c>
                     <c ca="center">
                        <p>693</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of clusters</p>
                     </c>
                     <c ca="center">
                        <p>5,108</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of clones with full-length inserts</p>
                     </c>
                     <c ca="center">
                        <p>2,407</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of coding sequences completely sequenced</p>
                     </c>
                     <c ca="center">
                        <p>650</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Number of clusters derived from more that one unique singleton and/or contig</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>GC content</p>
                     </c>
                     <c ca="center">
                        <p>53.5 %</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Full-length ORFs with a potential signal peptide</p>
                     </c>
                     <c ca="center">
                        <p>292</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Unknown sequences which encode a signal peptide</p>
                     </c>
                     <c ca="center">
                        <p>107</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Contig assembling and analysis of <it>A. niger </it>ESTs</p>
            </st>
            <p>We submitted the 12,820 high quality ESTs to GenBank [GenBank: <ext-link ext-link-type="gen" ext-link-id="DR697868">DR697868</ext-link> &#8211; GenBank: <ext-link ext-link-type="gen" ext-link-id="DR710686">DR710686</ext-link>]. Table <tblr tid="T1">1</tblr> shows that the individual sequencing reads contained 400&#8211;800 nucleotides of high-quality sequence. The EST assembly produced by phrap <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> yielded 5,202 unisequences that included 2,183 singletonsand 3,019 contigs. Following assembly, we used BLASTN to cluster the closely related singletons and contigs. Clustering assembled 168 of the 5,202 phrap unisequences into 74 clusters, each containing 2&#8211;4 sequences. Manually confirmed ClustalW alignments showed that 56 clusters were generated by assembling alternatively spliced derivatives of 117 phrap unisequences. Taking into account the 74 clusters assembled from multiple unisequences, the 12,820 ESTs generated 5,108 clusters. The clusters predicted to have arisen through alternative splicing are available in <supplr sid="S1">Additional file 1</supplr>. Prior to submission of our EST sequences, we found 784 <it>A. niger </it>genomic DNA and cDNA-derived sequence entries in the GenBank database (June 22, 2005 release). These entries formed 379 unique genes. BLASTN analysis showed that 252 of the phrap unisequences aligned with at least one of the <it>A. niger </it>GenBank entries (alignment length >50, identity >95%). Therefore, this study identified about 4,856 new <it>A. niger </it>genes. The results from our EST sequencing, contig assembly and clustering analysis are summarized in Table <tblr tid="T1">1</tblr>.</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p><b><it>A. niger </it>clusters derived from alternatively spliced unisequences</b>. Table presenting the 56 manually verified clusters that were generated by alternative splicing of 117 phrap unisequences. For each cluster the table includes; the unisequences present in each cluster, the function as assigned by BLAST-based similarity, the BLAST subject species, the GenBank ID for the BLAST subject used for functional assignment, and the Expect value obtained with each unisequence.</p>
               </text>
               <file name="1471-2180-6-7-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Comparative analysis of the phrap unisequences</p>
            </st>
            <p>We attempted to determine the putative function of the set of 5,202 phrap unisequences by searching for homologs in the GenBank non-redundant protein database using BLASTX (Table <tblr tid="T2">2</tblr>). Setting the BLASTX cutoff value at E = 1e <sup>-5</sup>, about 83% of these sequences display similarity to at least one GenBank entry, 44.5% to genes of known function and 38% to genes of unknown function. The remaining sequences, 17 %, code for proteins that lack similarity with any GenBank entry.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Distribution of homology between the unique set of <it>A. niger </it>singleton and contig sequences and various databases as determined by BLASTX</p>
               </caption>
               <tblbdy cols="6">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Number of genes with similarity</b>
                        </p>
                        <p>
                           <b> (E &#8804; e</b>
                           <sup>
                              <b>-5</b>
                           </sup>
                           <b>)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Highly significant homology</b>
                        </p>
                        <p>
                           <b> (E &#8804; e</b>
                           <sup>
                              <b>-30 </b>
                           </sup>
                           <b>)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Moderate homology</b>
                        </p>
                        <p>
                           <b> (e</b>
                           <sup>
                              <b>-30</b>
                           </sup>
                           <b> &lt; E &#8804; e</b>
                           <sup>
                              <b>-10</b>
                           </sup>
                           <b>)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Weak homology</b>
                        </p>
                        <p>
                           <b> (e</b>
                           <sup>
                              <b>-10</b>
                           </sup>
                           <b>&lt; E &#8804; e</b>
                           <sup>
                              <b>-5</b>
                           </sup>
                           <b>)</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Insignificant similarity</b>
                        </p>
                        <p>
                           <b> (e</b>
                           <sup>
                              <b>-5</b>
                           </sup>
                           <b>&lt; E)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="6">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Total GenBank Database set</p>
                     </c>
                     <c ca="center">
                        <p>4321 (83.06%)</p>
                     </c>
                     <c ca="center">
                        <p>3367 (64.73 %)</p>
                     </c>
                     <c ca="center">
                        <p>811 (15.59%)</p>
                     </c>
                     <c ca="center">
                        <p>143 (2.75%)</p>
                     </c>
                     <c ca="center">
                        <p>881 (16.93%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Set of predicted proteins for <it>A. nidulans</it></p>
                     </c>
                     <c ca="center">
                        <p>4195 (80.64%)</p>
                     </c>
                     <c ca="center">
                        <p>3313 (63.69%)</p>
                     </c>
                     <c ca="center">
                        <p>682 (13.11%)</p>
                     </c>
                     <c ca="center">
                        <p>200 (3.84%)</p>
                     </c>
                     <c ca="center">
                        <p>1007 (19.36%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Set of predicted proteins for <it>N. crassa</it></p>
                     </c>
                     <c ca="center">
                        <p>3615 (69.49%)</p>
                     </c>
                     <c ca="center">
                        <p>2261 (43.46%)</p>
                     </c>
                     <c ca="center">
                        <p>995 (19.13%)</p>
                     </c>
                     <c ca="center">
                        <p>359 (6.90%)</p>
                     </c>
                     <c ca="center">
                        <p>1587 (30.51%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Set of predicted proteins for <it>P. chrysosporium</it></p>
                     </c>
                     <c ca="center">
                        <p>2581 (49.62%)</p>
                     </c>
                     <c ca="center">
                        <p>1292 (24.84%)</p>
                     </c>
                     <c ca="center">
                        <p>864 (16.60%)</p>
                     </c>
                     <c ca="center">
                        <p>425 (8.17%)</p>
                     </c>
                     <c ca="center">
                        <p>2621 (50.38%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="center">
                        <p>Set of predicted proteins for <it>S. cerevisiae</it></p>
                     </c>
                     <c ca="center">
                        <p>2386 (45.87%)</p>
                     </c>
                     <c ca="center">
                        <p>1110 (21.34%)</p>
                     </c>
                     <c ca="center">
                        <p>803 (15.44%)</p>
                     </c>
                     <c ca="center">
                        <p>359 (6.90%)</p>
                     </c>
                     <c ca="center">
                        <p>2816 (54.13%)</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>We also compared the proteins encoded by these sequences with the proteins predicted from the completely sequenced genomes of three Ascomycetes, <it>Saccharomyces cerevisiae </it><abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, <it>Aspergillus nidulans </it>and <it>Neurospora crassa </it><abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, and one Basidiomycete, the white rot fungus <it>Phanerochaete chrysosporium </it><abbrgrp><abbr bid="B22">22</abbr></abbrgrp>. As expected, the highest degree of similarity (BLASTX alignments with E values &#8804; e<sup>-30</sup>) is with <it>A. nidulans</it>, where 64% of these <it>A. niger </it>unisequences encode proteins that have <it>A. nidulans </it>homologs (Table <tblr tid="T2">2</tblr>). Nonetheless, almost 20% of the <it>A. niger </it>genes did not have a homolog (E > e<sup>-5</sup>) in <it>A. nidulans</it>.</p>
            <p>Although the Sordariomycetes, which include <it>N. crassa</it>, and the Eurotiomycetes, which include the Aspergilli, diverged about 670 million years (Myr) ago <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, over 43% of the predicted <it>A. niger </it>proteins are highly similar (E &#8804; e<sup>-30</sup>) to <it>N. crassa </it>predicted proteins. For the more distantly related Saccharomycotinna <it>S. cerevisiae </it>and Hymenomycete <it>P. chrysosporium</it>, which diverged from the Eurotiomycetes lineage about 1,090 and 1,210 Myr ago, respectively <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, only 21% and 25% of the <it>A. niger </it>predicted proteins had highly similar homologs (E &#8804; e<sup>-30</sup>).</p>
         </sec>
         <sec>
            <st>
               <p>Functional classification of genes based on Gene Ontology terms</p>
            </st>
            <p>The predicted <it>A. niger </it>protein products were assigned Gene Ontology (GO) classifiers based on BLASTX alignments (expected values of E &#8804; e<sup>-5</sup>) generated by searching the GO annotated Swiss-Prot and TrEMBL databases. GO categories were assigned to 2,549 of the 5,202 predicted protein products. Figure <figr fid="F2">2</figr> summarizes the resulting GO assignments, which are available in <supplr sid="S2">Additional file 2</supplr>. More detailed annotations, including the BLAST alignments, Expect Values and BLAST Scores generated by searching the GenBank nr database are available online <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and can be used to assess the reliability of functional predictions on a gene by gene basis.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>GO mappings for the <it>A. niger </it>proteins</p>
               </caption>
               <text>
                  <p><b>GO mappings for the <it>A. niger </it>proteins</b>. Relative representation of GO mappings for the proteins coded for by the unique set of <it>A. niger </it>singletons and contigs. A) Biological process; B) Cellular component; C) Molecular function. Note, because individual proteins can map to multiple GO categories, the sum of the GO mappings can exceed 100%.</p>
               </text>
               <graphic file="1471-2180-6-7-2"/>
            </fig>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p><b>Gene Ontology annotations of the <it>A. niger </it>proteins</b>. Tables presenting the distribution of Gene Ontology classifiers for the 2,549 <it>A. niger </it>unisequences that encoded proteins with similarity to protein entries in the GO annotated Swiss-Prot and TrEMBL database. Table A, presents the distribution of 1,696 <it>A. niger </it>proteins that could be assigned Biological Process category and subcategory classifiers. Table B, presents the distribution of the 1,195 <it>A. niger </it>proteins that could be assigned Cellular Component category and subcategory classifiers. Table C, presents the distribution of the 1,691 <it>A. niger </it>proteins that could be assigned Cellular Component category and subcategory classifiers.</p>
               </text>
               <file name="1471-2180-6-7-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>We compared the distribution of GO classifiers obtained for the <it>A. niger </it>unisequences and the predicted genes of six fungal species (Table <tblr tid="T3">3</tblr>). The gene distribution in the main ontology categories was very similar across all seven species. However, the fission and budding yeasts have a higher proportion of genes in the "cell growth and/or maintenance" categories, 45.2% and 48.5%, than did the filamentous fungi, where the proportion ranged from 29.4% to 36.2%. Since we found no correlation between evolutionary distance and these differences, it seems likely that they reflect differences in gene number. The genomes of the five filamentous fungi encode 9,000&#8211;12,000 genes <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp> whereas the fission and baker's yeast genomes have about 4,824 <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and 6,335 <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> protein-coding genes, respectively. The much smaller number of genes present in these two yeast species suggests that they may have close to the minimum number of genes needed by a free-living eukaryotic cell <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Comparison of GO profiling among different fungal species</p>
               </caption>
               <tblbdy cols="9">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7" ca="center">
                        <p>
                           <b>% Representation to total in main category</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c cspan="7">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Gene Ontology</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Categories and subcategories</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>S. cerevisiae</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>S. pombe</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>N. crassa</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>P. chrysos-porium</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>A. niger</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>A. nidulans</it>
                           </b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>
                              <it>M. grisea</it>
                           </b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Biological process</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>metabolism</p>
                     </c>
                     <c ca="center">
                        <p>71.4%</p>
                     </c>
                     <c ca="center">
                        <p>72.7%</p>
                     </c>
                     <c ca="center">
                        <p>76.0%</p>
                     </c>
                     <c ca="center">
                        <p>74.6%</p>
                     </c>
                     <c ca="center">
                        <p>75.6%</p>
                     </c>
                     <c ca="center">
                        <p>71.4%</p>
                     </c>
                     <c ca="center">
                        <p>75.3%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>cellular physiological process</p>
                     </c>
                     <c ca="center">
                        <p>50.3%</p>
                     </c>
                     <c ca="center">
                        <p>46.7%</p>
                     </c>
                     <c ca="center">
                        <p>34.2%</p>
                     </c>
                     <c ca="center">
                        <p>32.6%</p>
                     </c>
                     <c ca="center">
                        <p>30.2%</p>
                     </c>
                     <c ca="center">
                        <p>34.8%</p>
                     </c>
                     <c ca="center">
                        <p>36.9%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="right">
                        <p>cell growth and/or maintenance</p>
                     </c>
                     <c ca="center">
                        <p>48.5%</p>
                     </c>
                     <c ca="center">
                        <p>45.2%</p>
                     </c>
                     <c ca="center">
                        <p>31.9%</p>
                     </c>
                     <c ca="center">
                        <p>31.6%</p>
                     </c>
                     <c ca="center">
                        <p>29.4%</p>
                     </c>
                     <c ca="center">
                        <p>33.8%</p>
                     </c>
                     <c ca="center">
                        <p>36.2%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>cell communication</p>
                     </c>
                     <c ca="center">
                        <p>4.8%</p>
                     </c>
                     <c ca="center">
                        <p>11.05%</p>
                     </c>
                     <c ca="center">
                        <p>12.8%</p>
                     </c>
                     <c ca="center">
                        <p>4.7%</p>
                     </c>
                     <c ca="center">
                        <p>3.8%</p>
                     </c>
                     <c ca="center">
                        <p>3.9%</p>
                     </c>
                     <c ca="center">
                        <p>4.3%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Cellular component</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>cell</p>
                     </c>
                     <c ca="center">
                        <p>99.3%</p>
                     </c>
                     <c ca="center">
                        <p>99.74%</p>
                     </c>
                     <c ca="center">
                        <p>96.8%</p>
                     </c>
                     <c ca="center">
                        <p>95.5%</p>
                     </c>
                     <c ca="center">
                        <p>98.1%</p>
                     </c>
                     <c ca="center">
                        <p>97.8%</p>
                     </c>
                     <c ca="center">
                        <p>96.5%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>unlocalized</p>
                     </c>
                     <c ca="center">
                        <p>1.0%</p>
                     </c>
                     <c ca="center">
                        <p>0.26%</p>
                     </c>
                     <c ca="center">
                        <p>1.9%</p>
                     </c>
                     <c ca="center">
                        <p>2.7%</p>
                     </c>
                     <c ca="center">
                        <p>0.8%</p>
                     </c>
                     <c ca="center">
                        <p>0.9%</p>
                     </c>
                     <c ca="center">
                        <p>0.8%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>extracellular</p>
                     </c>
                     <c ca="center">
                        <p>0.3%</p>
                     </c>
                     <c ca="center">
                        <p>0.0%</p>
                     </c>
                     <c ca="center">
                        <p>0.7%</p>
                     </c>
                     <c ca="center">
                        <p>2.2%</p>
                     </c>
                     <c ca="center">
                        <p>1.3%</p>
                     </c>
                     <c ca="center">
                        <p>1.3%</p>
                     </c>
                     <c ca="center">
                        <p>2.3%</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="9">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <b>Molecular function</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>catalytic activity</p>
                     </c>
                     <c ca="center">
                        <p>52.1%</p>
                     </c>
                     <c ca="center">
                        <p>76.46%</p>
                     </c>
                     <c ca="center">
                        <p>54.9%</p>
                     </c>
                     <c ca="center">
                        <p>65.4%</p>
                     </c>
                     <c ca="center">
                        <p>62.6%</p>
                     </c>
                     <c ca="center">
                        <p>60.4%</p>
                     </c>
                     <c ca="center">
                        <p>59.9%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>binding</p>
                     </c>
                     <c ca="center">
                        <p>46.6%</p>
                     </c>
                     <c ca="center">
                        <p>19.47%</p>
                     </c>
                     <c ca="center">
                        <p>45.7%</p>
                     </c>
                     <c ca="center">
                        <p>43.0%</p>
                     </c>
                     <c ca="center">
                        <p>41.8%</p>
                     </c>
                     <c ca="center">
                        <p>41.5%</p>
                     </c>
                     <c ca="center">
                        <p>49.5%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>transporter activity</p>
                     </c>
                     <c ca="center">
                        <p>14.8%</p>
                     </c>
                     <c ca="center">
                        <p>1.50%</p>
                     </c>
                     <c ca="center">
                        <p>11.0%</p>
                     </c>
                     <c ca="center">
                        <p>10.6%</p>
                     </c>
                     <c ca="center">
                        <p>15.5%</p>
                     </c>
                     <c ca="center">
                        <p>12.2%</p>
                     </c>
                     <c ca="center">
                        <p>10.1%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>transcription regulator activity</p>
                     </c>
                     <c ca="center">
                        <p>6.6%</p>
                     </c>
                     <c ca="center">
                        <p>4.28%</p>
                     </c>
                     <c ca="center">
                        <p>4.9%</p>
                     </c>
                     <c ca="center">
                        <p>3.3%</p>
                     </c>
                     <c ca="center">
                        <p>4.6%</p>
                     </c>
                     <c ca="center">
                        <p>6.6%</p>
                     </c>
                     <c ca="center">
                        <p>4.1%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>structural molecule activity</p>
                     </c>
                     <c ca="center">
                        <p>5.3%</p>
                     </c>
                     <c ca="center">
                        <p>0.0%</p>
                     </c>
                     <c ca="center">
                        <p>2.9%</p>
                     </c>
                     <c ca="center">
                        <p>1.9%</p>
                     </c>
                     <c ca="center">
                        <p>2.1%</p>
                     </c>
                     <c ca="center">
                        <p>1.7%</p>
                     </c>
                     <c ca="center">
                        <p>2.2%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>enzyme regulator activity</p>
                     </c>
                     <c ca="center">
                        <p>2.9%</p>
                     </c>
                     <c ca="center">
                        <p>0.86%</p>
                     </c>
                     <c ca="center">
                        <p>1.3%</p>
                     </c>
                     <c ca="center">
                        <p>1.1%</p>
                     </c>
                     <c ca="center">
                        <p>0.7%</p>
                     </c>
                     <c ca="center">
                        <p>0.7%</p>
                     </c>
                     <c ca="center">
                        <p>0.9%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>chaperone activity</p>
                     </c>
                     <c ca="center">
                        <p>2.2%</p>
                     </c>
                     <c ca="center">
                        <p>2.50%</p>
                     </c>
                     <c ca="center">
                        <p>1.5%</p>
                     </c>
                     <c ca="center">
                        <p>1.1%</p>
                     </c>
                     <c ca="center">
                        <p>1.1%</p>
                     </c>
                     <c ca="center">
                        <p>0.8%</p>
                     </c>
                     <c ca="center">
                        <p>1.0%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>signal transducer activity</p>
                     </c>
                     <c ca="center">
                        <p>1.9%</p>
                     </c>
                     <c ca="center">
                        <p>0.0%</p>
                     </c>
                     <c ca="center">
                        <p>1.8%</p>
                     </c>
                     <c ca="center">
                        <p>1.9%</p>
                     </c>
                     <c ca="center">
                        <p>1.5%</p>
                     </c>
                     <c ca="center">
                        <p>1.1%</p>
                     </c>
                     <c ca="center">
                        <p>1.2%</p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>translation regulator activity</p>
                     </c>
                     <c ca="center">
                        <p>1.4%</p>
                     </c>
                     <c ca="center">
                        <p>0.0%</p>
                     </c>
                     <c ca="center">
                        <p>1.2%</p>
                     </c>
                     <c ca="center">
                        <p>1.1%</p>
                     </c>
                     <c ca="center">
                        <p>1.8%</p>
                     </c>
                     <c ca="center">
                        <p>0.8%</p>
                     </c>
                     <c ca="center">
                        <p>0.8%</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Identification of putative secreted proteins</p>
            </st>
            <p><it>Aspergillus niger </it>is the source of a number of secreted proteins produced for various industrial applications. Gene Ontology mapping categorized only 15 of the predicted proteins as "extracellular" (<supplr sid="S2">Additional file 2</supplr> ). However, we were able to assign a GO component classifier to only 1,195 (23.4%) of the encoded proteins. To identify potential secreted proteins we used SignalP 3 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> to search for proteins with a secretion signal. SignalP predicted that about 400 of the predicted proteins had a signal peptide (<supplr sid="S3">Additional file 3</supplr> ). Blast searches showed that 293 of these proteins were similar (E &#8804; e<sup>-5</sup>)to at least one GenBank entry. The 27% of predicted proteins with a signal peptide that do not have a GenBank homolog is significantly higher that the 17.5% of predicted orphan proteins. The reason for these differences remains unknown although they may suggest that the fungal secretome is subject to rapid evolution.</p>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p><b><it>A. niger </it>unisequences coding for proteins with a predicted signal peptide</b>. This file is a table listing the 399 <it>A. niger </it>unisequences that code for proteins with a predicted signal peptide. For each unisequence the table includes the unisequence identifier (column indicated Contig), the Mean Value in the output of SiganlP, the position of the signal peptide relative to the predicted N-terminal methionine, the GenBank definition line for the BLAST subject with the lowest Expect value and an assigned function, the GenBank ID for the BLAST subject used as the source of the definition line and the BLAST E value.</p>
               </text>
               <file name="1471-2180-6-7-S3.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Characterization of secretion pathway proteins</p>
            </st>
            <p>Recent strategies for improving the efficiency of heterologous protein expression in <it>A. niger </it>have focused on molecular genetic manipulation of the secretory pathway. In some cases, these approaches have significantly increased the expression of selected heterologous proteins <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. Using GO mappings and BLAST analysis we identified 118 genes that apparently participate in various steps of the protein secretion pathway (<supplr sid="S4">Additional file 4</supplr> ). Fifteen genes encode secretion-related ER chaperones, foldases and proteases; 77 encode putative proteins involved in protein transport, protein targeting and vesicle-mediated transport; and 26 code for proteins that are involved in secretion-related post-translational modifications. The <it>A. niger </it>genes identified in this study included all the previously identified secretion-related ER chaperones, foldases and quality control proteins: <it>bipA </it>(Asp84), <it>pdiA </it>(Asp734, Asp1902), <it>prpA </it>(Asp4188), <it>tigA </it>(Asp1020), <it>cybB </it>(Asp662), <it>clxA </it>(calnexin) (Asp1882), and <it>kexB </it>(kexin) (Asp177) <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>.</p>
            <suppl id="S4">
               <title>
                  <p>Additional File 4</p>
               </title>
               <text>
                  <p><b>Putative secretory pathway proteins</b>. This file is a table listing the <it>A. niger </it>unisequences that code for proteins that are predicted to function in the secretory pathway. For each unisequence the table includes the unisequence identifier (column designated "Contig"), the predicted function assigned as described for Additional file 3, the GenBank ID for the BLAST subject that provided the predicted function, the BLAST subject organism, the associated Expect value and BLAST score, and the number of identical residues over the number of amino acids in the alignment.</p>
               </text>
               <file name="1471-2180-6-7-S4.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>Previous studies with <it>A. niger </it>identified five secretion-related GTPases belonging to the Ras super-family, <it>SrgA</it>, <it>SrgB</it>, <it>SrgC</it>, <it>SrgD</it>, and <it>SrgE</it>, and one member of the ARF/SAR subfamily, <it>SarA </it><abbrgrp><abbr bid="B31">31</abbr><abbr bid="B34">34</abbr></abbrgrp>. Our <it>A. niger </it>sequences included the earlier identified <it>SarA </it>(Asp4377), <it>SrgA </it>(Asp5114, Asp4222), <it>SrgB </it>(Asp3374, Asp70) and <it>SrgE </it>(Asp1610) genes. We also identified contigs Asp1708, which encodes a protein with 47% similarity to the <it>S. cerevisiae </it>GTP-binding protein YPT52 <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>, and Asp1824 and Asp1217 that code for proteins with 87% and 94% identity with <it>Aspergillus nidulans </it>members of the Rab subfamily of small GTPases <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>.</p>
            <p>Post-translational modifications such as glycosylations are often important for the production of biologically active secreted proteins. For instance, introducing an N-glycosylation site into bovine chymosin increased the amount of secreted chymosin expressed by <it>A. niger </it>10-fold <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Identification of the various genes involved in <it>O</it>- and <it>N</it>- linked glycosylations <abbrgrp><abbr bid="B38">38</abbr></abbrgrp> would facilitate efforts to engineer the <it>A. niger </it>glycosylation pathway. We identified several putative members of the <it>N</it>- and <it>O</it>-linked protein glycosylation pathways, including; six PTM related O-mannosyltransferases, contigs Asp370, Asp4472, Asp170, Asp1044, Asp1344, and Asp3205 <abbrgrp><abbr bid="B39">39</abbr><abbr bid="B40">40</abbr></abbrgrp> and genes that are involved in <it>N</it>-linked protein glycosylation such as two contigs, Asp1340, and Asp458, that encode homologs of oligosaccharyl transferases <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The 12,820 ESTs identified in this study represent a major attempt to define the <it>A. niger </it>gene set and represent about 5,108 genes. These data dramatically increase the number of identified <it>A. niger </it>genes. We have established a searchable web-based database that includes annotations for each EST and the derived contig assemblies to facilitate research community access to this important resource.</p>
         <p>Annotation of the phrap unisequences revealed that 83% had a putative homolog in other species, and therefore about 17% represented novel genes. The template cDNA clones, and their derived EST and contig sequences provide a basis for studying the function of individual genes as well as genome-wide studies of the regulatory networks and cellular functions that define <it>A. niger</it>. They will also assist gene identification, mapping and annotation efforts once the draft genome sequence of <it>A. niger </it>is completed and released. <it>A. niger</it>, known for its efficient secretion machinery, is widely used as a host for the production of native and foreign secreted proteins. However, for many proteins problems have arisen in obtaining high amounts in the culture medium. This study identified 399 putative secreted proteins, and 118 proteins that are putatively involved in various steps of the protein secretion pathway. These sequences should facilitate future efforts to engineering <it>A. niger </it>strains with improved secretion capabilities for proteins presently difficult to express. Additional details about this study and access to the <it>A. niger </it>EST database can be found on our fungal genomics web site <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Source material, total and poly (A)<sup>+</sup>RNA isolation</p>
            </st>
            <p><it>Aspergillus niger </it>strain N402, FGSC #4732 was grown at 30&#176;C in Minimal Medium <abbrgrp><abbr bid="B42">42</abbr></abbrgrp> containing 1% w/v of various carbon sources with shaking at 150 RPM. The carbon sources used were: glucose, bran, maltose, xylan, xylose, sorbitol, and lactose. Mycelial samples harvested by filtration and pressed between layers of filter paper to remove excess liquid, were stored at -80&#176;C.</p>
            <p>Total RNA was extracted from each mycelial sample. For this, 1.5 g of each frozen mycelial sample was ground to a fine powder in liquid nitrogen. Total RNA was extracted from the powdered mycelial masses using TRIzol<sup>&#174; </sup>reagent following the manufacturer's recommendations (Invitrogen, Burlington, ON). Total RNA (200 &#956;g) from each culture condition was pooled and the poly(A)<sup>+ </sup>RNA was purified using oligo-dT cellulose column chromatography (Amersham Biosciences Corp, Piscataway, NJ). Quality and quantification of the RNA were analyzed by running the RNA samples on an Agilent 2100 bioanalyzer (Agilent Technologies, Palo Alto, CA).</p>
         </sec>
         <sec>
            <st>
               <p>cDNA library construction</p>
            </st>
            <p>The cDNA library was constructed using a Zap-cDNA<sup>&#174; </sup>Synthesis Kit according to the manufacturer's instructions (Stratagene, La Jolla, CA). Double-stranded cDNA was directionally cloned into the pBluescript<sup>&#174; </sup>KS <sup>+ </sup>vector (Stratagene, La Jolla, CA) between its <it>Eco</it>RI (5'-end) and <it>Xho</it>I (3'-end) sites and transformed into <it>E. coli </it>strain DH5&#945;.</p>
         </sec>
         <sec>
            <st>
               <p>Plasmid DNA extraction and sequencing</p>
            </st>
            <p>The cDNA library was plated onto LB-ampicillin agar containing X-GAL and IPTG. White colonies were picked and inoculated into 384-well plates containing LB-ampicillin medium using a VersArray robotic colony picker and arrayer system (Bio-Rad, Laboratories, Canada), grown overnight and stored at -70&#176;C after the addition of glycerol (10% v/v). To prepare plasmid DNA from each sample, bacterial inoculates were transferred from the 384 well storage plates to 96-well growth blocks containing 1 ml of 2YT-ampicillin medium per well (Corning, Acton, MA) and grown overnight. Recombinant plasmids were extracted using alkaline lysis <abbrgrp><abbr bid="B43">43</abbr></abbrgrp> and subjected to single-pass sequencing from the T7 universal primer site (5'-end) using an ABI 3730 XL automated sequencing machine (Applied Biosystems, Foster City, CA) at the G&#233;nome Qu&#233;bec Innovation Centre (Montreal, PQ).</p>
         </sec>
         <sec>
            <st>
               <p>Virtual normalization, direct subtraction and selection of colonies forsequencing</p>
            </st>
            <p>Two methods were used to normalize the library. For virtual normalization <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, bacterial colonies harboring independent cDNA clones were arrayed from the 384-well plates onto nitrocellulose membranes, 9,216 colonies per 492-cm<sup>2 </sup>membrane. The membranes were probed using radiolabeled cDNA. The probe was prepared as follows; double-stranded cDNA was produced from the same mRNA population that was used for library construction using the SMART cDNA construction kit (BD Biosciences, Mississauga, ON) according to the manufacturer's instructions. The double-stranded cDNA was labeled with [<sup>32</sup>P]dCTP by random priming, using the Rediprime&#8482; II Random Prime Labeling System (Amersham Biosciences Corp, Piscataway, NJ). The labeled cDNA was used to probe six membranes, arrayed with 55,296 clones, and the clones were ranked according to the relative intensity of their hybridization signals (Figure <figr fid="F1">1A</figr>). Based on these intensity ratios the colonies were divided into three groups, high (relative intensity 50%-100%), moderate (relative intensity 10%-50%), and weak (relative intensity less than 10%).</p>
            <p>For direct subtraction, plasmid DNAs representing each of the non-redundant genes that had already been identified was pooled. The pooled plasmid DNAs were linearized with the restriction endonuclease <it>Xho</it>I and radiolabeled "run-off" transcripts were generated using the Riboprobe <it>in vitro </it>Transcription System (Promega, Madison, WI). The probe RNA was then used to hybridize to the same membranes that had been subjected to virtual subtraction.</p>
            <p>After hybridization, the membranes were exposed to X-ray film, and the intensity of the signal for each colony was quantified using GeneTools image software (Synoptics Limited). The intensity data for each clone was stored in our in-house database. The clones chosen for sequencing were based on the relative intensity of their hybridization signals, determined as a ratio of signal intensity of the individual clone to the maximum signal intensity present on the array.</p>
         </sec>
         <sec>
            <st>
               <p>Sequence quality control, contig assembly, and sequence analysis</p>
            </st>
            <p>The chromatograms obtained following single pass sequencing of the cDNA clones were processed using three software tools, phred to assign sequence quality values <abbrgrp><abbr bid="B44">44</abbr><abbr bid="B45">45</abbr></abbrgrp>, lucy to remove vector sequences and regions of low quality sequence <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, and phrap to assemble overlapping sequences into contigs <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. Sequence similarity searches against the NCBI non-redundant database were conducted using BLASTX <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> with default BLAST parameters. The top 5 scoring BLASTX hits with E values less than e<sup>-5 </sup>were used to annotate each EST and EST-derived assembly using our annotation program TargetIdentifier <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. Sequences that did not return alignments with E values less than e<sup>-5 </sup>were then used to perform BLASTN searches against the NCBI non-redundant nucleotide database. The top 5 BLASTN hits for each query, where the <it>E </it>value was required to be &lt; e<sup>-5</sup>, were then used for annotation. The resulting output files are uploaded to a local MySQL database.</p>
            <p>Redundancy was also analyzed by means of clustering based on the BLASTN alignments. Sequences that exhibited more than 93% identity over lengths of at least 100 bases were assigned to the same cluster. Cluster assignments were confirmed by additional analysis using ClustalW <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
            <p>For comparing E values obtained by searching databases of different sizes, we normalized the E-values using the following formula:</p>
            <p>E<sub>n </sub>= E specific *S nr/S specific,</p>
            <p>E<sub>n</sub>: the normalized E value, it is the subject/query E value that would have been obtained had the alignment been generated by searching a database having the same number of amino acids as the NCBI-nr database; E specific: E-value retuned by BLASTX when searching a user specified database other than the NCBI-nr database; S specific: number of amino acids in the user defined database; S nr: number of amino acids in the NCBI-nr database (total 617,284,665).</p>
            <p>TargetIdentifier was used to estimate the proportion of the clones that contained complete coding sequences. The criteria used for establishing that a cDNA included the complete ORF can be found on our web site <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Annotation and functional binning</p>
            </st>
            <p>Annotation and functional binning were accomplished using tools provided by the Gene Ontology Consortium <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. Annotations were based on the Gene Ontology (GO) terms and hierarchical structure <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. Reference sequences were selected from the BLASTX results with E values less than e-5 obtained by searching the Swiss-Prot database of manually annotated proteins and the TrEMBL database of proteins with automated annotations. The GO categories associated with the BLASTX subject giving the highest score from the Swiss-Prot and TrEMBL databases were used to annotate our <it>A. niger </it>singletons and contigs. The GO term annotations were merged and loaded into the AmiGO browser and database <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. The resulting GO-derived annotations can be viewed with the AmiGO browser at our website <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Signal peptide prediction</p>
            </st>
            <p>The coding region of each singleton and contig was predicted and translated into protein sequences using our OrfPredictor program <abbrgrp><abbr bid="B55">55</abbr></abbrgrp>. The N-terminal 50 amino acids of each predicted polypeptide were searched for a signal peptide using SignalP version 3 <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>NS normalized the library, prepared the manuscript, and contributed to the gene ontology classification. RS, GB and AT designed the project and the databases, and contributed to the preparation of manuscript. TJ contributed to the culture of the fungus and construction of the library. PG prepared the cDNA for library construction and contributed to the classification of gene ontology terms. PU tracked the clones at different stages of manipulations. XJM contributed to the annotation of the sequences. JS analyzed the raw sequences and contributed to the construction of the EST database.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank the staff at the Centre for Structural and Functional Genomics and the G&#233;nome Qu&#233;bec Innovation Centre for their technical assistance; especially Chris Beck, Aleks Spurmanis, Nathalie Brodeur, Michele Boudreau and Parviz Pezeshki. This work was supported by a Strategic Projects grant from the Natural Sciences and Engineering Research Council of Canada, and by G&#233;nome Qu&#233;bec and Genome Canada.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Aspergillus enzymes involved in degradation of plant cell wall polysaccharides</p>
            </title>
            <aug>
               <au>
                  <snm>De Vries</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Visser</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Microbiol Mol Biol Rev</source>
            <pubdate>2001</pubdate>
            <volume>65</volume>
            <fpage>497</fpage>
            <lpage>522</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">99039</pubid>
                  <pubid idtype="pmpid" link="fulltext">11729262</pubid>
                  <pubid idtype="doi">10.1128/MMBR.65.4.497-522.2001</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>On the safety of <it>Aspergillus niger</it>-a review</p>
            </title>
            <aug>
               <au>
                  <snm>Schuster</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dunn-Coleman</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Frisvad</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Van Dijck</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>Appl Microbiol Biotechnol</source>
            <pubdate>2002</pubdate>
            <volume>59</volume>
            <fpage>426</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00253-002-1032-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12172605</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Regulation of <it>Aspergillus </it>genes encoding plant cell wall polysaccharide-degrading enzymes; relevance for industrial production</p>
            </title>
            <aug>
               <au>
                  <snm>De Vries</snm>
                  <fnm>RP</fnm>
               </au>
            </aug>
            <source>Appl Microbiol Biotechnol</source>
            <pubdate>2003</pubdate>
            <volume>61</volume>
            <fpage>10</fpage>
            <lpage>20</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12658510</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Production of citric acid using immobilized conidia of <it>Aspergillus niger</it></p>
            </title>
            <aug>
               <au>
                  <snm>Bayraktar</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Mehmetoglu</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>Appl Biochem Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>87</volume>
            <fpage>117</fpage>
            <lpage>25</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1385/ABAB:87:2:117</pubid>
                  <pubid idtype="pmpid" link="fulltext">10949692</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Citric and gluconic acid production from fig by <it>Aspergillus niger </it>using solid-state fermentation</p>
            </title>
            <aug>
               <au>
                  <snm>Roukas</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Ind Microbiol Biotechnol</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>298</fpage>
            <lpage>304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/sj.jim.7000101</pubid>
                  <pubid idtype="pmpid" link="fulltext">11320416</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Identification, classification and phylogeny of the <it>Aspergillus </it>section <it>Nigri </it>inferred from mitochondrial cytochrome b gene</p>
            </title>
            <aug>
               <au>
                  <snm>Yokoyama</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Miyaji</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nishimura</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>FEMS Microbiology Letters</source>
            <pubdate>2001</pubdate>
            <volume>200</volume>
            <fpage>241</fpage>
            <lpage>246</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11425482</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Commercial levels of chymosin production by <it>Aspergillus</it></p>
            </title>
            <aug>
               <au>
                  <snm>Dunn-Coleman</snm>
                  <fnm>NS</fnm>
               </au>
               <au>
                  <snm>Bloebaum</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Berka</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Bodie</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Robinson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Przetak</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>GL</fnm>
               </au>
               <au>
                  <snm>LaCost</snm>
                  <fnm>R</fnm>
               </au>
               <etal/>
            </aug>
            <source>Biotechnology (N Y)</source>
            <pubdate>1991</pubdate>
            <volume>9</volume>
            <fpage>976</fpage>
            <lpage>981</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt1091-976</pubid>
                  <pubid idtype="pmpid">1368725</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Secretion of heterologous proteins by <it>Aspergillus niger</it>: production of active human interleukin-6 in a protease-deficient mutant by KEX2-like processing of a glucoamylase-hIL6 fusion protein</p>
            </title>
            <aug>
               <au>
                  <snm>Broekhuijsen</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Mattern</snm>
                  <fnm>IE</fnm>
               </au>
               <au>
                  <snm>Contreras</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kinghorn</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>J Biotechnol</source>
            <pubdate>1993</pubdate>
            <volume>31</volume>
            <fpage>135</fpage>
            <lpage>145</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0168-1656(93)90156-H</pubid>
                  <pubid idtype="pmpid">7764298</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Filamentous fungi as cell factories for heterologous protein production</p>
            </title>
            <aug>
               <au>
                  <snm>Punt</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>van Biezen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Conesa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Albers</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mangnus</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Trends Biotechnol</source>
            <pubdate>2002</pubdate>
            <volume>20</volume>
            <fpage>200</fpage>
            <lpage>206</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0167-7799(02)01933-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">11943375</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Secretion, purification, and characterisation of barley alpha-amylase produced by heterologous gene expression in <it>Aspergillus niger</it></p>
            </title>
            <aug>
               <au>
                  <snm>Juge</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Svensson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Williamson</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Appl Microbiol Biotechnol</source>
            <pubdate>1998</pubdate>
            <volume>49</volume>
            <fpage>385</fpage>
            <lpage>392</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s002530051187</pubid>
                  <pubid idtype="pmpid" link="fulltext">9615479</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Heterologous gene expression in <it>Aspergillus niger</it>: a glucoamylase-porcine pancreatic prophospholipase A2 fusion protein is secreted and processed to yield mature enzyme</p>
            </title>
            <aug>
               <au>
                  <snm>Roberts</snm>
                  <fnm>IN</fnm>
               </au>
               <au>
                  <snm>Jeenes</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>MacKenzie</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Wilkinson</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Sumner</snm>
                  <fnm>IG</fnm>
               </au>
               <au>
                  <snm>Archer</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>1992</pubdate>
            <volume>122</volume>
            <fpage>155</fpage>
            <lpage>61</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0378-1119(92)90043-O</pubid>
                  <pubid idtype="pmpid">1452022</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Characterization of humanized antibodies secreted by <it>Aspergillus niger</it></p>
            </title>
            <aug>
               <au>
                  <snm>Ward</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Victoria</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>BP</fnm>
               </au>
               <au>
                  <snm>Fox</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Meerman</snm>
                  <fnm>HJ</fnm>
               </au>
               <au>
                  <snm>Pucci</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Fong</snm>
                  <fnm>RB</fnm>
               </au>
               <au>
                  <snm>Heng</snm>
                  <fnm>MH</fnm>
               </au>
               <au>
                  <snm>Tsurushita</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gieswein</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>70</volume>
            <fpage>2567</fpage>
            <lpage>2576</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">404402</pubid>
                  <pubid idtype="pmpid" link="fulltext">15128505</pubid>
                  <pubid idtype="doi">10.1128/AEM.70.5.2567-2576.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The Gene Ontology Annotation (GOA) project: implementation of GO in SWISS-PROT, TrEMBL, and InterPro</p>
            </title>
            <aug>
               <au>
                  <snm>Camon</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Magrane</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Binns</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fleischmann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Kersey</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mulder</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oinn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Maslen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Apweiler</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>662</fpage>
            <lpage>672</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">430163</pubid>
                  <pubid idtype="pmpid" link="fulltext">12654719</pubid>
                  <pubid idtype="doi">10.1101/gr.461403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Fungal Genomics Project</p>
            </title>
            <url>https://fungalgenomics.concordia.ca/fungi/Anig.php</url>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Construction and characterization of a normalized cDNA library</p>
            </title>
            <aug>
               <au>
                  <snm>Soares</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Bonaldo</snm>
                  <fnm>MF</fnm>
               </au>
               <au>
                  <snm>Jelene</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Su</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lawton</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Efstratiadis</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <fpage>9228</fpage>
            <lpage>32</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">44785</pubid>
                  <pubid idtype="pmpid" link="fulltext">7937745</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Negative selection: a method for obtaining low-abundance cDNAs using high-density cDNA clone arrays</p>
            </title>
            <aug>
               <au>
                  <snm>Nelson</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Hawkins</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Schummer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bumgarner</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>WL</fnm>
               </au>
               <au>
                  <snm>Ideker</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Ferguson</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hood</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Genet Anal</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>209</fpage>
            <lpage>215</lpage>
            <xrefbib>
               <pubid idtype="pmpid">10609756</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Analysis of expressed sequence tags from two starvation, time-of-day-specific libraries of <it>Neurospora crassa </it>reveals novel clock-controlled genes</p>
            </title>
            <aug>
               <au>
                  <snm>Zhu</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nowrousian</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kupfer</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Colot</snm>
                  <fnm>HV</fnm>
               </au>
               <au>
                  <snm>Berrocal-Tito</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bell-Pedersen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Roe</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Loros</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Dunlap</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Genetics</source>
            <pubdate>2001</pubdate>
            <volume>157</volume>
            <fpage>1057</fpage>
            <lpage>1065</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11238394</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Analyses of cDNAs from growth and slug stages of <it>Dictyostelium discoideum</it></p>
            </title>
            <aug>
               <au>
                  <snm>Urushihara</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Morio</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Saito</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kohara</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Koriki</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ochiai</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Maeda</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Takeuchi</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <fpage>1647</fpage>
            <lpage>1653</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">390320</pubid>
                  <pubid idtype="pmpid" link="fulltext">15010511</pubid>
                  <pubid idtype="doi">10.1093/nar/gkh262</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Department of Genome Sciences, University of Washington</p>
            </title>
            <url>http://www.phrap.org/</url>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Saccharomyces Genome Database</p>
            </title>
            <url>http://www.yeastgenome.org/</url>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Broad Institute, Fungal Genome Initiative</p>
            </title>
            <url>http://www.broad.mit.edu/annotation/fungi/fgi/</url>
         </bibl>
         <bibl id="B22">
            <aug>
               <au>
                  <cnm>DOE Joint Genome Institute</cnm>
               </au>
            </aug>
            <url>http://genome.jgi-psf.org/whiterot1/whiterot1.home.html</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The origin and evolution of model organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Hedges</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Nat Rev Genet</source>
            <pubdate>2002</pubdate>
            <volume>3</volume>
            <fpage>838</fpage>
            <lpage>849</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrg929</pubid>
                  <pubid idtype="pmpid" link="fulltext">12415314</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The genome sequence of the filamentous fungus <it>Neurospora crassa</it></p>
            </title>
            <aug>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Calvo</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Borkovich</snm>
                  <fnm>KA</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
               <au>
                  <snm>Read</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jaffe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>FitzHugh</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Purcell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rehman</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Elkins</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Engels</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>CB</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Endrizzi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Qui</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Ianakiev</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bell-Pedersen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Werner-Washburne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Selitrennikoff</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Kinsey</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Braun</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Zelter</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schulte</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Kothe</snm>
                  <fnm>GO</fnm>
               </au>
               <au>
                  <snm>Jedd</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Staben</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Marcotte</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Greenberg</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Roy</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Foley</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Naylor</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stange-Thomann</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Gnerre</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kamal</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kamvysselis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mauceli</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bielke</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Rudd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Krystofova</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rasmussen</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Metzenberg</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Perkins</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Kroken</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Cogoni</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Macino</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Catcheside</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Pratt</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Osmani</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>DeSouza</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Glass</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Orbach</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Berglund</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Voelker</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yarden</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Plamann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Seiler</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dunlap</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Radford</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Aramayo</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Natvig</snm>
                  <fnm>DO</fnm>
               </au>
               <au>
                  <snm>Alex</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Mannhaupt</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Ebbole</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Freitag</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sachs</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Lander</snm>
                  <fnm>ES</fnm>
               </au>
               <au>
                  <snm>Nusbaum</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>422</volume>
            <fpage>859</fpage>
            <lpage>868</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01554</pubid>
                  <pubid idtype="pmpid" link="fulltext">12712197</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Genome sequence of the lignocellulose degrading fungus <it>Phanerochaete chrysosporium </it>strain RP78</p>
            </title>
            <aug>
               <au>
                  <snm>Martinez</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Larrondo</snm>
                  <fnm>LF</fnm>
               </au>
               <au>
                  <snm>Putnam</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gelpke</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Huang</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chapman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Helfenbein</snm>
                  <fnm>KG</fnm>
               </au>
               <au>
                  <snm>Ramaiya</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Detter</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Larimer</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Coutinho</snm>
                  <fnm>PM</fnm>
               </au>
               <au>
                  <snm>Henrissat</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Berka</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cullen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rokhsar</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2004</pubdate>
            <volume>22</volume>
            <fpage>695</fpage>
            <lpage>700</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nbt967</pubid>
                  <pubid idtype="pmpid" link="fulltext">15122302</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The genome sequence of <it>Schizosaccharomyces pombe</it></p>
            </title>
            <aug>
               <au>
                  <snm>Wood</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Gwilliam</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Lyne</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lyne</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Stewart</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sgouros</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Peat</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hayles</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Baker</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Basham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brooks</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Collins</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Connor</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Cronin</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Feltwell</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gentles</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goble</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hamlin</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hidalgo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hodgson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Holroyd</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hornsby</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Howarth</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Huckle</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Leather</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McDonald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>McLean</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mooney</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Murphy</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Niblett</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Odell</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>O'Neil</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pearson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rabbinowitsch</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Rutter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Saunders</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Seeger</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sharp</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Skelton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Simmonds</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Stevens</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Tivey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Walsh</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Warren</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Woodward</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Volckaert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Aert</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Robben</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Grymonprez</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Weltjens</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Vanstreels</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rieger</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schafer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Muller-Auer</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gabel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fuchs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dusterhoft</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fritzc</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Holzer</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Moestl</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hilbert</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Borzym</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Langer</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lehrach</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Reinhardt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Eger</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Zimmermann</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wedler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Wambutt</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Purnelle</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Goffeau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Cadieu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dreano</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gloux</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lelaure</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Mottier</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Aves</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Xiang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Hunt</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hurst</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Lucas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rochet</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gaillardin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Tallada</snm>
                  <fnm>VA</fnm>
               </au>
               <au>
                  <snm>Garzon</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thode</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Daga</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Cruzado</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Jimenez</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Sanchez</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>del Rey</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Benito</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Dominguez</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Revuelta</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Moreno</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Armstrong</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Forsburg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Cerutti</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lowe</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>McCombie</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Potashkin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Shpakovski</snm>
                  <fnm>GV</fnm>
               </au>
               <au>
                  <snm>Ussery</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Nurse</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>415</volume>
            <fpage>871</fpage>
            <lpage>880</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature724</pubid>
                  <pubid idtype="pmpid" link="fulltext">11859360</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Life with 6000 genes</p>
            </title>
            <aug>
               <au>
                  <snm>Goffeau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>BG</fnm>
               </au>
               <au>
                  <snm>Bussey</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Feldmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Galibert</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hoheisel</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacq</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Louis</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Murakami</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tettelin</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1996</pubdate>
            <volume>274</volume>
            <fpage>563</fpage>
            <lpage>567</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1126/science.274.5287.546</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>The <it>Ashbya gossypii </it>genome as a tool for mapping the ancient <it>Saccharomyces cerevisiae </it>genome</p>
            </title>
            <aug>
               <au>
                  <snm>Dietrich</snm>
                  <fnm>FS</fnm>
               </au>
               <au>
                  <snm>Voegeli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Brachat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lerch</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gates</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Steiner</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mohr</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pohlmann</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Luedi</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Choi</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wing</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Flavier</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Gaffney</snm>
                  <fnm>TD</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <fpage>304</fpage>
            <lpage>307</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1095781</pubid>
                  <pubid idtype="pmpid" link="fulltext">15001715</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Improved prediction of signal peptides: SignalP 3.0</p>
            </title>
            <aug>
               <au>
                  <snm>Bendtsen</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2004</pubdate>
            <volume>340</volume>
            <fpage>783</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jmb.2004.05.028</pubid>
                  <pubid idtype="pmpid" link="fulltext">15223320</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>Efficient production of secreted proteins by <it>Aspergillus</it>: progress, limitations and prospects</p>
            </title>
            <aug>
               <au>
                  <snm>Gouka</snm>
                  <fnm>RJ</fnm>
               </au>
               <au>
                  <snm>Punt</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Appl Microbiol Biotechno</source>
            <pubdate>1997</pubdate>
            <volume>47</volume>
            <fpage>1</fpage>
            <lpage>11</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s002530050880</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The secretion pathway in filamentous fungi: a biotechnological view</p>
            </title>
            <aug>
               <au>
                  <snm>Conesa</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Punt</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>van Luijk</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Fungal Genet Bio</source>
            <pubdate>2001</pubdate>
            <volume>33</volume>
            <fpage>155</fpage>
            <lpage>171</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1006/fgbi.2001.1276</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Characterization of a foldase, protein disulfide isomerase A, in the protein secretory pathway of <it>Aspergillus niger</it></p>
            </title>
            <aug>
               <au>
                  <snm>Ngiam</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jeenes</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Punt</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Archer</snm>
                  <fnm>DB</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2000</pubdate>
            <volume>66</volume>
            <fpage>775</fpage>
            <lpage>782</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">91895</pubid>
                  <pubid idtype="pmpid" link="fulltext">10653750</pubid>
                  <pubid idtype="doi">10.1128/AEM.66.2.775-782.2000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The role of the <it>Aspergillus niger </it>furin-type protease gene in processing of fungal proproteins and fusion proteins. Evidence for alternative processing of recombinant (fusion-) proteins</p>
            </title>
            <aug>
               <au>
                  <snm>Punt</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Drint-Kuijvenhoven</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lokman</snm>
                  <fnm>BC</fnm>
               </au>
               <au>
                  <snm>Spencer</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Jeenes</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Archer</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>J Biotechnol</source>
            <pubdate>2003</pubdate>
            <volume>106</volume>
            <fpage>23</fpage>
            <lpage>32</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.jbiotec.2003.09.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">14636707</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>Identification and characterization of a family of secretion-related small GTPase-encoding genes from the filamentous fungus <it>Aspergillus niger</it>: a putative SEC4 homologue is not essential for growth</p>
            </title>
            <aug>
               <au>
                  <snm>Punt</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Seiboth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Weenink</snm>
                  <fnm>XO</fnm>
               </au>
               <au>
                  <snm>van Zeijl</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lenders</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Konetschny</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ram</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Montijn</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kubicek</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>van den Hondel</snm>
                  <fnm>CA</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>41</volume>
            <fpage>513</fpage>
            <lpage>525</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2001.02541.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">11489135</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Role of three rab5-like GTPases, Ypt51p, Ypt52p, and Ypt53p, in the endocytic and vacuolar protein sorting pathways of yeast</p>
            </title>
            <aug>
               <au>
                  <snm>Singer-Kruger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stenmark</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Dusterhoft</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Yoo</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Gallwitz</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zerial</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Cell Biol</source>
            <pubdate>1994</pubdate>
            <volume>125</volume>
            <fpage>283</fpage>
            <lpage>298</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1083/jcb.125.2.283</pubid>
                  <pubid idtype="pmpid" link="fulltext">8163546</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>The mammalian Rab family of small GTPases: definition of family and subfamily sequence motifs suggests a mechanism for functional specificity in the Ras superfamily</p>
            </title>
            <aug>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Seabra</snm>
                  <fnm>MC</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>2000</pubdate>
            <volume>301</volume>
            <fpage>1077</fpage>
            <lpage>1087</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.2000.4010</pubid>
                  <pubid idtype="pmpid" link="fulltext">10966806</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The development of <it>Aspergillus niger </it>var. awamori as a host for the expression and secretion of heterologous gene products</p>
            </title>
            <aug>
               <au>
                  <snm>Berka</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Kodama</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Rey</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Biochem Soc Trans</source>
            <pubdate>1991</pubdate>
            <volume>19</volume>
            <fpage>681</fpage>
            <lpage>685</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1783197</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>Characterisation of the <it>gpt</it>A gene, encoding UDP <it>N</it>-acetylglucosamine: dolichol phosphate <it>N</it>-acetylglucosaminylphosphoryl transferase, from the filamentous fungus,<it>Aspergillus niger</it></p>
            </title>
            <aug>
               <au>
                  <snm>Sorensen</snm>
                  <fnm>TK</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>PS</fnm>
               </au>
               <au>
                  <snm>Fierro</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Laube</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Peberdy</snm>
                  <fnm>JF</fnm>
               </au>
            </aug>
            <source>Biochim Biophys Acta</source>
            <pubdate>2003</pubdate>
            <volume>1619</volume>
            <fpage>89</fpage>
            <lpage>97</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12495819</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>O-mannosyl glycans: from yeast to novel associations with human disease</p>
            </title>
            <aug>
               <au>
                  <snm>Willer</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Valero</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Tanner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Cruces</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Strahl</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Curr Opin Struct Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <fpage>621</fpage>
            <lpage>630</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.sbi.2003.09.003</pubid>
                  <pubid idtype="pmpid" link="fulltext">14568618</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Molecular characterization of protein O-mannosyltransferase and its involvement in cell-wall synthesis in <it>Aspergillus nidulans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Oka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hamaguchi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Sameshima</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Furukawa</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2004</pubdate>
            <volume>150</volume>
            <fpage>1973</fpage>
            <lpage>1982</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/mic.0.27005-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">15184583</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Studies of yeast oligosaccharyl transferase subunits using the split-ubiquitin system: Topological features and in vivo interactions</p>
            </title>
            <aug>
               <au>
                  <snm>Yan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lennarz</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <fpage>7121</fpage>
            <lpage>7126</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1129144</pubid>
                  <pubid idtype="pmpid" link="fulltext">15886282</pubid>
                  <pubid idtype="doi">10.1073/pnas.0502669102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Meiotic and mitotic recombination in <it>Aspergillus </it>and its chromosomal aberrations</p>
            </title>
            <aug>
               <au>
                  <snm>Kafer</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Adv Genet</source>
            <pubdate>1977</pubdate>
            <volume>19</volume>
            <fpage>33</fpage>
            <lpage>131</lpage>
            <xrefbib>
               <pubid idtype="pmpid">327767</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <aug>
               <au>
                  <snm>Sambrook</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fritsch</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Maniatis</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Molecular cloning: a laboratory manual</source>
            <publisher>New York: Cold Spring Harbor Laboratory Press</publisher>
            <edition>2</edition>
            <pubdate>1989</pubdate>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Base-calling of automated sequencer traces using Phred. II. Error probabilities</p>
            </title>
            <aug>
               <au>
                  <snm>Ewing</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>186</fpage>
            <lpage>194</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9521922</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Base-calling of automated sequencer traces using Phred. I. Accuracy assessment</p>
            </title>
            <aug>
               <au>
                  <snm>Ewing</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Wendl</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Green</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>175</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9521921</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>DNA sequence quality trimming and vector removal</p>
            </title>
            <aug>
               <au>
                  <snm>Chou</snm>
                  <fnm>HH</fnm>
               </au>
               <au>
                  <snm>Holmes</snm>
                  <fnm>MH</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2001</pubdate>
            <volume>17</volume>
            <fpage>1093</fpage>
            <lpage>1104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/17.12.1093</pubid>
                  <pubid idtype="pmpid" link="fulltext">11751217</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>J Mol Biol</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/jmbi.1990.9999</pubid>
                  <pubid idtype="pmpid" link="fulltext">2231712</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>TargetIdentifier: a webserver for identifying full-length cDNAs from EST sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Min</snm>
                  <fnm>XJ</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Storms</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tsang</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Web Server</issue>
            <fpage>W669</fpage>
            <lpage>W672</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160197</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980559</pubid>
                  <pubid idtype="doi">10.1093/nar/gki436</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>The use of CLUSTAL W and CLUSTAL X for multiple sequence alignment</p>
            </title>
            <aug>
               <au>
                  <snm>Aiyar</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics Methods and Protocols, Methods in Molecular Biology</source>
            <publisher>New Jersey: Humana Press, Totowa</publisher>
            <editor>Misener S, Krawetz SA</editor>
            <pubdate>2000</pubdate>
            <volume>32</volume>
            <fpage>221</fpage>
            <lpage>241</lpage>
         </bibl>
         <bibl id="B50">
            <title>
               <p>TargetIdentifier Server</p>
            </title>
            <url>https://fungalgenome.concordia.ca/tools/docs/TargetIdentifier_faq.html</url>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p>
            </title>
            <aug>
               <au>
                  <snm>Ashburner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ball</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Blake</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Eppig</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Issel-Tarver</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Kasarskis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Matese</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Richardson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Ringwald</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rubin</snm>
                  <fnm>GM</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nat Genet</source>
            <pubdate>2000</pubdate>
            <volume>25</volume>
            <fpage>25</fpage>
            <lpage>29</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/75556</pubid>
                  <pubid idtype="pmpid" link="fulltext">10802651</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Gene Ontology Consortium</p>
            </title>
            <url>http://www.geneontology.org/</url>
         </bibl>
         <bibl id="B53">
            <title>
               <p>AmiGO browser and documentation</p>
            </title>
            <url>http://www.godatabase.org</url>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Fungal Genomics Project. GO Results for <it>A. niger</it></p>
            </title>
            <url>https://fungalgenomics.concordia.ca:444/cgi-bin/oldigo/go.cgi?session_id=51051110567526&amp;species_db=A.niger</url>
         </bibl>
         <bibl id="B55">
            <title>
               <p>OrfPredictor: predicting protein-coding regions in EST-derived sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Min</snm>
                  <fnm>XJ</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Storms</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Tsang</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2005</pubdate>
            <issue>33 Web Server</issue>
            <fpage>W677</fpage>
            <lpage>680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1160155</pubid>
                  <pubid idtype="pmpid" link="fulltext">15980561</pubid>
                  <pubid idtype="doi">10.1093/nar/gki394</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
