<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-8-426</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Database</dochead>
      <bibl>
         <title>
            <p>e-Fungi: a data resource for comparative analysis of fungal genomes</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Hedeler</snm>
               <fnm>Cornelia</fnm>
               <insr iid="I1"/>
               <email>chedeler@cs.manchester.ac.uk</email>
            </au>
            <au id="A2">
               <snm>Wong</snm>
               <mnm>Min</mnm>
               <fnm>Han</fnm>
               <insr iid="I3"/>
               <email>H.M.Wong@exeter.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Cornell</snm>
               <mi>J</mi>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <email>mcornell@cs.manchester.ac.uk</email>
            </au>
            <au id="A4">
               <snm>Alam</snm>
               <fnm>Intikhab</fnm>
               <insr iid="I1"/>
               <email>ialam@cs.manchester.ac.uk</email>
            </au>
            <au id="A5">
               <snm>Soanes</snm>
               <mi>M</mi>
               <fnm>Darren</fnm>
               <insr iid="I3"/>
               <email>D.M.Soanes@exeter.ac.uk</email>
            </au>
            <au id="A6">
               <snm>Rattray</snm>
               <fnm>Magnus</fnm>
               <insr iid="I1"/>
               <email>magnus.rattray@manchester.ac.uk</email>
            </au>
            <au id="A7">
               <snm>Hubbard</snm>
               <mi>J</mi>
               <fnm>Simon</fnm>
               <insr iid="I2"/>
               <email>simon.hubbard@manchester.ac.uk</email>
            </au>
            <au id="A8">
               <snm>Talbot</snm>
               <mi>J</mi>
               <fnm>Nicholas</fnm>
               <insr iid="I3"/>
               <email>n.j.talbot@exeter.ac.uk</email>
            </au>
            <au id="A9">
               <snm>Oliver</snm>
               <mi>G</mi>
               <fnm>Stephen</fnm>
               <insr iid="I4"/>
               <email>steve.oliver@mole.bio.cam.ac.uk</email>
            </au>
            <au id="A10">
               <snm>Paton</snm>
               <mi>W</mi>
               <fnm>Norman</fnm>
               <insr iid="I1"/>
               <email>npaton@manchester.ac.uk</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>School of Computer Science, The University of Manchester, Manchester, M13 9PL, UK</p>
            </ins>
            <ins id="I2">
               <p>Faculty of Life Sciences, The University of Manchester, Manchester, M13 9PT, UK</p>
            </ins>
            <ins id="I3">
               <p>School of Biosciences, University of Exeter, Exeter, EX4 4QD, UK</p>
            </ins>
            <ins id="I4">
               <p>Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>426</fpage>
         <url>http://www.biomedcentral.com/1471-2164/8/426</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18028535</pubid>
               <pubid idtype="doi">10.1186/1471-2164-8-426</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>15</day>
               <month>5</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>20</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>20</day>
               <month>11</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Hedeler et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The number of sequenced fungal genomes is ever increasing, with about 200 genomes already fully sequenced or in progress. Only a small percentage of those genomes have been comprehensively studied, for example using techniques from functional genomics. Comparative analysis has proven to be a useful strategy for enhancing our understanding of evolutionary biology and of the less well understood genomes. However, the data required for these analyses tends to be distributed in various heterogeneous data sources, making systematic comparative studies a cumbersome task. Furthermore, comparative analyses benefit from close integration of derived data sets that cluster genes or organisms in a way that eases the expression of requests that clarify points of similarity or difference between species.</p>
            </sec>
            <sec>
               <st>
                  <p>Description</p>
               </st>
               <p>To support systematic comparative analyses of fungal genomes we have developed the e-Fungi database, which integrates a variety of data for more than 30 fungal genomes. Publicly available genome data, functional annotations, and pathway information has been integrated into a single data repository and complemented with results of comparative analyses, such as MCL and OrthoMCL cluster analysis, and predictions of signaling proteins and the sub-cellular localisation of proteins. To access the data, a library of analysis tasks is available through a web interface. The analysis tasks are motivated by recent comparative genomics studies, and aim to support the study of evolutionary biology as well as community efforts for improving the annotation of genomes. Web services for each query are also available, enabling the tasks to be incorporated into workflows.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The e-Fungi database provides fungal biologists with a resource for comparative studies of a large range of fungal genomes. Its analysis library supports the comparative study of genome data, functional annotation, and results of large scale analyses over all the genomes stored in the database. The database is accessible at <url>http://www.e-fungi.org.uk</url>, as is the WSDL for the web services.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>A large number of genome projects are under way, with about 670 genomes completely sequenced and more than 2,500 genomes still in progress (Genomes OnLine Database (GOLD) <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> statistic, accessed November 2007). Bacterial sequencing projects form the largest group of genome projects with about 1,800 completed or ongoing, followed by the eukaryotes with about 850 projects. Amongst the eukaryotes, about 200 fungal genomes are being sequenced, followed by protozoa and plants with about 140 and 130 sequencing projects, respectively. The large number of sequenced genomes can provide the basis for comparative genomics analyses, which have already proven invaluable for studying the evolution and genetic diversity of kingdoms, identifying species-specific genes and those conserved between genomes, or examining the expansion or contraction of protein families (e.g., <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>).</p>
         <p>Not only are the fungi the most frequently sequenced kingdom within the eukaryotes, in addition the sequenced fungi have been selected to form clusters of related species, thus maximising their combined value for comparative genomics and evolutionary biology <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. They also play an important role in medicine, agriculture and industry. This makes the fungi a prime candidate for a systematic comparative study of eukaryotic biology and evolution.</p>
         <p>Comparative analyses can be used, amongst others, for the following analyses:</p>
         <p>&#8226; Identification of species-specific proteins/protein families or those conserved in closely related species, which can help to analyse conservations in species exhibiting distinct phenotypes, e.g., growth habits, lifestyles, or pathogenicity;</p>
         <p>&#8226; Study of genome redundancy in a range of related species, which can be used to analyse genome duplication;</p>
         <p>&#8226; Study of contraction or expansion of gene/protein families;</p>
         <p>&#8226; Identification of secreted proteins, which in pathogenic fungi could play important roles in host-pathogen interactions;</p>
         <p>&#8226; Conservation of genes defined as essential for growth in <it>Saccharomyces cerevisiae </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp> in fungal genomes;</p>
         <p>&#8226; Study of metabolic pathways in the fungi, analysis of conservation of components of pathways in fungal genomes; and</p>
         <p>&#8226; Distribution, diversity and conservation of proteins with particular functional domains in related fungal genomes.</p>
         <p>With the wealth of sequenced fungal genomes, the fungi can therefore not only serve as model organisms for eukaryotes <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>, but could also provide an important setting for the development of techniques for comparative analysis of eukaryotes.</p>
         <p>To facilitate comparative genomics, genomic data needs to be stored in multi-species databases instead of model genome databases capturing only data on a single genome <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. For the fungi, a number of multi-genome data repositories are already available in which data generated by fungal genome sequencing projects is deposited. These data sources include SGD <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, the Fungal Genome Initiative (FGI) at the Broad Institute <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> or the Integrated Microbial Genome (IMG) resource provided at the JGI <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. A large number of genomes are also available through Entrez <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Although these data sources store many fungi, the emphasis in their design is not primarily on a systematic comparison as such.</p>
         <p>Furthermore, a number of additional databases are available, specialising in particular kinds of data, some of which are placed in Figure <figr fid="F1">1</figr> according to the diversity of data they integrate and the number of genomes they cover. These resources include the Gene Ontology project <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> providing functional annotation of proteins, the Pfam database <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> providing information on protein domains and families, KEGG <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>, Reactome <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and Metacyc <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> capturing information about pathways, as well as PCAS <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> and SPdb <abbrgrp><abbr bid="B21">21</abbr></abbrgrp> storing predicted signal peptides. These specialised databases tend to contain only one particular kind of data but for a fairly large number of genomes. However, despite the large number of genomes integrated, as shown in Figure <figr fid="F1">1</figr>, most of these databases tend to cover only a limited number of fungal genomes. In addition to the types of data already mentioned, more and more functional genomics data sets are becoming available in various data sources.</p>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Overview of diversity of available databases</p>
            </caption>
            <text>
               <p>Overview of diversity of available databases.</p>
            </text>
            <graphic file="1471-2164-8-426-1"/>
         </fig>
         <p>Even though a number of multi-genome data sources are available, the distribution of the genomic data and additional data, such as functional annotation or pathway information, in heterogeneous data repositories, makes systematic comparisons of a large number of genomes a challenging task. To overcome the issues associated with the distribution of data in heterogeneous data repositories, and to facilitate comparative studies, a number of approaches have been taken to integrating a variety of different kinds of data for a large number of genomes. Databases that contain fungal among other genomes include G&#233;nolevures <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>, IMG <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, Ensembl <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>, the UCSC genome browser <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> and Entrez <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>However, even though these are multispecies databases they do not provide analysis facilities powerful enough to carry out comparative analyses as mentioned above. This is due to the provision of predominantly gene-centred query facilities and visualisations that also tend to be species-centred in the sense that the analysis or search is focussed on a particular genome, and the results can then be related to other species, for example, by identification of orthologous proteins. This limitation makes the systematic comparative analysis of a large range of genomes a cumbersome task.</p>
         <p>Here, we present e-Fungi, the first large-scale integrative repository of fungal genomes with an emphasis on supporting systematic comparative studies. To achieve this, e-Fungi integrates primary data obtained from a number of data sources and complements it with results of cluster analyses and other derived data that has been generated using large scale analyses of the genome data. The stored data can be analysed using a library of tasks that can be accessed using a web interface provided on the e-Fungi website <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> and as web services. With an emphasis on cluster-based analysis carried out over a range of genomes, e-Fungi represents a departure from gene-centric data sources and a move towards cluster-based data sources that provide better support for comparative studies.</p>
      </sec>
      <sec>
         <st>
            <p>Construction and content</p>
         </st>
         <p>In this section the construction and content of the e-Fungi database are described. The data sources from which the primary data are obtained are introduced, as well as the processes that generate the derived data. Furthermore, an overview of the database schema is provided, the loading infrastructure introduced, and the library of analysis tasks presented.</p>
         <sec>
            <st>
               <p>Data collection</p>
            </st>
            <sec>
               <st>
                  <p>Primary data</p>
               </st>
               <p>Four different types of primary data are obtained from a variety of repositories and integrated into the e-Fungi database: genomic data, Gene Ontology annotations, pathway data and EST data. Genomic data consists of the genome sequence with varying degrees of annotation. This annotation can include the prediction of genes with their introns, exons and predicted proteins, as well as their locations on contigs, supercontigs or chromosomes. Table <tblr tid="T1">1</tblr> lists all the genomes integrated into the e-Fungi database with the data sources from which the data has been obtained. Other data has been obtained as follows:</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Genomes in e-Fungi with associated data sources</p>
                  </caption>
                  <tblbdy cols="5">
                     <r>
                        <c ca="left">
                           <p>
                              <b>Genome</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Taxonomy</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Pathogenicity</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Growth form</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Source</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="5">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Phytophthora sojae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Oomycete</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>JGI</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Phytophthora ramorum</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Oomycete</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>JGI</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Rhizopus oryzae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Zygomycota &#8211; Mucorales</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Ustilago maydis</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Basidiomycete &#8211; Ustilaginomycota</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>dimorphic</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Phanerochaete chrysosporium</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Basidiomycete &#8211; Homobasidiomycota</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>JGI</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Schizosaccharomyces pombe</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Schizosaccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast &#8211; fission</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Yarrowia lipolytica</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast &#8211; dimorphic</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces paradoxus</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>SGD</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces cerevisiae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces mikatae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>SGD</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces kudriavzevii</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>SGD</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces bayanus</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>SGD</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces castellii</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>SGD</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Candida glabrata</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>psuedo hyphae &#8211; dimorphic</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Kluyveromyces waltii</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Saccharomyces kluyveri</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>SGD</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Kluyveromyces lactis</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Eremothecium gossypii</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Candida albicans</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>psuedo hyphae &#8211; dimorphic</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Debaryomyces hansenii</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast &#8211; dimorphic</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Candida lusitaniae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Saccharomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>yeast &#8211; dimorphic</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Coccidioides immitis</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Eurotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Aspergillus oryzae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Eurotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Dogan</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Aspergillus niger</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Eurotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>JGI</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Aspergillus fumigatus</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Eurotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>CADRE</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Aspergillus terreus</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Eurotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Aspergillus nidulans</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Eurotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Stagonospora nodorum</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Dothideomycetes</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Sclerotinia sclerotiorum</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Leotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Botrytis cinerea</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Leotiomycetes</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Trichoderma reesei</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Sordariomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>JGI</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Gibberella zeae</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Sordariomycetes</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Magnaporthe grisea</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Sordariomycetes</p>
                        </c>
                        <c ca="left">
                           <p>plant pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Chaetomium globosum</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Sordariomycetes</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Neurospora crassa</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Ascomycete &#8211; Sordariomycetes</p>
                        </c>
                        <c ca="left">
                           <p>non pathogen</p>
                        </c>
                        <c ca="left">
                           <p>filamentous</p>
                        </c>
                        <c ca="left">
                           <p>Broad</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>
                              <it>Encephalitazoon cuniculi</it>
                           </p>
                        </c>
                        <c ca="left">
                           <p>Microsporidia</p>
                        </c>
                        <c ca="left">
                           <p>animal pathogen</p>
                        </c>
                        <c ca="left">
                           <p>microsporidia</p>
                        </c>
                        <c ca="left">
                           <p>Entrez</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
               <p>&#8226; Gene Ontology annotation for <it>S. cerevisiae, S. pombe </it>and <it>C. albicans </it>has been obtained from SGD <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, Sanger GeneDB <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and CGD <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>.</p>
               <p>&#8226; Pathway information including the assignment of pathways to proteins for <it>S. cerevisiae, S. paradoxus, S. mikatae, S. bayanus, E. gossypii, K. lactis, K. waltii, D. hansenii, C. albicans, C. glabrata, Y. lipolytica, S. pombe, N. crassa, M. grisea, A. nidulans, A. fumigatus, A. oryzae </it>and <it>E. cuniculi </it>has been obtained from KEGG <abbrgrp><abbr bid="B17">17</abbr></abbrgrp>.</p>
               <p>&#8226; Expressed Sequence Tag (EST) data are obtained from the COGEME Phytopathogenic Fungi and Oomycete EST Database <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>.</p>
            </sec>
            <sec>
               <st>
                  <p>Derived data</p>
               </st>
               <p>The following kinds of derived data are stored in the database:</p>
               <p>&#8226; Clustering sequences from 36 fungal genomes: We compared 348,995 protein sequences from the 36 genomes integrated in e-Fungi (see Table <tblr tid="T1">1</tblr>) using BlastP <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> with an E-value cut-off of 10<sup>-5</sup>. This resulted in 47,342,483 hits. Markov Chain Clustering (MCL) <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> was then applied to generate clusters of similar proteins, using 2.5 as a moderate inflation value and 10<sup>-10 </sup>as a comparatively strict E-value cut-off. This generated 23,724 clusters containing in total 282,061 sequences, while 66,934 sequences were singletons.</p>
               <p>&#8226; Orthology assignments: To identify orthologous proteins between the 36 genomes, the BlastP results were analysed with OrthoMCL <abbrgrp><abbr bid="B31">31</abbr></abbrgrp> using its default parameters (i.e., an E-value of 10<sup>-5</sup>). The analysis produced in total 30,084 clusters, with 5,406 of those containing just paralogues and 24,678 containing potential orthologous proteins. Out of these clusters of potential orthologues, 14,113 are unambiguous orthologue clusters, while 10,565 are ambiguous clusters with orthologues and recent paralogues.</p>
               <p>&#8226; Domain assignments: To identify functional domains and other known sequence motifs, predicted proteins from all 36 genomes were scanned with the Pfam database release 18 <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> using hmmpfam <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. A total of 6,260 different Pfam domains were identified in 196,425 proteins, using an E-value cut-off of 0.1. The distribution of 5 of the most frequently found Pfam domains among the genomes is shown in Figure <figr fid="F2">2</figr>.</p>
               <fig id="F2">
                  <title>
                     <p>Figure 2</p>
                  </title>
                  <caption>
                     <p>Distribution of 5 of the most frequently found Pfam domains</p>
                  </caption>
                  <text>
                     <p>Distribution of 5 of the most frequently found Pfam domains.</p>
                  </text>
                  <graphic file="1471-2164-8-426-2"/>
               </fig>
               <p>&#8226; Protein localisation predictions: Protein sub-cellular localisations were predicted using SignalP <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>, PSort <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> and Wolf-PSort <abbrgrp><abbr bid="B35">35</abbr></abbrgrp> with the default parameters. Distributions of the most frequently assigned PSort and Wolf-PSort predictions among the genomes are shown in Figures <figr fid="F3">3</figr> and <figr fid="F4">4</figr>.</p>
               <fig id="F3">
                  <title>
                     <p>Figure 3</p>
                  </title>
                  <caption>
                     <p>Distribution of the most frequently assigned PSort predictions</p>
                  </caption>
                  <text>
                     <p>Distribution of the most frequently assigned PSort predictions.</p>
                  </text>
                  <graphic file="1471-2164-8-426-3"/>
               </fig>
               <fig id="F4">
                  <title>
                     <p>Figure 4</p>
                  </title>
                  <caption>
                     <p>Distribution of the most frequently assigned Wolf-PSort predictions</p>
                  </caption>
                  <text>
                     <p>Distribution of the most frequently assigned Wolf-PSort predictions.</p>
                  </text>
                  <graphic file="1471-2164-8-426-4"/>
               </fig>
               <p>All the generated data are integrated into the e-Fungi database using the loading infrastructure described below.</p>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Implementation</p>
            </st>
            <p>The e-Fungi infrastructure consists of several components: the database itself, the population infrastructure, and the library of analysis tasks. An overview of the infrastructure is shown in Figure <figr fid="F5">5</figr> and its components are introduced below.</p>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Overview of e-Fungi architecture</p>
               </caption>
               <text>
                  <p>Overview of e-Fungi architecture.</p>
               </text>
               <graphic file="1471-2164-8-426-5"/>
            </fig>
            <sec>
               <st>
                  <p>Database schema</p>
               </st>
               <p>The Object Database Management System Versant FastObjects <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> is used to store the data integrated into e-Fungi. The database schema has been implemented using Java Data Objects (JDO), an industry standard interface-based abstraction of persistence. Using JDO for storing the data allows the direct implementation of the object data model without the need to map between the object model and, for example, a relational database model. Such a mapping often results in a less intuitive representation of the data.</p>
               <p>Using an object data model in combination with an object-oriented programming language, such as Java, also enables a tighter integration of analysis tasks with the stored data. Complex queries that analyse a large variety of different types of data can, therefore, be realised in a fairly intuitive manner.</p>
               <p>The database schema can be divided into different parts, modelling the different types of data introduced above. The parts of the schema for genomic sequences, annotations, pathways and ESTs are based on published models <abbrgrp><abbr bid="B17">17</abbr><abbr bid="B28">28</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>. The part of the schema modelling the derived data is introduced in more detail in the following.</p>
               <p>Results of the MCL and OrthoMCL cluster analyses consist of an identifier for each cluster and the assignments of proteins to clusters, captured in the classes MclCluster and OrthoMclCluster. To be able to retrieve the MCL cluster or the OrthoMCL cluster for a particular protein, the class Protein has an association with both MclCluster and OrthoMclCluster.</p>
               <p>The results of the predictions of protein sub-cellular localisations are captured following a similar approach for all three different prediction methods. Each prediction method can have a number of different outcomes, e.g., golgi, cytoplasmic, or plasma membrane. These are captured in PSortPrediction, WolfPSortPrediction and SignalPPrediction. Each prediction has a 0-to-many association with Protein, enabling the retrieval of all proteins with a particular predicted localisation. However, not only are the final predictions provided as a result of the analyses, so are a number of scores associated with the predictions. Scores returned by each prediction analysis are captured in PSortResult, WolfPSortResult and SignalPResult, which have a 1-to-1 association with the protein for which the prediction has been made. The scores are captured as provenance information associated with each analysis, thereby recording all the information contained in the report provided as a result of each analysis.</p>
            </sec>
            <sec>
               <st>
                  <p>Loading infrastructure</p>
               </st>
               <p>A loading infrastructure has been developed to integrate data from a variety of data sources, as listed in Table <tblr tid="T1">1</tblr>, and map the information onto the e-Fungi database schema. The infrastructure consists of 3 general modules, the loaders, parsers and wrappers (see Figure <figr fid="F6">6</figr>). The loaders are specific to each data source and data format. For example, the genomic data loader for data from the Broad Institute gathers the contigs, genes and protein sequence data from 3 separate FASTA files and relates the data to information provided in other files of different formats. Data from each source is parsed into a generalised format that can be processed by the loader using the respective parsers for each of the available data formats (e.g., FASTA, GTF, GFF). The wrappers are responsible for creating and linking objects, e.g., when loading data on a protein the wrapper will create a Protein, create a PrimaryPolypeptide sequence, and link these together.</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Loading infrastructure</p>
                  </caption>
                  <text>
                     <p><b>Loading infrastructure</b>. Schematic overview of the loading infrastructure employed to integrate primary and derived data into the e-Fungi database.</p>
                  </text>
                  <graphic file="1471-2164-8-426-6"/>
               </fig>
               <p>The loading infrastructure is designed to minimise maintenance and ensure extensibility of the database. Each module protects the others from sections that are prone to changes. For example, changes in the source, e.g., changes in location, format or method of access, will only require modifications of the loaders without affecting the existing parsers or wrappers. The parsers, on the other hand, act as tools for the loaders and can be easily improved or added as required. For the database, wrappers provide a layer of protection that allows the schema to be changed without the need to modify existing loaders or parsers. This allows the database to be extended easily to include further kinds of data without the need to rebuild the loading infrastructure.</p>
            </sec>
            <sec>
               <st>
                  <p>Library of analysis tasks</p>
               </st>
               <p>The data stored in the e-Fungi database can be analysed using pre-determined analysis tasks that can be parameterised, so-called canned queries. More than 90 queries are currently available, varying in their complexity from simple retrieval tasks to complex analysis tasks. Similar queries based on the type of data analysed are grouped together (see Table <tblr tid="T2">2</tblr> for an overview of the categories). Providing pre-determined analysis tasks as a means to explore and analyse the stored data might seem quite limiting at first. However, as shown in the next section, it allows complex analysis tasks to be provided that are beyond simple keyword- or identifier-based retrieval of stored data.</p>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Canned query groups currently provided</p>
                  </caption>
                  <tblbdy cols="2">
                     <r>
                        <c ca="left">
                           <p>
                              <b>Canned query group</b>
                           </p>
                        </c>
                        <c ca="left">
                           <p>
                              <b>Canned query group</b>
                           </p>
                        </c>
                     </r>
                     <r>
                        <c cspan="2">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Annotation of proteins in clusters</p>
                        </c>
                        <c ca="left">
                           <p>Queries in this group retrieve annotation of all the proteins in particular clusters. The annotation consists of PSort, Wolf-PSort and SignalP predictions, as well as GO annotations, Pfam domains, Enzyme annotation and pathways for each protein, as well as its assignment to a particular MCL and OrthoMCL cluster. The clusters can either be chosen by providing an identifier of a particular cluster or they can be based on the proteins they contain, such as proteins with a particular GO annotation or a particular cellular localisation as predicted by PSort or Wolf- PSort.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Cellular localisation analysis</p>
                        </c>
                        <c ca="left">
                           <p>This group of queries retrieves the cellular localisation for proteins as predicted by PSort and Wolf-PSort. It also retrieves proteins with a particular predicted cellular localisation.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>EST analysis</p>
                        </c>
                        <c ca="left">
                           <p>Collection of general EST analyses. Information available include group/hierarchy structure of ESTs and genes as well as number of homologs of genes in all genomes in the database.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Essential yeast genes cluster analysis</p>
                        </c>
                        <c ca="left">
                           <p>Queries to retrieve Mcl Clusters containing proteins of a given genome and proteins of essential or non-essential yeast genes.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Essential yeast genes orthology analysis</p>
                        </c>
                        <c ca="left">
                           <p>This group of queries analyses clusters containing a given genome and proteins of essential or non-essential yeast genes in terms of the number of genomes present in those clusters.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Functional annotation analysis</p>
                        </c>
                        <c ca="left">
                           <p>Queries in this group enable the retrieval of Gene Ontology or Pfam annotation for a given protein, or the retrieval of proteins with a given annotation.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Genomics analysis</p>
                        </c>
                        <c ca="left">
                           <p>Collection of queries for general genomic analyses, such as retrieving the exons of a particular gene.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>MCL cluster analysis</p>
                        </c>
                        <c ca="left">
                           <p>Queries in this group provide a general analysis of the MCL clusters in the database. Clusters containing proteins of a given genome, or a group of genomes, such as plant pathogens or filamentous fungi, can be retrieved. Furthermore, clusters that contain more or less than a given percentage of proteins of a given genome can also be obtained.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>OrthoMCL cluster analysis</p>
                        </c>
                        <c ca="left">
                           <p>This group of queries provide a general analysis of the OrthoMCL clusters in the database. The queries in this group are similar in scope to the queries in the MCL cluster analysis group.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Pathway analysis</p>
                        </c>
                        <c ca="left">
                           <p>Queries provided in this group retrieve pathways and enzyme annotations for a particular protein as well as all the proteins in a given pathway or with a particular enzyme annotation.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Redundancy analysis</p>
                        </c>
                        <c ca="left">
                           <p>The query in this group analyses the redundancy in a given species. Genome redundancy is determined by counting the number of proteins of that given genome in MCL clusters.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Secretome analysis</p>
                        </c>
                        <c ca="left">
                           <p>To retrieve the SignalP prediction for a given protein or proteins with a given SignalP prediction, i.e., secretory or non-secretory proteins, queries in this group can be used.</p>
                        </c>
                     </r>
                     <r>
                        <c ca="left">
                           <p>Transcript abundance</p>
                        </c>
                        <c ca="left">
                           <p>Collection of queries for transcript abundance analyses. These queries enable the identification of genes that may be highly expressed under a particular growth condition. Information of these genes and conditions can also be retrieved.</p>
                        </c>
                     </r>
                  </tblbdy>
               </tbl>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Utility</p>
         </st>
         <p>The data stored in e-Fungi can be accessed through a web interface and web services, which have been generated using Pierre <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>. A Java Graphical User Interface (GUI), described elsewhere <abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, is currently only used locally, but can be made available on request.</p>
         <sec>
            <st>
               <p>Web interface access</p>
            </st>
            <p>The database can be accessed through the e-Fungi web site <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> by either following the link 'Connect to the database' or by choosing the category 'Database' in the menu on the left hand side of the page and choosing the 'Connect to the database' link. In both cases, a further link to the WSDL describing the web services is also provided. The web services are introduced later.</p>
            <sec>
               <st>
                  <p>Browse</p>
               </st>
               <p>Browse provides an overview of the genomes stored in the database, but can also be used as an entry point to explore and analyse the data by following either of the two different kinds of links provided: (i) navigational links, and (ii) links to analysis tasks. The former, for example, enable the retrieval of contigs or chromosomes for a particular genome, whereas the latter link to analysis tasks provided in the canned query library using the chosen entry, e.g., a particular genome, as input.</p>
            </sec>
            <sec>
               <st>
                  <p>Simple Search</p>
               </st>
               <p>The Simple Search feature exposes the tasks provided in the canned query library mentioned above. Documentation for each query can be found under the category 'Documentation' in the menu on the left hand side of the e-Fungi web site or by following the 'Help' link provided on each query form (see Figure <figr fid="F7">7</figr>). Information includes the type of input required, a number of example inputs, and a description of the output provided by the query. Furthermore, information on the runtime of long running queries is also provided. The canned queries provided to support the comparative analyses listed in the background section are introduced in the following to illustrate the utility of the e-Fungi database:</p>
               <fig id="F7">
                  <title>
                     <p>Figure 7</p>
                  </title>
                  <caption>
                     <p>Screenshot of parameterisation of a canned query</p>
                  </caption>
                  <text>
                     <p><b>Screenshot of parameterisation of a canned query</b>. Screenshot of the web interface showing the parameterisation of the query 'Get clusters with proteins of a given genome.', which is part of the group 'MCL cluster analysis'.</p>
                  </text>
                  <graphic file="1471-2164-8-426-7"/>
               </fig>
               <p>1. <it>Identification of species-specific protein families or those conserved in closely related species</it>. The queries 'Get MCL clusters with proteins of a given genome' or 'Get OrthoMCL clusters with proteins of a given genome', which can be found in the group 'MCL cluster analysis' and 'OrthoMCL cluster analysis', respectively, can be used to retrieve all clusters containing proteins of a particular genome and perhaps identify clusters containing only paralogues of the chosen genome, i.e., possible species-specific proteins. To run a query, the appropriate query category is chosen, e.g., MCL clusters or OrthoMCL clusters. From the list of canned queries in the chosen group, the canned query of interest is selected and the user is presented with a form for the required input parameters (e.g., Figure <figr fid="F7">7</figr>). For some of the input parameters, an existing value feature exists, enabling users to choose a value from a list of possible values. For other parameters, the Advanced Search feature can be used to retrieve the exact value, such as for a particular Gene Ontology Annotation or Pfam domain, as illustrated later. With the input parameters provided, the query can be executed and the results displayed (see Figure <figr fid="F8">8</figr>). Some of the result reports provide navigation in the form of links, similar to the navigation in Browse, for further exploration and analysis of the results.</p>
               <fig id="F8">
                  <title>
                     <p>Figure 8</p>
                  </title>
                  <caption>
                     <p>Screenshot of the query result</p>
                  </caption>
                  <text>
                     <p><b>Screenshot of the query result</b>. Screenshot of the web interface showing a subset of the MCL clusters with <it>Aspergillus nidulans </it>proteins. The clusters shown are the three clusters containing only proteins of filamentous genomes and no yeast like genomes, whereas all the remaining 7593 contain both.</p>
                  </text>
                  <graphic file="1471-2164-8-426-8"/>
               </fig>
               <p>2. <it>Contraction or expansion of protein families</it>. The query 'Get all MCL clusters with more than a given percentage of proteins of a given genome' can be used to identify outlying clusters. The query is part of the group 'MCL cluster analysis' and has a counterpart in the group 'OrthoMCL cluster analysis'. To identify protein families that are conserved in genomes exhibiting a certain phenotype, the query 'Get MCL clusters containing proteins of a group of genomes' or its counterpart that analyses OrthoMCL clusters can be used. A group of genomes can be specified by their exhibited phenotypes, such as growth form or pathogenicity. Analyses to identify species-specific protein families or those that are conserved in related species with a particular phenotype, as well as studies of contraction or expansion of protein families, have been part of recent comparative studies <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>.</p>
               <p>3. <it>Genome redundancy in a range of related species, illustrating the importance of genome duplication </it><abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. The canned query 'Get the number of paralogues for all clusters containing proteins of a given genome' that can be found in the group 'Redundancy analysis' can be used.</p>
               <p>4. <it>Identification of secreted proteins, which in pathogens could play important roles in host-pathogen interactions</it>. This analysis can be aided by executing either of the following canned queries 'Get secretory proteins for a given genome', which is part of the group 'Secretome analysis', or 'Get annotation for proteins of a given genome in MCL/OrthoMCL clusters with secretory proteins'. The queries retrieving the annotation of proteins in MCL or OrthoMCL clusters are part of the group 'Annotation of proteins in clusters'.</p>
               <p>5. <it>Conservation of genes defined as essential for growth in Saccharomyces cerevisiae </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp><it> among fungal genomes</it>. This analysis is supported by a number of queries that can be found in the groups 'Essential yeast genes cluster analysis' and 'Essential yeast genes orthology analysis'. Similar studies using the essential genes identified in <it>Candida albicans </it>have been reported in <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
               <p>6. <it>Conservation of components of metabolic pathways among fungal genomes </it><abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Again, this analysis is supported by a number of canned queries, such as 'Get proteins that are in the same (KEGG reference) pathway as a given protein' of the group 'Pathway analysis', which retrieves all the proteins that are known to participate in a particular pathway. To analyse newly sequenced genomes and identify proteins that could potentially be part of a pathway, the query 'Get annotation for proteins of a given genome in the same MCL/OrthoMCL clusters as proteins in a given pathway', part of the 'Annotation of proteins in clusters' group, can be used.</p>
            </sec>
            <sec>
               <st>
                  <p>Advanced Search</p>
               </st>
               <p>The Advanced Search feature can be used to retrieve entries for which a property value or a range of property values can be specified. The user specifies the type of entry to be retrieved and the filters that the returned entries have to match. Similar to the Simple Search, a form is provided requesting input parameters for the Advanced Search. The example of an Advanced Search shown in Figure <figr fid="F9">9</figr> retrieves all the biosynthesis pathways, i.e., all the KEGG pathways the name of which ends in 'biosynthesis'.</p>
               <fig id="F9">
                  <title>
                     <p>Figure 9</p>
                  </title>
                  <caption>
                     <p>Screenshot of Advanced search</p>
                  </caption>
                  <text>
                     <p><b>Screenshot of Advanced search</b>. Screenshot of the Advanced search feature of the web interface. This feature enables the filtering of objects of a particular type and can be used to retrieve the exact value of names or identifiers of which only the beginning or end is known.</p>
                  </text>
                  <graphic file="1471-2164-8-426-9"/>
               </fig>
            </sec>
         </sec>
         <sec>
            <st>
               <p>Web service access</p>
            </st>
            <p>Programmatic access to all the simple and advanced search facilities is provided by a web service interface. This enables the integration of e-Fungi web services with other web services to build complex workflows for data analysis and visualisation.</p>
            <p>A simple workflow example, implemented in Taverna <abbrgrp><abbr bid="B46">46</abbr></abbrgrp>, is shown in Figure <figr fid="F10">10</figr>. In this example, the ESTs representing an Open Reading Frame (ORF) are aligned. The workflow, built using web services from e-Fungi and EBI SOAPLab <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>, retrieves the ESTs (that represent the ORF of interest), and generates a set of aligned EST sequences and an alignment plot. Firstly, the e-Fungi web service operation, 'getEstFromOpenReadingFrame' is used to retrieve all the ESTs that represent the ORF of interest. The results are then parsed, the required information extracted (using tools in Taverna) and passed to the web service operation 'emma' (from EBI SOAPLab) that performs multiple sequence alignments. The results are then sent to the operation 'prettyplot' (from EBI SOAPLab) to generate an alignment plot, highlighting the aligned sections for the group of ESTs. The WSDL for the e-Fungi web service used within this workflow is:</p>
            <fig id="F10">
               <title>
                  <p>Figure 10</p>
               </title>
               <caption>
                  <p>Sample workflow</p>
               </caption>
               <text>
                  <p><b>Sample workflow</b>. Workflow schema describing multi-sequence alignment and visualisation using web services.</p>
               </text>
               <graphic file="1471-2164-8-426-10"/>
            </fig>
            <p>&lt;wsdl:definitions targetNamespace="urn:uk.org.efungi"</p>
            <p>&#160;&#160;&#160;xmlns:soapenc="http://schemas.xmlsoap.org/soap/encoding/"></p>
            <p>&#160;&#160;&#160;&lt;wsdl:message name="getEstFromOpenReadingFrameRequest"></p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&lt;wsdl:part name="id" type="soapenc:string"/></p>
            <p>&#160;&#160;&#160;&lt;/wsdl:message></p>
            <p>&#160;&#160;&#160;&lt;wsdl:message name="getEstFromOpenReadingFrameResponse"></p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&lt;wsdl:part name="getEstFromOpenReadingFrameReturn" type="soapenc:string"/></p>
            <p>&#160;&#160;&#160;&lt;/wsdl:message></p>
            <p>&#160;&#160;&#160;&lt;wsdl:portType></p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&lt;wsdl:operation name="getEstFromOpenReadingFrame" parameterOrder="id"></p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&lt;wsdl:input message="impl:getEstFromOpenReadingFrameRequest"</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;name="getEstFromOpenReadingFrameRequest"/></p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&lt;wsdl:output message="impl:getEstFromOpenReadingFrameResponse"</p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;&#160;name="getEstFromOpenReadingFrameResponse"/></p>
            <p>&#160;&#160;&#160;&#160;&#160;&#160;&lt;/wsdl:operation></p>
            <p>&#160;&#160;&#160;&lt;/wsdl:portType></p>
            <p>&lt;/wsdl:definitions></p>
            <p>Two different kinds of web services are provided for the simple searches: (i) specific and (ii) generic. The specific web service offers users a separate operation for each individual canned query, while the generic service provides users an all-in-one operation that is able to access all the available canned queries. All results returned from the web services are formatted in XML to ease parsing. Supporting operations are also provided to aid the usage of the web service. For example, the web service operation 'identifyAdvancedSearchCollections' returns all the available types of data that support the Advanced Search. The e-Fungi web service is deployed using Axis <abbrgrp><abbr bid="B48">48</abbr></abbrgrp> and can be accessed via Taverna (among other methods) by using Taverna's 'WSDL scavenger' feature.</p>
         </sec>
         <sec>
            <st>
               <p>Case study &#8211; Using e-Fungi to investigate fungal cytochrome P450 proteins</p>
            </st>
            <p>The case study presented in this section investigates the distribution, diversity and conservation of proteins with particular functional domains among related fungal genomes <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B49">49</abbr></abbrgrp>.</p>
            <p>Cytochrome P450 proteins form a superfamily of proteins which are found in many organisms, including bacteria, fungi, plants and mammals. They are monooxygenase enzymes that catalyse bioconversion processes. These include the degradation of complex biopolymers, such as the breakdown of lignin by <it>Phanerochaete chrysosporium </it><abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, and the production of secondary metabolites. In order to compare the P450omes <abbrgrp><abbr bid="B50">50</abbr></abbrgrp> of different fungal species, we used the query 'Get proteins with a given Pfam annotation', which can be found in the group 'Functional annotation analysis', entering the accession number PF00067. The result of this query is a list of proteins in which this motif has been identified, along with the E-Value and score associated with each identification as well as the MCL and OrthoMCL cluster in which the protein has been placed. The numbers of P450 proteins identified in each fungal species (see Table <tblr tid="T3">3</tblr>) appear to be in agreement with previously published results for <it>A. oryzae </it>(149 P450 proteins), <it>A. nidulans </it>(102), <it>A. fumigatus </it>(72) <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> and <it>P. chrysosporium </it>(150) <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>.</p>
            <tbl id="T3">
               <title>
                  <p>Table 3</p>
               </title>
               <caption>
                  <p>Clustering of 450 proteins</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Species</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b># P450 proteins</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b># OrthoMCL clusters</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b># Proteins not in OrthoMCL clusters</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Phytophthora sojae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Phytophthora ramorum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rhizopus oryzae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>49</p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>4</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ustilago maydis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Phanerochaete chrysosporium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>150</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Schizosaccharomyces pombe</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Yarrowia lipolytica</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>1</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces paradoxus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces cerevisiae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces mikatae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces kudriavzevii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces bayanus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces castellii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces kluyveri</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Kluyveromyces waltii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Kluyveromyces lactis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Eremothecium gossypii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida glabrata</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida albicans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Debaryomyces hansenii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida lusitaniae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Coccidioides immitis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>44</p>
                     </c>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>6</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus oryzae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>155</p>
                     </c>
                     <c ca="left">
                        <p>86</p>
                     </c>
                     <c ca="left">
                        <p>28</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus niger</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>150</p>
                     </c>
                     <c ca="left">
                        <p>86</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus fumigatus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>72</p>
                     </c>
                     <c ca="left">
                        <p>50</p>
                     </c>
                     <c ca="left">
                        <p>5</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus terreus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>116</p>
                     </c>
                     <c ca="left">
                        <p>67</p>
                     </c>
                     <c ca="left">
                        <p>25</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus nidulans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>119</p>
                     </c>
                     <c ca="left">
                        <p>79</p>
                     </c>
                     <c ca="left">
                        <p>15</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Stagonospora nodorum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>148</p>
                     </c>
                     <c ca="left">
                        <p>83</p>
                     </c>
                     <c ca="left">
                        <p>37</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sclerotinia sclerotiorum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>92</p>
                     </c>
                     <c ca="left">
                        <p>70</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Botrytis cinerea</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>79</p>
                     </c>
                     <c ca="left">
                        <p>53</p>
                     </c>
                     <c ca="left">
                        <p>18</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Trichoderma reesei</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>71</p>
                     </c>
                     <c ca="left">
                        <p>43</p>
                     </c>
                     <c ca="left">
                        <p>10</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Gibberella zeae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>107</p>
                     </c>
                     <c ca="left">
                        <p>65</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chaetomium globosum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>89</p>
                     </c>
                     <c ca="left">
                        <p>68</p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Magnaporthe grisea</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>133</p>
                     </c>
                     <c ca="left">
                        <p>67</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Neurospora crassa</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>39</p>
                     </c>
                     <c ca="left">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>2</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Encephalitazoon cuniculi</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                     <c ca="left">
                        <p>0</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
            <p>The distribution of P450 proteins amongst the fungi is clearly unequal. The Hemiascomycetes and the Schizosaccharomycete <it>Sz. pombe </it>have far fewer P450 proteins than filamentous Ascomycetes and the Basidiomycetes. However, there are also large differences between different filamentous Ascomycetes and Basidiomyctes. Analysis of the result with respect to the placement of P450 proteins in OrthoMCL clusters reveals differences between several Pezizomycotina and <it>P. chrysosporium</it>. Firstly, for the Pezizomycotina species, there are more P450 proteins that are not part of an OrthoMCL cluster than there are for <it>P. chrysosporium</it>. The second difference is that <it>P. chrysosporium </it>P450 proteins are found in far fewer OrthoMCL clusters than the P450 proteins from the Pezizomycotina species. This difference is due in part to a few highly duplicated <it>P. chrysosporium </it>genes. Using the query 'Get annotation for proteins in a given OrthoMCL cluster' from the group 'Annotation of proteins in clusters', for example, shows that OrthoMCL cluster ORTHOMCL3134 contains 32 P450 <it>P. chrysosporium </it>paralogues and no proteins from any other species, while cluster ORTHOMCL190 contains 53 proteins including 14 <it>P. chrysosporium </it>proteins. In summary, our analysis identifies enormous differences in the P450omes of different fungal species. It is clear that budding yeasts and fission yeasts possess much smaller P450omes than Pezizomycetes, Basidiomycetes and Zygomycetes. More detailed analysis of five fungal species with large P450omes demonstrates that those of the four Pezizomycotina species appear to possess greater sequence diversity and less tandem duplication.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Queries provided by e-Fungi are focussed on the analysis of biological or evolutionary phenomena, such as gene duplication, or expansion and contraction of protein families in related species, rather than on sequence level comparisons of genomes, genes and proteins. Even though the clusters, forming the foundation for the majority of analyses provided, are created based on sequence similarity of proteins, the queries are not limited to retrieval of the results of those sequence-based analyses, but instead correlate cluster information with additional information, such as functional annotation, or prediction of protein sub-cellular localisations or identified Pfam motifs.</p>
         <p>A number of data repositories support comparative genomics analysis, for example, the UCSC Genome Browser <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B52">52</abbr></abbrgrp>, NCBI <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and Ensembl <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>, all of which integrate a wide variety of genomes from different kingdoms, coliBase/xBase <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>, Microbase <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>, MolliGen <abbrgrp><abbr bid="B58">58</abbr></abbrgrp> and the Comprehensive Microbial Resource (CMR) <abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, which are data sources dedicated to bacterial comparative genomics. Furthermore, there is the Integrated Microbial Genomes (IMG) system <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, integrating a large number of microbial genomes, amongst them a smaller number of eukaryotes, and G&#233;nolevures <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> which contains 14 hemiascomycetous yeasts. These data repositories differ with respect to the number and diversity of genomes they cover, but also in the kinds of data integrated in addition to sequence data, and the analyses they provide.</p>
         <p>Ensembl, NCBI, coliBase/xBase, CMR and Microbase integrate predominantly sequence data and provide comparative analyses based on nucleotide sequence similarities and orthologous proteins. In addition to sequence data, G&#233;nolevures integrates pathway and Pfam data, CMR also captures functional annotation and pathways, MolliGen provides pathway data, and both UCSC and IMG integrate amongst other data Pfam, functional annotation and pathway data, all of which have also been integrated into e-Fungi. However, despite the differences in the number of genomes and types of additional data integrated, search facilities tend to be quite similar, and limited to sequence similarity- or keyword-, identifier- or name-based searches for retrieval of the stored data. Such search facilities tend to be straightforward and self-explanatory to use, but less suitable for complex analyses of stored data, than those provided by e-Fungi.</p>
         <p>However, in addition to data retrieval and sequence based comparisons, NCBI, MolliGen, G&#233;nolevures and IMG provide analyses that are aimed at understanding molecular evolution and are to some extent similar in scope to analyses provided by e-Fungi. Such analyses include the study of conservation of proteins between genomes or groups of genomes, as well as the conservation of pathways. However, these analyses are provided by bespoke analysis tools that are not part of the general query and analysis infrastructure, unlike in e-Fungi. Such tools include TaxPlots provided by the NCBI <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, Phylogenetic Profiler and Abundance Profiler provided by IMG <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>, or the multi-proteome differential analysis facility provided by MolliGen <abbrgrp><abbr bid="B58">58</abbr></abbrgrp>. Using bespoke tools for complex analyses is limiting in terms of scalability, as new tools have to be developed to provide complex analyses of different kinds. As complex analyses are part of the e-Fungi query and analysis infrastructure, new queries analysing different kinds of data can easily be added. The e-Fungi database integrates and makes available various of the data sets that have been used in previous comparative studies (e.g. <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr></abbrgrp>) but that have not typically been central to genomic databases. With its cluster-based genome comparison analyses, its integration of a variety of other kinds of information in addition to sequence and orthologue data, and its complex analysis tasks, e-Fungi moves away from sequence-based comparative genomics data sources that can predominantly be accessed by keyword or gene identifier-based queries.</p>
         <p>The e-Fungi database is updated and extended in the form of themed releases, with 'Sequence' and 'Functional annotation' being the first two releases, and 'Functional genomics' the next release scheduled for the end of 2007. Not only are new types of data and new queries added according to the theme of the release, but also new genomes are added. For each release, all the derived data, including clustering and PFAM analysis, is regenerated and updated.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>The e-Fungi database integrates a large number of diverse fungal genomes and complements the wealth of genomic data with derived data generated by a range of analyses performed on the genomic data of all the genomes. The e-Fungi database is unique in the diversity of data that it provides for the large number of genomes it integrates. It is also unique in terms of the extensive canned query library for the analysis of the stored data it provides. The canned queries are motivated by recent comparative studies carried out to improve our understanding of evolutionary biology.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and requirements</p>
         </st>
         <p>The e-Fungi database can be accessed freely at <url>http://www.e-fungi.org.uk</url>. e-Fungi WSDL files can also be obtained from the website.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>CH and HMW implemented the e-Fungi infrastructure. CH, HMW, IA, MC wrote the initial draft. NWP and MR provided feedback on the initial draft. CH has revised the draft. DMS, MR, SJH, NJT, SGO and NWP provided input on the development and direction of the warehouse. All authors have read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>The e-Fungi project is funded by the BBSRC Bioinformatics and E-science Programme II. We gratefully acknowledge the support of the North-West Grid, Manchester.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The Genomes On Line Database (GOLD) v.2: a monitor of genome projects worldwide</p>
            </title>
            <aug>
               <au>
                  <snm>Liolios</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Tavernarakis</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hugenholtz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D332</fpage>
            <lpage>D334</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347507</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381880</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Ten years of bacterial genome sequencing: comparative-genomics-based discoveries</p>
            </title>
            <aug>
               <au>
                  <snm>Binnewies</snm>
                  <fnm>TT</fnm>
               </au>
               <au>
                  <snm>Motro</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Hallin</snm>
                  <fnm>PF</fnm>
               </au>
               <au>
                  <snm>Lund</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Dunn</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>La</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hampson</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Bellgard</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Wassenaar</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Ussery</snm>
                  <fnm>DW</fnm>
               </au>
            </aug>
            <source>Functional &amp; Integrative Genomics</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>165</fpage>
            <lpage>185</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16773396</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Comparative Genomics of Trypanosomatid Parasitic Protozoa</p>
            </title>
            <aug>
               <au>
                  <snm>El-Sayed</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Myler</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Blandin</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Crabtree</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Aggarwal</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Caler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Renauld</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Worthey</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Hertz-Fowler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ghedin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Peacocl</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bartholomeu</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Tran</snm>
                  <fnm>AN</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Alsmark</snm>
                  <fnm>UCM</fnm>
               </au>
               <au>
                  <snm>Angiuoli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Anupama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Badger</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Bringaud</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cadag</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Cerqueira</snm>
                  <fnm>GC</fnm>
               </au>
               <au>
                  <snm>Creasy</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Delcher</snm>
                  <fnm>AL</fnm>
               </au>
               <au>
                  <snm>Djikeng</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Hauser</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ivens</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Kummerfeld</snm>
                  <fnm>SK</fnm>
               </au>
               <au>
                  <snm>Pereira-Leal</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Nilsson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Peterson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Salzberg</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Shallom</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Sundaram</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Westenberger</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Melville</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Donelson</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Stuart</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>309</volume>
            <fpage>404</fpage>
            <lpage>409</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16020724</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Sequencing of Aspergillus nidulans and comparative analysis with A. fumigatus and A. oryzae</p>
            </title>
            <aug>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Calvo</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Cuomo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Batzoglou</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>SI</fnm>
               </au>
               <au>
                  <snm>Bat&#252;rkmen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spevak</snm>
                  <fnm>CC</fnm>
               </au>
               <au>
                  <snm>Clutterbuck</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kapitonov</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Jurka</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Scazzocchio</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Farman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Purcell</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Braus</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Draht</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Busch</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>D'Enfert</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Bouchier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>GH</fnm>
               </au>
               <au>
                  <snm>Bell-Pedersen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Griffths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Doonan</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Vienken</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Freitag</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Selker</snm>
                  <fnm>EU</fnm>
               </au>
               <au>
                  <snm>Archer</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Penalva</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oakley</snm>
                  <fnm>BR</fnm>
               </au>
               <au>
                  <snm>Momany</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kumagai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Asai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Machida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Denning</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Caddick</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hynes</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paoletti</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sachs</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Osmani</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>BW</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>438</volume>
            <fpage>1105</fpage>
            <lpage>1115</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16372000</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Comparative genomics of nematodes</p>
            </title>
            <aug>
               <au>
                  <snm>Mitreva</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Blaxter</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Bird</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>McCarter</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>TRENDS in Genetics</source>
            <pubdate>2005</pubdate>
            <volume>21</volume>
            <issue>10</issue>
            <fpage>573</fpage>
            <lpage>581</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16099532</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Mitochondrial genome sequences and comparative genomics of Phytopthora ramorum and P. sojae</p>
            </title>
            <aug>
               <au>
                  <snm>Martin</snm>
                  <fnm>FN</fnm>
               </au>
               <au>
                  <snm>Bensasson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tyler</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Boore</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Current Genetics</source>
            <pubdate>2007</pubdate>
            <volume>51</volume>
            <issue>5</issue>
            <fpage>285</fpage>
            <lpage>296</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17310332</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Genomics of the fungal kingdom: Insights into eukaryotic biology</p>
            </title>
            <aug>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Henn</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>LJ</fnm>
               </au>
               <au>
                  <snm>Cuomo</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Birren</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2005</pubdate>
            <volume>15</volume>
            <fpage>1620</fpage>
            <lpage>1631</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16339359</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Functional profiling of the Saccharomyces cerevisiae genome</p>
            </title>
            <aug>
               <au>
                  <snm>Giaever</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chu</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Ni</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Connelly</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Riles</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>V&#233;ronneau</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Dow</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lucau-Danila</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Andr&#233;</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Arkin</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Astromoff</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bakkoury</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Bangham</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Benito</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Brachat</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Campanaro</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Curtiss</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Deutschbauer</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Entian</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Flaherty</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Foury</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Garfinkel</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Gerstein</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Gotte</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>G&#252;ldener</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Hegemann</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Hempel</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Herman</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Jaramillo</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Kelly</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>K&#246;tter</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>LaBonte</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lamb</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Lan</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Liang</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liao</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Liu</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Lussier</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mao</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Menard</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ooi</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Revuelta</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Roberts</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ross-Macdonald</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Scherens</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Schimmack</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Shafer</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Shoemaker</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Sookhai-Mahadeo</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Storms</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Strathern</snm>
                  <fnm>JN</fnm>
               </au>
               <au>
                  <snm>Valle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Voet</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Volckaert</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>yun Wang</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ward</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Wilhelmy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Winzeler</snm>
                  <fnm>EA</fnm>
               </au>
               <au>
                  <snm>Yang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yen</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Youngman</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Bussey</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Boeke</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Snyder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Philippsen</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Johnston</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>418</volume>
            <fpage>387</fpage>
            <lpage>391</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">12140549</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>What's next for Bioinformatics?</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>LD</fnm>
               </au>
            </aug>
            <source>The Scientist</source>
            <pubdate>2005</pubdate>
            <volume>19</volume>
            <issue>10</issue>
            <fpage>31</fpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The first filamentous fungal genome sequences: Aspergillus leads the way for essential everyday resources or dusty museum specimens?</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>MG</fnm>
               </au>
            </aug>
            <source>Microbiology</source>
            <pubdate>2007</pubdate>
            <volume>153</volume>
            <fpage>1</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">17185529</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Expanded protein information at SGD: new pages and proteome browser</p>
            </title>
            <aug>
               <au>
                  <snm>Nash</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Weng</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hitz</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Balakrishnan</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Christie</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Costanzo</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Dwight</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Engel</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Fisk</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Hirschman</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Hong</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Livstone</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Oughtred</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Skrzypek</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Theesfeld</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Binkley</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dong</snm>
                  <fnm>Q</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Miyasato</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Sethuraman</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schroeder</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Dolinski</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Botstein</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cherry</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D468</fpage>
            <lpage>D471</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669759</pubid>
                  <pubid idtype="pmpid" link="fulltext">17142221</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Fungal Genome Initiative</p>
            </title>
            <url>http://www.broad.mit.edu/annotation/fgi/</url>
         </bibl>
         <bibl id="B13">
            <title>
               <p>The integrated microbial genomes (IMG) system</p>
            </title>
            <aug>
               <au>
                  <snm>Markowitz</snm>
                  <fnm>VM</fnm>
               </au>
               <au>
                  <snm>Korzeniewski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Palaniappan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Szeto</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Werner</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Padki</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>X</fnm>
               </au>
               <au>
                  <snm>Dubchak</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hugenholtz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Lykidis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mavromatis</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ivanova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kyrpides</snm>
                  <fnm>NC</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D344</fpage>
            <lpage>D348</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347387</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381883</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Database resources of the National Center for Biotechnology Information</p>
            </title>
            <aug>
               <au>
                  <snm>Wheeler</snm>
                  <fnm>DL</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Benson</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Canese</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Chetvernin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Church</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>DiCuccio</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Edgar</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Federhen</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Geer</snm>
                  <fnm>LY</fnm>
               </au>
               <au>
                  <snm>Kapustin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Khovayko</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Landsman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Madden</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Maglott</snm>
                  <fnm>DR</fnm>
               </au>
               <au>
                  <snm>Ostell</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Pruitt</snm>
                  <fnm>KD</fnm>
               </au>
               <au>
                  <snm>Schuler</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Sequeira</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Sherry</snm>
                  <fnm>ST</fnm>
               </au>
               <au>
                  <snm>Sirotkin</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Souvorov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Starchenko</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Tatusova</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Yaschenko</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2007</pubdate>
            <volume>35</volume>
            <fpage>D5</fpage>
            <lpage>D12</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1781113</pubid>
                  <pubid idtype="pmpid" link="fulltext">17170002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>The Gene Ontology (GO) project in 2006</p>
            </title>
            <aug>
               <au>
                  <cnm>Gene Ontology Consortium</cnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <issue>Database issue</issue>
            <fpage>D322</fpage>
            <lpage>D326</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347384</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381878</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Pfam: clans, web tools and services</p>
            </title>
            <aug>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schuster-B&#246;ckler</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Griffths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hollich</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lassmann</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Moxon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Marshall</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Khanna</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eddy</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Sonnhammer</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Bateman</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D247</fpage>
            <lpage>D251</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347511</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381856</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>From genomics to chemical genomics: new developments in KEGG</p>
            </title>
            <aug>
               <au>
                  <snm>Kanehisa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Goto</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hattori</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Aoki-Kinoshita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Itoh</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kawashima</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Katayama</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Araki</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirakawa</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D354</fpage>
            <lpage>357</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347464</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381885</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Reactome: a knowledgebase of biological pathways</p>
            </title>
            <aug>
               <au>
                  <snm>Joshi-Tope</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gillespie</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Vastrik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>de Bono</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Jassal</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Gopinath</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Wu</snm>
                  <fnm>GR</fnm>
               </au>
               <au>
                  <snm>Matthews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Stein</snm>
                  <fnm>L</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2005</pubdate>
            <issue>33 Database</issue>
            <fpage>D428</fpage>
            <lpage>D432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540026</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608231</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>MetaCyc: a multiorganism database of metabolic pathways and enzymes</p>
            </title>
            <aug>
               <au>
                  <snm>Caspi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Foerster</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Fulcher</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Hopkinson</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Ingraham</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kaipa</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krummenacker</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paley</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Pick</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Rhee</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Tissier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Zhang</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Karp</snm>
                  <fnm>PD</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D511</fpage>
            <lpage>D516</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347490</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381923</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>PCAS &#8211; a precomputed proteome annotation database resource</p>
            </title>
            <aug>
               <au>
                  <snm>Zhang</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Yin</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gao</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Luo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">293463</pubid>
                  <pubid idtype="pmpid" link="fulltext">14594458</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>SPdb &#8211; a signal peptide database</p>
            </title>
            <aug>
               <au>
                  <snm>Choo</snm>
                  <fnm>KH</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Ranganathan</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>249</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1276010</pubid>
                  <pubid idtype="pmpid" link="fulltext">16221310</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>G&#233;nolevures complete genomes provide data and tools for comparative genomics of hemiascomycetous yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Sherman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Durrens</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Iragne</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Beyne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Nikolski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Souciet</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D432</fpage>
            <lpage>D435</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347522</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381905</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Ensembl 2007</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>TJP</fnm>
               </au>
               <au>
                  <snm>Aken</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Beal</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ballester</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dyer</snm>
                  <fnm>SC</fnm>
               </au>
               <au>
                  <snm>Fitzgerald</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fernandez-Banet</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Graf</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Haider</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Herrero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Holland</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kahari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kokocinski</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kulesha</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Lawson</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Megy</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Meidl</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Ouverdin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Parker</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Prlic</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rios</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Sealy</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Severin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Spudich</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Trevanion</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Vilella</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fernandez-Suarez</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Flicek</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Proctor</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ureta-Vidal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D610</fpage>
            <lpage>617</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1761443</pubid>
                  <pubid idtype="pmpid" link="fulltext">17148474</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The UCSC genome browser database: update 2007</p>
            </title>
            <aug>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Zweig</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Trumbower</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>Thakkapallayil</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Sugnet</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Stanke</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>KR</fnm>
               </au>
               <au>
                  <snm>Rhead</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Raney</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>GP</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>WJ</fnm>
               </au>
            </aug>
            <source>Nucl Acids Res</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D668</fpage>
            <lpage>673</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669757</pubid>
                  <pubid idtype="pmpid" link="fulltext">17142222</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>e-Fungi</p>
            </title>
            <url>http://www.e-fungi.org.uk</url>
         </bibl>
         <bibl id="B26">
            <title>
               <p>GeneDB: a resource for prokaryotic and eukaryotic organisms</p>
            </title>
            <aug>
               <au>
                  <snm>Hertz-Fowler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Peacock</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Wood</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Aslett</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kerhornou</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mooney</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Tivey</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rutherford</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Parkhill</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Ivens</snm>
                  <fnm>AC</fnm>
               </au>
               <au>
                  <snm>Rajandream</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D339</fpage>
            <lpage>D343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308742</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681429</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Sequence resources at the Candida Genome Database</p>
            </title>
            <aug>
               <au>
                  <snm>Arnaud</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Costanzo</snm>
                  <fnm>MC</fnm>
               </au>
               <au>
                  <snm>Skrzypek</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Shah</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Binkley</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Miyasato</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Sherlock</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2007</pubdate>
            <issue>35 Database</issue>
            <fpage>D452</fpage>
            <lpage>D456</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1669745</pubid>
                  <pubid idtype="pmpid" link="fulltext">17090582</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Comparative genomic analysis of phytopathogenic fungi using expressed sequence tag (EST) collections</p>
            </title>
            <aug>
               <au>
                  <snm>Soanes</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Talbot</snm>
                  <fnm>NJ</fnm>
               </au>
            </aug>
            <source>Molecular Plant Pathology</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>61</fpage>
            <lpage>70</lpage>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Basic local alignment search tool</p>
            </title>
            <aug>
               <au>
                  <snm>Altschul</snm>
                  <fnm>SF</fnm>
               </au>
               <au>
                  <snm>Gish</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Myers</snm>
                  <fnm>EW</fnm>
               </au>
               <au>
                  <snm>Lipman</snm>
                  <fnm>DJ</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Biology</source>
            <pubdate>1990</pubdate>
            <volume>215</volume>
            <issue>3</issue>
            <fpage>403</fpage>
            <lpage>410</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">2231712</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>An effcient algorithm for large-scale detection of protein families</p>
            </title>
            <aug>
               <au>
                  <snm>Enright</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Dongen</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Ouzounis</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2002</pubdate>
            <volume>30</volume>
            <issue>7</issue>
            <fpage>1575</fpage>
            <lpage>1584</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">101833</pubid>
                  <pubid idtype="pmpid" link="fulltext">11917018</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>OrthoMCL: Identification of Ortholog Groups for Eukaryotic Genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Christian</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stoeckert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Roos</snm>
                  <fnm>DS</fnm>
               </au>
            </aug>
            <source>Genome Research</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>9</issue>
            <fpage>2178</fpage>
            <lpage>2189</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">403725</pubid>
                  <pubid idtype="pmpid" link="fulltext">12952885</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Profile hidden Markov models</p>
            </title>
            <aug>
               <au>
                  <snm>Eddy</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <issue>9</issue>
            <fpage>755</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">9918945</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B33">
            <title>
               <p>Improved Prediction of Signal Peptides: SignalP 3.0</p>
            </title>
            <aug>
               <au>
                  <snm>Bendtsen</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Nielsen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>von Heijne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Brunak</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Journal of Molecular Biology</source>
            <pubdate>2004</pubdate>
            <volume>340</volume>
            <fpage>783</fpage>
            <lpage>795</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15223320</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>PSORT: a program for detecting sorting signals in proteins and predicting their subcellular localization</p>
            </title>
            <aug>
               <au>
                  <snm>Nakai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Horton</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Trends in Biochemical Sciences</source>
            <pubdate>1999</pubdate>
            <volume>24</volume>
            <fpage>34</fpage>
            <lpage>35</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10087920</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Protein Subcellular Localization Prediction with WoLF PSORT</p>
            </title>
            <aug>
               <au>
                  <snm>Horton</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Park</snm>
                  <fnm>KJ</fnm>
               </au>
               <au>
                  <snm>Obayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Nakai</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Proceedings of the 4th Annual Asia Pacific Bioinformatics Conference APBC06, Taipei, Taiwan</source>
            <pubdate>2006</pubdate>
            <fpage>39</fpage>
            <lpage>48</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Versant</p>
            </title>
            <url>http://www.versant.com</url>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Conceptual Modelling of Genomic Information</p>
            </title>
            <aug>
               <au>
                  <snm>Paton</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Hayes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Moussouni</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Brass</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Eilbeck</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Goble</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>SJ</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>6</issue>
            <fpage>548</fpage>
            <lpage>558</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10980152</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>GIMS: an integrated data storage and analysis environment for genomic and functional data</p>
            </title>
            <aug>
               <au>
                  <snm>Cornell</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Paton</snm>
                  <fnm>NW</fnm>
               </au>
               <au>
                  <snm>Hedeler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kirby</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Delneri</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hayes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Yeast</source>
            <pubdate>2003</pubdate>
            <volume>20</volume>
            <issue>15</issue>
            <fpage>1291</fpage>
            <lpage>1306</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">14618567</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Model-driven user interfaces for bioinformatics data resources: regenerating the wheel as an alternative to reinventing it</p>
            </title>
            <aug>
               <au>
                  <snm>Garwood</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Garwood</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hedeler</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Griffths</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Swainston</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Paton</snm>
                  <fnm>NW</fnm>
               </au>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>7</volume>
            <fpage>532</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1713253</pubid>
                  <pubid idtype="pmpid" link="fulltext">17169146</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Whole genome comparison of the A. fumigatus family</p>
            </title>
            <aug>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Crabtree</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Joardar</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Maiti</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Haas</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Amedeo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Angiuoli</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Jiang</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Denning</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Medical Mycology</source>
            <pubdate>2006</pubdate>
            <volume>44</volume>
            <issue>S1</issue>
            <fpage>S3</fpage>
            <lpage>S7</lpage>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Stein</snm>
                  <fnm>LD</fnm>
               </au>
               <au>
                  <snm>Bao</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Blasiar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blumenthal</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Brent</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Chinwalla</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Clee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Coghlan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Coulson</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>D'Eustachio</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Fitch</snm>
                  <fnm>DHA</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Fulton</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Griffths-Jones</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Hillier</snm>
                  <fnm>LW</fnm>
               </au>
               <au>
                  <snm>Kamath</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kuwabara</snm>
                  <fnm>PE</fnm>
               </au>
               <au>
                  <snm>Mardis</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Marra</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Miner</snm>
                  <fnm>TL</fnm>
               </au>
               <au>
                  <snm>Minx</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Mullikin</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Plumb</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Schein</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Sohrmann</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Spieth</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stajich</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Wei</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Willey</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Wilson</snm>
                  <fnm>RK</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Waterston</snm>
                  <fnm>RH</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>E45</fpage>
            <lpage/>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">261899</pubid>
                  <pubid idtype="pmpid" link="fulltext">14624247</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>The Dawn of Fungal Pathogen Genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Dickman</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Sharon</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Annual Review of Phytopathology</source>
            <pubdate>2006</pubdate>
            <volume>44</volume>
            <fpage>337</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16704358</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Genome evolution in yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Sherman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Fischer</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Durrens</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Casaregola</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lafontaine</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>de Montigny</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Neuv&#233;glise</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Talla</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Goffard</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Frangeul</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Aigle</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Anthouard</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Babour</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Barbe</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Barnay</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Blanchin</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beckerich</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Beyne</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Bleykasten</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Boyer</snm>
                  <fnm>ABJ</fnm>
               </au>
               <au>
                  <snm>Cattolico</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Confanioleri</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>de Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Despons</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Fabre</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Fairhead</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ferry-Dumazet</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Groppi</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hantraye</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hennequin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Jauniaux</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Joyet</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kachouri</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Kerrest</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Koszul</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lemaire</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lesur</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Ma</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Muller</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nicaud</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Nikolski</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oztas</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ozier-Kalogeropoulos</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Pellenz</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Potier</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Richard</snm>
                  <fnm>GF</fnm>
               </au>
               <au>
                  <snm>Straub</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Suleau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Swennen</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Tekaia</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>W&#233;solowski-Louvel</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Westhof</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Wirth</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Zeniou-Meyer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zivanovic</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Bolotin-Fukuhara</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Thierry</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bouchier</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Caudron</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Scarpelli</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Gaillardin</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Weissenbach</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Wincker</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Souciet</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>430</volume>
            <fpage>35</fpage>
            <lpage>44</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15229592</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Yeasts illustrate the molecular mechanisms of eukaryotic genome evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>TRENDS in Genetics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>7</issue>
            <fpage>375</fpage>
            <lpage>387</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16730849</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Comparative analysis of programmed cell death pathways in filamentous fungi</p>
            </title>
            <aug>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Badger</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Robson</snm>
                  <fnm>GD</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>WC</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>177</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1325252</pubid>
                  <pubid idtype="pmpid" link="fulltext">16336669</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Taverna: a tool for the composition and enactment of bioinformatics workflows</p>
            </title>
            <aug>
               <au>
                  <snm>Oinn</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Addis</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ferris</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marvin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Senger</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Greenwood</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Carver</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Glover</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Pocock</snm>
                  <fnm>MR</fnm>
               </au>
               <au>
                  <snm>Wipat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>17</issue>
            <fpage>3045</fpage>
            <lpage>3054</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">15201187</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Soaplab &#8211; a unified Sesame door to analysis tools</p>
            </title>
            <aug>
               <au>
                  <snm>Senger</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rice</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Oinn</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Proceedings, UK e-Science, All Hands Meeting 2003</source>
            <editor>Cox SJ</editor>
            <pubdate>2003</pubdate>
            <fpage>509</fpage>
            <lpage>513</lpage>
         </bibl>
         <bibl id="B48">
            <title>
               <p>Axis</p>
            </title>
            <url>http://ws.apache.org/axis/</url>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Whole genome comparison of Aspergillus flavus and A. oryzae</p>
            </title>
            <aug>
               <au>
                  <snm>Payne</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pritchard</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brown</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dean</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Bhatnagar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cleveland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Machida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Medical Mycology</source>
            <pubdate>2006</pubdate>
            <volume>44</volume>
            <fpage>S9</fpage>
            <lpage>S11</lpage>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Genome-wide structural and evolutionary analysis of the P450 monooxygenase genes (P450ome) in the white rot fungus Phanerochaete chrysosporium: Evidence for gene duplications and extensive gene clustering</p>
            </title>
            <aug>
               <au>
                  <snm>Doddapaneni</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Chakraborty</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Yadav</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>BMC Genomics</source>
            <pubdate>2005</pubdate>
            <volume>6</volume>
            <fpage>92</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1184071</pubid>
                  <pubid idtype="pmpid" link="fulltext">15955240</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>Genome sequencing and analysis of Aspergillus oryzae</p>
            </title>
            <aug>
               <au>
                  <snm>Machida</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Asai</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sano</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kumagai</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Terai</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Kusumoto</snm>
                  <fnm>KI</fnm>
               </au>
               <au>
                  <snm>Arima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Akita</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Kashiwagi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Abe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Gomi</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Horiuchi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Kitamoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kobayashi</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Takeuchi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Denning</snm>
                  <fnm>DW</fnm>
               </au>
               <au>
                  <snm>Galagan</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Nierman</snm>
                  <fnm>WC</fnm>
               </au>
               <au>
                  <snm>Yu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Archer</snm>
                  <fnm>DB</fnm>
               </au>
               <au>
                  <snm>Bennett</snm>
                  <fnm>JW</fnm>
               </au>
               <au>
                  <snm>Bhatnagar</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Cleveland</snm>
                  <fnm>TE</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Gotoh</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Horikawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hosoyama</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Ichinomiya</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Igarashi</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Iwashita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Juvvadi</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kato</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Kin</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kokubun</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Maeda</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Maeyama</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>ichi Maruyama</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Nagasaki</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nakajima</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Oda</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Okada</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Paulsen</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Sakamoto</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sawano</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Takahashi</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Takase</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Terabayashi</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wortman</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Yamada</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Yamagata</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Anazawa</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Hata</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Koide</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Komori</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Koyama</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Minetoki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Suharnan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Tanaka</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Isono</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kuhara</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ogasawara</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Kikuchi</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>438</volume>
            <issue>7071</issue>
            <fpage>1157</fpage>
            <lpage>1161</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16372010</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>The UCSC Genome Browser Database: update 2006</p>
            </title>
            <aug>
               <au>
                  <snm>Hinrichs</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Karolchik</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Baertsch</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Barber</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bejerano</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Clawson</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Diekhans</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Furey</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Harte</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hsu</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hillman-Jackson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Kuhn</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pedersen</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pohl</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Raney</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Rosenbloom</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Siepel</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Sugnet</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Sultan-Qurraie</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Trumbower</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Weirauch</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Zweig</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Haussler</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Kent</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D590</fpage>
            <lpage>D598</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347506</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381938</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>Ensembl 2002: accommodating comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Barker</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bevan</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cuff</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Eyras</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lehvaslaiho</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Mongin</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Pettett</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Potter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Rust</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schmidt</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Stabenau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stalker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Stupka</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Ureta-Vidal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vastrik</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2003</pubdate>
            <volume>31</volume>
            <issue>1</issue>
            <fpage>38</fpage>
            <lpage>42</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">165530</pubid>
                  <pubid idtype="pmpid" link="fulltext">12519943</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Ensembl 2005</p>
            </title>
            <aug>
               <au>
                  <snm>Hubbard</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Andrews</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Caccamo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Cameron</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Chen</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Clamp</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Coates</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Curwen</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cutts</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Down</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Durbin</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fernandez-Suarez</snm>
                  <fnm>XM</fnm>
               </au>
               <au>
                  <snm>Gilbert</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hammond</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Herrero</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Hotz</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Howe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Iyer</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Jekosch</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Kahari</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Kasprzyk</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Keefe</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Keenan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kokocinsci</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>London</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Longden</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>McVicker</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Melsopp</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Meidl</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Potter</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Proctor</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Rae</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Rios</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Schuster</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Searle</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Severin</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Slater</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Smedley</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Spooner</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Stabenau</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stalker</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Storey</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Trevanion</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Ureta-Vidal</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Vogel</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Woodwark</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Birney</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2005</pubdate>
            <issue>33 Database</issue>
            <fpage>D447</fpage>
            <lpage>D453</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">540092</pubid>
                  <pubid idtype="pmpid" link="fulltext">15608235</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>coliBASE: an online database for Escherichia coli, Shigella and Salmonella comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Chaudhuri</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Khan</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Pallen</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <issue>32(Database issue)</issue>
            <fpage>D296</fpage>
            <lpage>D299</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308765</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681417</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>xBASE, a collection of online databases for bacterial comparative genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Chaudhuri</snm>
                  <fnm>RR</fnm>
               </au>
               <au>
                  <snm>Pallen</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2006</pubdate>
            <issue>34 Database</issue>
            <fpage>D335</fpage>
            <lpage>D337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1347502</pubid>
                  <pubid idtype="pmpid" link="fulltext">16381881</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>A Grid-based System for Microbial Genome Comparison and Analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Sun</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Wipat</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Pocock</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>PA</fnm>
               </au>
               <au>
                  <snm>Watson</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Flanagan</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Worthington</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>Proceedings of the 2005 IEEE International Symposium on Cluster Computing and the Grid (CCGrid 2005)</source>
            <publisher>Cardiff, Wales: IEEE Computer Society</publisher>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>977</fpage>
            <lpage>984</lpage>
         </bibl>
         <bibl id="B58">
            <title>
               <p>MolliGen, a database dedicated to the comparative genomics of Mollicutes</p>
            </title>
            <aug>
               <au>
                  <snm>Barr&#233;</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>de Daruvar</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Blanchard</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2004</pubdate>
            <issue>32 Database</issue>
            <fpage>D307</fpage>
            <lpage>D310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308848</pubid>
                  <pubid idtype="pmpid" link="fulltext">14681420</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>The Comprehensive Microbial Resource</p>
            </title>
            <aug>
               <au>
                  <snm>Peterson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Umayam</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Dickinson</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>EK</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Research</source>
            <pubdate>2001</pubdate>
            <volume>29</volume>
            <fpage>123</fpage>
            <lpage>125</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">29848</pubid>
                  <pubid idtype="pmpid" link="fulltext">11125067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
