<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-7-134</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Cophenetic correlation analysis as a strategy to select phylogenetically informative proteins: an example from the fungal kingdom</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Kuramae</snm>
               <mi>E</mi>
               <fnm>Eiko</fnm>
               <insr iid="I1"/>
               <email>e.kuramae@nioo.knaw.nl</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Robert</snm>
               <fnm>Vincent</fnm>
               <insr iid="I1"/>
               <email>v.robert@cbs.knaw.nl</email>
            </au>
            <au id="A3">
               <snm>Echavarri-Erasun</snm>
               <fnm>Carlos</fnm>
               <insr iid="I1"/>
               <email>c.echavar@cbs.knaw.nl</email>
            </au>
            <au id="A4">
               <snm>Boekhout</snm>
               <fnm>Teun</fnm>
               <insr iid="I1"/>
               <email>t.boekhout@cbs.knaw.nl</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Yeast Research, CBS-Fungal Biodiversity Centre, Uppsalalaan 8, 3584 CT Utrecht, The Netherlands</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2007</pubdate>
         <volume>7</volume>
         <issue>1</issue>
         <fpage>134</fpage>
         <url>http://www.biomedcentral.com/1471-2148/7/134</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17688684</pubid>
               <pubid idtype="doi">10.1186/1471-2148-7-134</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>03</day>
               <month>10</month>
               <year>2006</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>09</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>09</day>
               <month>8</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Kuramae et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>The construction of robust and well resolved phylogenetic trees is important for our understanding of many, if not all biological processes, including speciation and origin of higher taxa, genome evolution, metabolic diversification, multicellularity, origin of life styles, pathogenicity and so on. Many older phylogenies were not well supported due to insufficient phylogenetic signal present in the single or few genes used in phylogenetic reconstructions. Importantly, single gene phylogenies were not always found to be congruent. The phylogenetic signal may, therefore, be increased by enlarging the number of genes included in phylogenetic studies. Unfortunately, concatenation of many genes does not take into consideration the evolutionary history of each individual gene. Here, we describe an approach to select informative phylogenetic proteins to be used in the Tree of Life (TOL) and barcoding projects by comparing the cophenetic correlation coefficients (CCC) among individual protein distance matrices of proteins, using the fungi as an example. The method demonstrated that the quality and number of concatenated proteins is important for a reliable estimation of TOL. Approximately 40&#8211;45 concatenated proteins seem needed to resolve fungal TOL.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>In total 4852 orthologous proteins (KOGs) were assigned among 33 fungal genomes from the Asco- and Basidiomycota and 70 of these represented single copy proteins. The individual protein distance matrices based on 531 concatenated proteins that has been used for phylogeny reconstruction before <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> were compared one with another in order to select those with the highest CCC, which then was used as a reference. This reference distance matrix was compared with those of the 70 single copy proteins selected and their CCC values were calculated. Sixty four KOGs showed a CCC above 0.50 and these were further considered for their phylogenetic potential. Proteins belonging to the cellular processes and signaling KOG category seem more informative than those belonging to the other three categories: information storage and processing; metabolism; and the poorly characterized category. After concatenation of 40 proteins the topology of the phylogenetic tree remained stable, but after concatenation of 60 or more proteins the bootstrap support values of some branches decreased, most likely due to the inclusion of proteins with lowers CCC values. The selection of protein sequences to be used in various TOL projects remains a critical and important process. The method described in this paper will contribute to a more objective selection of phylogenetically informative protein sequences.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>This study provides candidate protein sequences to be considered as phylogenetic markers in different branches of fungal TOL. The selection procedure described here will be useful to select informative protein sequences to resolve branches of TOL that contain few or no species with completely sequenced genomes. The robust phylogenetic trees resulting from this method may contribute to our understanding of organismal diversification processes. The method proposed can be extended easily to other branches of TOL.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Many biological processes can be better understood in the framework of reliable phylogenetic analyses. This is not only true for our understanding of evolutionary systematics and phylogenetics, including TOL, but it will also largely contribute to our understanding of diversification at the subcellular, cellular and organismal levels of integration. One well documented example in this respect is the postulated whole-genome duplication (WGD) that occurred during the evolution of some species belonging to the Saccharomycotina <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Only using a correctly inferred phylogenetic TOL it was possible to distinguish between "pre-WGD" and "post-WGD" species of Saccharomycotina. Other examples refer to our understanding of evolution of metabolic pathways <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, structure of genomes <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>, life styles <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, and pathogenicity <abbrgrp><abbr bid="B6">6</abbr></abbrgrp>.</p>
         <p>Until recently, our understanding of the (fungal) TOL has been based on two approaches, which basically differ in number of species and genes considered: (1) few genes and large number of species; (2) large number of genes and few species. The clear advantage of the first approach is the availability of many sequences, e.g. of the rDNA locus, in publicly available databases (i.e. National Center for Biotechnology Information &#8211; NCBI), and, secondly, it is generally rather easy to generate complete or partial sequences of a few genes for a large number of species. Besides, the rDNA loci have the clear advantage of being universally present in all branches of TOL, universal primers are well known and it has been successfully explored in many branches of TOL. The disadvantage of the rDNA loci, however, is that the deeper branches are usually less supported <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. As an answer to this, various authors started to include multiple protein coding genes in their phylogenetic analyses <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr></abbrgrp>. Unfortunately, the rationale behind the selection of these protein coding genes is not always clear, and discrepancies and incongruences between individual gene trees may result in unresolved phylogenetic trees <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr></abbrgrp>. This may be due to different evolutionary rates, and/or different origins of the genes, e.g. whether nuclearly encoded (e.g. RPB1 and RPB2) or mitochondrial in origin (e.g. ATP6). In the second approach, large numbers of genes have been used for phylogenetic studies as an attempt to contribute to the first approach described above. This was firstly applied in the prokaryotes <abbrgrp><abbr bid="B11">11</abbr></abbrgrp> and, more recently, in eukaryotes as well <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr></abbrgrp>. A large selection of genes and/or proteins are concatenated and used for inferring phylogenetic relationships, thereby increasing the phylogenetic signal considerably <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. However, although this approach resolved the fungal phylogenetic tree <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B14">14</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp> it also suffers from some limitations. For instance, it does not take into consideration the evolutionary history of each individual gene and it depends on the availability of complete genome data.</p>
         <p>Here, we explored the usefulness of comparing the cophenetic correlation coefficients (CCCs) among distance matrices of individual gene trees in order to make a phylogenetically meaningful selection of orthologs to be considered for further phylogenomics studies as well as large scale TOL and barcoding applications. We used the fungal kingdom as an example as it represents one of the major clades of life with approximately 1.5 million species <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, of which only approximately 80.000 have been described. Moreover, the fungi are morphologically, metabolically and ecologically highly diverse and, importantly, the number of completely sequenced genomes is high among the eukaryotes.</p>
         <p>Candidate proteins to be considered for TOL and/or barcoding studies were assessed from 33 fungal proteomes by comparing (i) distance matrices of each individual orthologous protein (KOGs) matrix, (ii) to compare these with that of a well supported guide tree <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, and (iii) analyze for their phylogenetic signal. The method presented here may be universally applied for the selection of markers in various TOL and barcoding studies.</p>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>The 33 genomes investigated shared 4852 KOGs from which 70 were single copy proteins. The function of these 70 KOGs was assessed from the <it>Saccharomyces cerevisiae </it>genome database <abbrgrp><abbr bid="B19">19</abbr></abbrgrp> (Additional file <supplr sid="S1">1</supplr>). The corresponding systematic name, standard name, description, chromosome number and knock out phenotype are presented in Table <tblr tid="T1">1</tblr> (Additional file <supplr sid="S1">1</supplr>). Knock out phenotypes of 32 genes were lethal (Table <tblr tid="T1">1</tblr>) when deleted in <it>S. cerevisiae </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>, thus suggesting that they code for essential proteins. Genes coding for the 70 KOG proteins are distributed on almost all chromosomes of <it>S. cerevisiae</it>, except chromosome VI (Table <tblr tid="T1">1</tblr>), thus representing the entire genome.</p>
         <suppl id="S1">
            <title>
               <p>Additional file 1</p>
            </title>
            <text>
               <p>Correlation values of KOG distance matrices compared to KOG2671, KOG functional category, the corresponding single protein KOGs to the systematic name, standard name, description, systematic deletion and chromosome number of ORFs of <it>Saccharomyces cerevisiae </it>according to <url>http://www.yeastgenome.org/</url><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
            </text>
            <file name="1471-2148-7-134-S1.xls">
               <p>Click here for file</p>
            </file>
         </suppl>
         <tbl id="T1">
            <title>
               <p>Table 1</p>
            </title>
            <caption>
               <p>Correlation values of KOG distance matrices compared to that of KOG2671, KOG functional category, the corresponding single protein KOGs to the systematic name, systematic deletion and chromosome number of ORFs of <it>Saccharomyce cerevisae </it>(Sce) [19].</p>
            </caption>
            <tblbdy cols="5">
               <r>
                  <c ca="left">
                     <p>
                        <b>Correlation value</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>KOG number</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Sce Systematic name</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Systematic deletion</b>
                     </p>
                  </c>
                  <c ca="left">
                     <p>
                        <b>Chromosome number</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="5">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>1.00</p>
                  </c>
                  <c ca="left">
                     <p>KOG2671</p>
                  </c>
                  <c ca="left">
                     <p>YOL124c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.93</p>
                  </c>
                  <c ca="left">
                     <p>KOG0340</p>
                  </c>
                  <c ca="left">
                     <p>YHR169w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VIII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.91</p>
                  </c>
                  <c ca="left">
                     <p>KOG4089</p>
                  </c>
                  <c ca="left">
                     <p>YDR405w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.91</p>
                  </c>
                  <c ca="left">
                     <p>KOG0173</p>
                  </c>
                  <c ca="left">
                     <p>YOR157C</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.91</p>
                  </c>
                  <c ca="left">
                     <p>KOG2728</p>
                  </c>
                  <c ca="left">
                     <p>YIL083c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>IX</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.90</p>
                  </c>
                  <c ca="left">
                     <p>KOG3111</p>
                  </c>
                  <c ca="left">
                     <p>YJL121c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.89</p>
                  </c>
                  <c ca="left">
                     <p>KOG3800</p>
                  </c>
                  <c ca="left">
                     <p>YDR460w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.89</p>
                  </c>
                  <c ca="left">
                     <p>KOG3024</p>
                  </c>
                  <c ca="left">
                     <p>YOR164c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.89</p>
                  </c>
                  <c ca="left">
                     <p>KOG0816</p>
                  </c>
                  <c ca="left">
                     <p>YKL009w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.89</p>
                  </c>
                  <c ca="left">
                     <p>KOG2905</p>
                  </c>
                  <c ca="left">
                     <p>YGR005c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.89</p>
                  </c>
                  <c ca="left">
                     <p>KOG3013</p>
                  </c>
                  <c ca="left">
                     <p>YHR069c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VIII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.89</p>
                  </c>
                  <c ca="left">
                     <p>KOG1416</p>
                  </c>
                  <c ca="left">
                     <p>YNL062c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XIV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.88</p>
                  </c>
                  <c ca="left">
                     <p>KOG2299</p>
                  </c>
                  <c ca="left">
                     <p>YNL072w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.88</p>
                  </c>
                  <c ca="left">
                     <p>KOG3045</p>
                  </c>
                  <c ca="left">
                     <p>YDR083w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.88</p>
                  </c>
                  <c ca="left">
                     <p>KOG3003</p>
                  </c>
                  <c ca="left">
                     <p>YOR232w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.87</p>
                  </c>
                  <c ca="left">
                     <p>KOG4018</p>
                  </c>
                  <c ca="left">
                     <p>YDR152w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.87</p>
                  </c>
                  <c ca="left">
                     <p>KOG3786</p>
                  </c>
                  <c ca="left">
                     <p>YLR418c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.87</p>
                  </c>
                  <c ca="left">
                     <p>KOG3789</p>
                  </c>
                  <c ca="left">
                     <p>YEL062w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>V</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.86</p>
                  </c>
                  <c ca="left">
                     <p>KOG0809</p>
                  </c>
                  <c ca="left">
                     <p>YOL018c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.86</p>
                  </c>
                  <c ca="left">
                     <p>KOG4093</p>
                  </c>
                  <c ca="left">
                     <p>YPL225w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.86</p>
                  </c>
                  <c ca="left">
                     <p>KOG3015</p>
                  </c>
                  <c ca="left">
                     <p>YJL180c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.86</p>
                  </c>
                  <c ca="left">
                     <p>KOG2487</p>
                  </c>
                  <c ca="left">
                     <p>YPR056w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.85</p>
                  </c>
                  <c ca="left">
                     <p>KOG0438</p>
                  </c>
                  <c ca="left">
                     <p>YEL050c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>V</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.85</p>
                  </c>
                  <c ca="left">
                     <p>KOG0645</p>
                  </c>
                  <c ca="left">
                     <p>YDR267c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.85</p>
                  </c>
                  <c ca="left">
                     <p>KOG2851</p>
                  </c>
                  <c ca="left">
                     <p>YIR008c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>IX</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.85</p>
                  </c>
                  <c ca="left">
                     <p>KOG2267</p>
                  </c>
                  <c ca="left">
                     <p>YKL045w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.84</p>
                  </c>
                  <c ca="left">
                     <p>KOG2732</p>
                  </c>
                  <c ca="left">
                     <p>YJR006w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.84</p>
                  </c>
                  <c ca="left">
                     <p>KOG2021</p>
                  </c>
                  <c ca="left">
                     <p>YKL205w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.83</p>
                  </c>
                  <c ca="left">
                     <p>KOG0991</p>
                  </c>
                  <c ca="left">
                     <p>YOL094c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.83</p>
                  </c>
                  <c ca="left">
                     <p>KOG3224</p>
                  </c>
                  <c ca="left">
                     <p>YPR040w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.83</p>
                  </c>
                  <c ca="left">
                     <p>KOG2994</p>
                  </c>
                  <c ca="left">
                     <p>YML021c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.82</p>
                  </c>
                  <c ca="left">
                     <p>KOG3103</p>
                  </c>
                  <c ca="left">
                     <p>YGR172c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.82</p>
                  </c>
                  <c ca="left">
                     <p>KOG1598</p>
                  </c>
                  <c ca="left">
                     <p>YGR246c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.82</p>
                  </c>
                  <c ca="left">
                     <p>KOG0436</p>
                  </c>
                  <c ca="left">
                     <p>YGR171c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.81</p>
                  </c>
                  <c ca="left">
                     <p>KOG2326</p>
                  </c>
                  <c ca="left">
                     <p>YMR106C</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.81</p>
                  </c>
                  <c ca="left">
                     <p>KOG1355</p>
                  </c>
                  <c ca="left">
                     <p>YNL220w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.81</p>
                  </c>
                  <c ca="left">
                     <p>KOG1741</p>
                  </c>
                  <c ca="left">
                     <p>YPR166c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.80</p>
                  </c>
                  <c ca="left">
                     <p>KOG3381</p>
                  </c>
                  <c ca="left">
                     <p>YHR122w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VIII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.79</p>
                  </c>
                  <c ca="left">
                     <p>KOG3244</p>
                  </c>
                  <c ca="left">
                     <p>YDR204w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.79</p>
                  </c>
                  <c ca="left">
                     <p>KOG1534</p>
                  </c>
                  <c ca="left">
                     <p>YLR243w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.78</p>
                  </c>
                  <c ca="left">
                     <p>KOG3229</p>
                  </c>
                  <c ca="left">
                     <p>YKL041w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.77</p>
                  </c>
                  <c ca="left">
                     <p>KOG3438</p>
                  </c>
                  <c ca="left">
                     <p>YNL113w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XIV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.77</p>
                  </c>
                  <c ca="left">
                     <p>KOG1069</p>
                  </c>
                  <c ca="left">
                     <p>YGR095c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.76</p>
                  </c>
                  <c ca="left">
                     <p>KOG3364</p>
                  </c>
                  <c ca="left">
                     <p>YIL065c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IX</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.76</p>
                  </c>
                  <c ca="left">
                     <p>KOG0989</p>
                  </c>
                  <c ca="left">
                     <p>YJR068w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.75</p>
                  </c>
                  <c ca="left">
                     <p>KOG3911</p>
                  </c>
                  <c ca="left">
                     <p>YDR087c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.73</p>
                  </c>
                  <c ca="left">
                     <p>KOG3104</p>
                  </c>
                  <c ca="left">
                     <p>YDR005c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.73</p>
                  </c>
                  <c ca="left">
                     <p>KOG0304</p>
                  </c>
                  <c ca="left">
                     <p>YNR052c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.73</p>
                  </c>
                  <c ca="left">
                     <p>KOG3341</p>
                  </c>
                  <c ca="left">
                     <p>YPL002c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.72</p>
                  </c>
                  <c ca="left">
                     <p>KOG3059</p>
                  </c>
                  <c ca="left">
                     <p>YPL076w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.71</p>
                  </c>
                  <c ca="left">
                     <p>KOG3259</p>
                  </c>
                  <c ca="left">
                     <p>YJR017c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.71</p>
                  </c>
                  <c ca="left">
                     <p>KOG3313</p>
                  </c>
                  <c ca="left">
                     <p>YGR078c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.70</p>
                  </c>
                  <c ca="left">
                     <p>KOG1750</p>
                  </c>
                  <c ca="left">
                     <p>YNR036c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.70</p>
                  </c>
                  <c ca="left">
                     <p>KOG0396</p>
                  </c>
                  <c ca="left">
                     <p>YIL097w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IX</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.70</p>
                  </c>
                  <c ca="left">
                     <p>KOG3240</p>
                  </c>
                  <c ca="left">
                     <p>YPR113w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.69</p>
                  </c>
                  <c ca="left">
                     <p>KOG1173</p>
                  </c>
                  <c ca="left">
                     <p>YKL022c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.68</p>
                  </c>
                  <c ca="left">
                     <p>KOG2626</p>
                  </c>
                  <c ca="left">
                     <p>YLR015w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.66</p>
                  </c>
                  <c ca="left">
                     <p>KOG1299</p>
                  </c>
                  <c ca="left">
                     <p>YGL095c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>VII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.65</p>
                  </c>
                  <c ca="left">
                     <p>KOG3327</p>
                  </c>
                  <c ca="left">
                     <p>YJR057w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.62</p>
                  </c>
                  <c ca="left">
                     <p>KOG1746</p>
                  </c>
                  <c ca="left">
                     <p>YOR103c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.61</p>
                  </c>
                  <c ca="left">
                     <p>KOG3159</p>
                  </c>
                  <c ca="left">
                     <p>YJL046w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.56</p>
                  </c>
                  <c ca="left">
                     <p>KOG0325</p>
                  </c>
                  <c ca="left">
                     <p>YLR239c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.50</p>
                  </c>
                  <c ca="left">
                     <p>KOG3063</p>
                  </c>
                  <c ca="left">
                     <p>YJL053w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>X</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.50</p>
                  </c>
                  <c ca="left">
                     <p>KOG0282</p>
                  </c>
                  <c ca="left">
                     <p>YDR364c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.48</p>
                  </c>
                  <c ca="left">
                     <p>KOG2874</p>
                  </c>
                  <c ca="left">
                     <p>YCL059c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>III</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.44</p>
                  </c>
                  <c ca="left">
                     <p>KOG4017</p>
                  </c>
                  <c ca="left">
                     <p>YMR201c</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>XIII</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.36</p>
                  </c>
                  <c ca="left">
                     <p>KOG3228</p>
                  </c>
                  <c ca="left">
                     <p>YDR163w</p>
                  </c>
                  <c ca="left">
                     <p>viable</p>
                  </c>
                  <c ca="left">
                     <p>IV</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.35</p>
                  </c>
                  <c ca="left">
                     <p>KOG0551</p>
                  </c>
                  <c ca="left">
                     <p>YBR155w</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>II</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.24</p>
                  </c>
                  <c ca="left">
                     <p>KOG0285</p>
                  </c>
                  <c ca="left">
                     <p>YPL151c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>XVI</p>
                  </c>
               </r>
               <r>
                  <c ca="left">
                     <p>0.08</p>
                  </c>
                  <c ca="left">
                     <p>KOG2441</p>
                  </c>
                  <c ca="left">
                     <p>YAL032c</p>
                  </c>
                  <c ca="left">
                     <p>inviable</p>
                  </c>
                  <c ca="left">
                     <p>I</p>
                  </c>
               </r>
            </tblbdy>
         </tbl>
         <p>Comparing the CCC values of a 531 &#215; 531 distance matrices analyzed before <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> using Pearson's correlation, indicated that KOG2671 represents the single copy protein with the highest correlation value of 0.96 (Additional file <supplr sid="S2">2</supplr>). This KOG2671 protein (putative RNA methylase KOG annotation) corresponds to ORF YOL124c of <it>S. cerevisiae </it>[Catalytic subunit of an adoMet-dependent tRNA methyltransferase complex (Trm11p-Trm112p), required for the methylation of the guanosine nucleotide at position 10 (m2G10) in tRNAs; contains a THUMP domain and a methyltransferase domain]. The CCC values of the remaining 69 single copy KOGs were compared with that of KOG2671. Any of the subsequent five single protein KOGs present in the list of 531 KOG proteins <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, namely KOG2728 (Uncharacterized conserved protein with similarity to phosphopantothenoylcysteine synthetase/decarboxylase), KOG0991 (Replication factor C, subunit RFC2), KOG0340, (ATP-dependent RNA helicase), KOG0809 (SNARE protein TLG2/Syntaxin 16), and KOG3786 (RNA polymerase II assessory factor Cdc73p), could be used as a starting point for this comparison, because the correlation values ranged between 0.95 and 0.96 (Additional file <supplr sid="S2">2</supplr>). The correlation values between the distance matrix of KOG2671 and that of each of the remaining 69 KOG proteins ranged from 0.08 to 0.93 (Table <tblr tid="T1">1</tblr>), and were statistically significant (Additional file <supplr sid="S3">3</supplr>). The majority of the KOGs (i.e. 64 from 70 KOGs) gave correlation values higher than 0.50 (Table <tblr tid="T1">1</tblr>). As an example, we constructed a phylogenetic tree based on concatenation of these 64 KOGs (Fig. <figr fid="F1">1</figr>), which is in accordance with previously published trees. Four KOGs gave CCC values below 0.36 (Table <tblr tid="T1">1</tblr>), thus indicating that they have different phylogenetic signals. This is sustained by the resulting phylogenetic tree showing a different topology (Additional file <supplr sid="S4">4</supplr>) if compared with that based on 64 KOGs (Fig. <figr fid="F1">1</figr>). For instance, the Pezizomycotina formed a sister clade to the Basidiomycetes and, <it>S. pombe </it>occured as a basal lineage to both of them, but without statistical support (Additional file <supplr sid="S4">4</supplr>).</p>
         <suppl id="S2">
            <title>
               <p>Additional file 2</p>
            </title>
            <text>
               <p>Correlation values of the comparison between KOGs distance matrices from a set of 531 proteins used in the study of Kuramae et al., 2006 <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
            </text>
            <file name="1471-2148-7-134-S2.XLS">
               <p>Click here for file</p>
            </file>
         </suppl>
         <suppl id="S3">
            <title>
               <p>Additional file 3</p>
            </title>
            <text>
               <p>Correlation values obtained by Pearson's correlation by comparing the distance matrix of KOG2671 with each distance matrix of 69 KOGs, non-directional probability and T- distribution.</p>
            </text>
            <file name="1471-2148-7-134-S3.XLS">
               <p>Click here for file</p>
            </file>
         </suppl>
         <fig id="F1">
            <title>
               <p>Figure 1</p>
            </title>
            <caption>
               <p>Phylogenetic relationship of 33 complete fungal genomes</p>
            </caption>
            <text>
               <p>Phylogenetic relationship of 33 complete fungal genomes. The same tree topology is given by concatenation of 30, 40, 50, 60 and 64 KOG proteins with correlation values >0.50 when compared to reference KOG2671 distance matrix. Asp. = Aspergillus, Can. = Candida, Cry. = Cryptococcus, Sac. = Saccharomyces, Ash. = Ashbya. Phyla: I = Ascomycota, II = Basidiomycota, III = Rhyzomycota. Subphyla: IA = Saccharomycotina, IB = Pezizomycotina, IC = Taphrinamycotina, IIA = Agaricomycotina, IIB = Ustilaginomycotina, IIIA = Mucormycotina. IB1 = Sardariomycetes, IB2 = Letiomycetes, IB3 = Eurotiomycetes, IB4 = Dothideomycetes. Support values indicated on the branches were obtained by bootstrap analysis using 100 replicates. * indicates support values of 98&#8211;100%.</p>
            </text>
            <graphic file="1471-2148-7-134-1"/>
         </fig>
         <suppl id="S4">
            <title>
               <p>Additional file 4</p>
            </title>
            <text>
               <p>Phylogenetic tree based on concatenation of four KOG proteins with correlation below 0.36. The four KOGs distances matrices were compared with KOG2671 reference distance matrix.</p>
            </text>
            <file name="1471-2148-7-134-S4.ppt">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>Among the KOG proteins with CCC values above 0.50, are many proteins involved in cellular processes and signaling. The other tree KOG categories <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, namely information storage and processing, metabolism, and poorly characterized categories seem to be less informative (Fig. <figr fid="F2">2</figr>). When the KOG proteins are concatenated in increasing numbers (e.g. the 10 with the highest CCC values; the 20 with the highest CCC values and so on) it can be seen that the CCC values remains above 0.8 until 44 proteins have been concatenated (Fig. <figr fid="F2">2</figr>). Thereafter, the CCC values showed a sharp decline, indicating that the KOG proteins 44&#8211;64 have different phylogenetic signals. Interestingly, the topology of the phylogenetic trees stabilizes after the concatenation of 40 proteins (Additional file <supplr sid="S5">5</supplr>). After concatenation of only 10 and 20 proteins the lineages with <it>C. glabrata, S. kluyveri, K. lactis </it>and <it>A. gossypii</it>, and that of <it>C. lusitaniae, D. hansenii, C. guilliermondii </it>and <it>C. albicans</it>, and finally the Euascomycete lineage of <it>C. globosum, N. crassa, M. grisea </it>and <it>F. graminearum </it>showed varying topologies (Additional file <supplr sid="S5">5</supplr>). Bootstrap values of most branches were high irrespective the number of proteins concatenated (Fig. <figr fid="F1">1</figr>, Additional file <supplr sid="S5">5</supplr>). However, for two branches, labeled 7 and 9 in Additional file <supplr sid="S5">5</supplr>, that received lower bootstrap values, the maximum value (85%) was obtained after concatenation of 40 KOG proteins. The <it>A. gossypii-K. lactis-Sac. kluyveri </it>lineage (labeled as branches 4 and 5 in Additional file <supplr sid="S5">5</supplr>) received only low support, and this was even true after concatenation of 531 orthologues <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. This most likely indicates that further improvement can only be obtained by further species sampling in this lineage. Summarizing we estimate that 40&#8211;45 concatenated single copy protein KOGs are needed to fully resolve fungal TOL. Below this number the tree topology may be different, and above this number the CCC values as well as the support values tend to drop.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Graph representing the number of concatenated KOGs (x-axis) per functional KOG category (information storage and processing; cellular processes and signaling; metabolism; poorly characterized), and the correlation values between KOG2671 distance matrix and each distance matrix of the 70 KOGs (right y-axis)</p>
            </caption>
            <text>
               <p>Graph representing the number of concatenated KOGs (x-axis) per functional KOG category (information storage and processing; cellular processes and signaling; metabolism; poorly characterized), and the correlation values between KOG2671 distance matrix and each distance matrix of the 70 KOGs (right y-axis). The left y-axis illustrates the cumulative values of each KOG functional category when they are concatenated. The corresponding KOG protein number in x-axis is listed in the Table 1 and the corresponding functional category is in Supplemental Table 1.</p>
            </text>
            <graphic file="1471-2148-7-134-2"/>
         </fig>
         <suppl id="S5">
            <title>
               <p>Additional file 5</p>
            </title>
            <text>
               <p>Phylogenetic trees based on concatenation of 20, 30, 40, 50, 60 and 64 KOG proteins with correlation values higher than 0.50. Branches with different topologies obtained after concatenation of 10 or 20 proteins are indicated separately. Support values indicated on the branches were obtained by bootstrap analysis using 100 replicates. * indicates support values of 98&#8211;100%. In the Table support values obtained after 10, 20, 30, 40, 50, 60 and 64 proteins (from left to right) are indicated for those branches (labeled 1&#8211;12 in tree and table) that received support &lt; 98% in at least one of the sets analyzed are indicated. Note the low to moderate support for lineages 4, 5, 7 and 9. Overall the bootstrap values tend to increase until concatenation of 40 to 50 KOG proteins, but this is also lineage dependent (compare e.g. lineages 7, 9 and 12).</p>
            </text>
            <file name="1471-2148-7-134-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <sec>
            <st>
               <p>Reevaluating fungal TOL</p>
            </st>
            <p>In all phylogenetic trees using 10&#8211;64 concatenated single KOG proteins, the clades I, II and III correspond to the Ascomycota, Basidiomycota and Zygomycota phyla, respectively (Fig. <figr fid="F1">1</figr>, Additional file <supplr sid="S5">5</supplr>), thus agreeing with analyses using a supertree method <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, a super alignment using restricted orthology <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, and concatenation of six genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, and 153 <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and 531 proteins <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, respectively. Not surprisingly, the Ascomycota formed a sister clade to the Basidiomycota, with the Zygomycota forming a basal lineage.</p>
            <p>The Ascomycota are well represented because of the number of available sequenced genomes, and is subdivided into subphyla Pezizomycotina, Saccharomycotina and Taphrinamycotina (Fig. <figr fid="F1">1</figr>). The Saccharomycotina (clade IA) formed a sister group to the Pezizomycotina (clade IB), with Taphrinamycotina (clade IC) forming a basal lineage to both (Fig. <figr fid="F1">1</figr>). The resolution of the Saccharomycotina and Pezizomycotina is in agreement with previous phylogenomic analyses <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B16">16</abbr><abbr bid="B21">21</abbr></abbrgrp>.</p>
            <p>The phylogenetic structure of the subphylum Saccharomycotina in our tree (Fig. <figr fid="F1">1</figr>) is similar to that based on a combination of 153 protein families <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, but slightly differs from that based on an analysis using six combined genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Noticeable differences are the positions of <it>D. hansenii</it>, <it>C. guilliermondii</it>, <it>C. lusitaniae </it>and <it>C. albicans</it>. In our analysis and the study of Fitzpatrick et al. <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, these four species formed a single cluster (Fig. <figr fid="F1">1</figr>), while in the six-gene analysis <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, <it>C. albicans </it>clusters with <it>C. guilliermondii</it>, and <it>D. hansenii </it>with <it>C. lusitaniae</it>.</p>
            <p>Within the Saccharomycotina, seven species evolved after WGD <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, namely <it>S. cerevisiae</it>, <it>S. bayanus</it>, <it>S. castellii</it>,<it>S. kudriavzevii</it>, <it>S. mikatae</it>, <it>S. paradoxus </it>and <it>C. glabrata</it>. The basal position of <it>C. glabrata </it>among these species agrees with results from Fitzpatrick et al. <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>, but only after removal of fast evolving site classes in their dataset. The phylogenetic structure of the <it>Saccharomyces sensu stricto </it>species, <it>S. cerevisiae</it>, <it>S. paradoxus</it>, <it>S. mikatae</it>, <it>S. kudriavzevii </it>and <it>S. bayanus </it>corroborated with previous results of Rokas et al. <abbrgrp><abbr bid="B12">12</abbr></abbrgrp> and Kuramae et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, but was found to be somewhat different if compared with data obtained by comparative genomic hybridization (CGH) <abbrgrp><abbr bid="B22">22</abbr></abbrgrp> and a four-gene analysis <abbrgrp><abbr bid="B8">8</abbr></abbrgrp> (Additional file <supplr sid="S6">6</supplr>). In the CGH study the positions of <it>S. mikatae </it>and <it>S. kudriavzevii </it>differ, whereas in the four-gene analysis <it>S. cerevisiae</it>, <it>S. paradoxus </it>and <it>S. mikatae </it>occupied different positions.</p>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p>Topological differences in phylogenetic trees of the <it>Saccharomyces sensu stricto </it>lineage as inferred from various publications.</p>
               </text>
               <file name="1471-2148-7-134-S6.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The subphylum Pezizomycotina is divided into four clades: Sordariomycetes (clade IB1), Leotiomycetes (clade IB2), Eurotiomycetes (clade IB3) and Dothideomycetes (clade IB4) (Fig. <figr fid="F1">1</figr>). The phylogenetic positions of the Sordariomycetes, Leotiomycetes and Dothiomycetes have been a matter of controversy. According to our analysis, the Sordariomycetes and Leotiomycetes are sister clades, which is in agreement with other studies <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B16">16</abbr><abbr bid="B23">23</abbr></abbrgrp>, although the tree in the latter study was only weakly supported. All these results are, however, in disagreement with data resulting from a four-gene analysis <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, in which the Dothideomycetes occurred as a sister clade to the Sordariomycetes. The position of <it>Stagonospora nodorum </it>(Dothiomycetes) as a basal lineage in the Pezizomycetes is highly supported in our analysis (> 90% bootstrap) (Fig. <figr fid="F1">1</figr>, Additional file <supplr sid="S5">5</supplr>) and corroborates with data from James et al. <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> and Robbertse et al. <abbrgrp><abbr bid="B17">17</abbr></abbrgrp> who used maximum parsimony. However, in analyses based on a supertree method, and 153 concatenated proteins <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and a four-gene analysis <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>, <it>S. nodorum </it>was found to be positioned next to the Eurotiomycetes <abbrgrp><abbr bid="B9">9</abbr><abbr bid="B16">16</abbr><abbr bid="B21">21</abbr></abbrgrp> or closely to the Sordariomycetes and Leotiomycetes <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>.</p>
            <p>All analyses using concatenated proteins with CCC values above 0.50 (Fig. <figr fid="F1">1</figr>, Additional file <supplr sid="S5">5</supplr>) positioned <it>S. pombe </it>(Taphrinomycotina) as a basal lineage within the phylum Ascomycota, which is in concordance with many other studies <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B21">21</abbr></abbrgrp> using different sets of genes or orthologous proteins and different methods of analysis <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. However, in another study <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, part of the concatenated orthologues resulted in a different position, which was explained by assuming a different evolutionary origin of these proteins.</p>
            <p>The topology of the few basidiomycetous species included, representing only two subphyla Agaricomycotina (clade IIA with <it>Coprinopsis cinerea</it>, <it>Phanerochaete chrysosporium</it>, <it>Cryptococcus neoformans </it>var. <it>neoformans</it>, <it>C. neoformans </it>var. <it>grubii</it>) and Ustilaginomycotina (clade IIB with <it>Ustilago maydis</it>) (Fig. <figr fid="F1">1</figr>, Additional file <supplr sid="S5">5</supplr>) corroborates with previous studies <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B16">16</abbr></abbrgrp>.</p>
            <p>Our method of protein selection using CCC values of individual protein distance matrices seems an useful approach as the resulting phylogenetic trees are largely in agreement with those published elsewhere, and, importantly, most of the branches are well supported. The resulting selection of proteins may also be used to analyze the majority of fungal species for which a full genome is not yet available in order to improve our understanding of fungal TOL.</p>
            <p>The performance of our method, if compared to the recent AFTOL study <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, was assessed by comparing CCC values between the protein distance matrix of reference KOG2671 and that based on the combined data set of six AFTOL genes. The correlation value obtained was 0.73, thus indicating that our reference protein has a rather similar phylogenetic signal if compared to the AFTOL genes. However, the inclusion of more genes increases the phylogenetic signal as demonstrated in our analysis (Fig. <figr fid="F1">1</figr>, Additional file <supplr sid="S5">5</supplr>), which may contribute to the resolution of discordant branches, such as that of <it>A. gossypii-K. lactis-S. kluyveri </it>clade.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>In short, the set of proteins resulting from our studies presents a good selection to be elaborated in further studies on fungal TOL, which may include many non-sequenced species. As the proteins were selected across the fungal kingdom and because they represent single KOG proteins, they may also be suitable for the development of molecular barcodes. This proposed method is universal and can be extended easily to bacterial and archaeal TOLs as well as other eukaryote lineages of TOL.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Assignment of genomes to KOG</p>
            </st>
            <p>In this study we used the complete genomes of 33 fungal and one metazoa (<it>Caenorhabditis elegans</it>) (Table <tblr tid="T2">2</tblr>). The group orthology framework presented in the KOG database <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> was the basis of our analyses. KOGs of <it>Caenorhabditis. elegans</it>, <it>Saccharomyces cerevisiae </it>S288c and <it>Schizosaccharomyces pombe </it>were obtained from the KOG database <abbrgrp><abbr bid="B24">24</abbr></abbrgrp>. Thirty one proteomes (<it>Ashbya gossypii</it>, <it>Aspergillus fumigatus, Asp. nidulans, Botritys cinerea</it>, <it>Candida albicans</it>, <it>Can. glabrata</it>, <it>Can. guilliermondii, Can. lusitaniae, Chaetomium globosum, Coccidioides immitis, Coprinopsis cinerea, Cryptococcus neoformans </it>var. <it>neoformans</it>, <it>Cryp. neoformans </it>var.<it>grubii</it>, <it>Debaryomyces hansenii</it>, <it>Fusarium graminearum</it>, <it>Kluyveromyces lactis</it>, <it>Magnaporthe grisea</it>, <it>Neurospora crassa</it>, <it>Phanerochaete chrysosporium, Rhizopus oryzae, Saccharomyces cerevisiae </it>RM11-1a, <it>Sac. bayanus</it>, <it>Sac. castellii</it>, <it>Sac. kluyveri</it>, <it>Sac. kudriavzevii</it>, <it>Sac. mikatae</it>, <it>Sac. paradoxus</it>, <it>Sclerotinia sclerotiorum, Stagonospora nodorum, Ustilago maydis </it>and <it>Yarrowia lipolytica </it>were assigned for orthologies using the STRING program as described before <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Genome sources, genome size (Mb), number of KOGs assigned to each genome used in the study</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Genome</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Strain</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Genome size (Mb)</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Number of KOG</b>
                        </p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Location</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ashbya gossypii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATCC10895</p>
                     </c>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>2,592</p>
                     </c>
                     <c ca="left">
                        <p>Zoologisches Institut der Univ. Basel, Switzerland</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus fumigatus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Af293</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>3,182</p>
                     </c>
                     <c ca="left">
                        <p>TIGR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Aspergillus nidulans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>FGSC A4</p>
                     </c>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>2,982</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Botritis cinerea</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>B05.10</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>3,191</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Caenorhabditis elegans</it>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>100</p>
                     </c>
                     <c ca="left">
                        <p>4,235</p>
                     </c>
                     <c ca="left">
                        <p>Welcome Trust Sanger Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida albicans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>SC5314</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>2,636</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida glabrata</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CBS138</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>2,505</p>
                     </c>
                     <c ca="left">
                        <p>Genolevures</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida guilliermondii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATCC6260</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>2,750</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Candida lusitaniae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>ATCC42720</p>
                     </c>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>2,742</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Chaetomium globosum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CBS148.51</p>
                     </c>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>3,144</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Coccidioides immitis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RS</p>
                     </c>
                     <c ca="left">
                        <p>28.78</p>
                     </c>
                     <c ca="left">
                        <p>3,137</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Coprinopsis cinereus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Okayama 7 (#130).</p>
                     </c>
                     <c ca="left">
                        <p>37.5</p>
                     </c>
                     <c ca="left">
                        <p>3,210</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Cryptococcus neoformans var. neoformans</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>JEC21</p>
                     </c>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>2,876</p>
                     </c>
                     <c ca="left">
                        <p>TIGR</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Cryptococcus neoformans var. grubii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>H99</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>3,074</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Debaryomyces hansenii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CBS767</p>
                     </c>
                     <c ca="left">
                        <p>12.22</p>
                     </c>
                     <c ca="left">
                        <p>2,760</p>
                     </c>
                     <c ca="left">
                        <p>Genolevures</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Fusarium graminearum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>PH-1 (NRRL 31084)</p>
                     </c>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>3,063</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Kluyveromyces lactis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CLIB210</p>
                     </c>
                     <c ca="left">
                        <p>10.69</p>
                     </c>
                     <c ca="left">
                        <p>2,596</p>
                     </c>
                     <c ca="left">
                        <p>Genolevures</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Magnaporthe grisea</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>70-15</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>2,917</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Neurospora crassa</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>N-150</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>2,962</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Phanerochaete chrysosporium</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RP78</p>
                     </c>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>2,945</p>
                     </c>
                     <c ca="left">
                        <p>DOE Joint Genome Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Rhizopus oryzae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RA99&#8211;880</p>
                     </c>
                     <c ca="left">
                        <p>40</p>
                     </c>
                     <c ca="left">
                        <p>3,310</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces bayanus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>MCYC623</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>2,560</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces castellii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NRRL Y-12630</p>
                     </c>
                     <c ca="left">
                        <p>10.2</p>
                     </c>
                     <c ca="left">
                        <p>2,390</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces cerevisiae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>RM11-1a</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>2,665</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces cerevisiae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>S288c</p>
                     </c>
                     <c ca="left">
                        <p>12.07</p>
                     </c>
                     <c ca="left">
                        <p>2,668</p>
                     </c>
                     <c ca="left">
                        <p>Welcome Trust Sanger Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces kluyveri</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NRRL Y-12651</p>
                     </c>
                     <c ca="left">
                        <p>10.2</p>
                     </c>
                     <c ca="left">
                        <p>1,747</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces kudriavzevii</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>IFO1802</p>
                     </c>
                     <c ca="left">
                        <p>10.6</p>
                     </c>
                     <c ca="left">
                        <p>1,855</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces mikatae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>IFO1815</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>2,557</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces paradoxus</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>NRRLY-17217</p>
                     </c>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>2,592</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Saccharomyces cerevisiae</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>S288C</p>
                     </c>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>2,668</p>
                     </c>
                     <c ca="left">
                        <p>Stanford University</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Schizosaccharomyces pombe</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>Urs Leupold 972 h<sup>-</sup></p>
                     </c>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>2,762</p>
                     </c>
                     <c ca="left">
                        <p>Welcome Trust Sanger Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Sclerotinia sclerotiorum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>1980</p>
                     </c>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>3,219</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Stagonospora nodorum</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>SN15</p>
                     </c>
                     <c ca="left">
                        <p>37.1</p>
                     </c>
                     <c ca="left">
                        <p>3,324</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Ustilago maydis</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>521</p>
                     </c>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>2,850</p>
                     </c>
                     <c ca="left">
                        <p>Broad Institute</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>
                           <it>Yarrowia lipolytica</it>
                        </p>
                     </c>
                     <c ca="left">
                        <p>CLIB99</p>
                     </c>
                     <c ca="left">
                        <p>20&#8211;21</p>
                     </c>
                     <c ca="left">
                        <p>2,699</p>
                     </c>
                     <c ca="left">
                        <p>Genolevures</p>
                     </c>
                  </r>
               </tblbdy>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Comparison of KOGs represented by single protein</p>
            </st>
            <p>In order to avoid problems of paralogy we selected only those 70 KOGs represented by a single protein shared by 33 complete fungal genomes. First, each protein from the list of the KOGs that fulfilled this criterion was aligned by Clustal X <abbrgrp><abbr bid="B26">26</abbr></abbrgrp>. Second, poorly aligned positions and divergent regions in each KOG alignment were removed by using Gblocks 0.91b <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. The threshold parameters used were: minimum number of sequences for a conserved position = 50% of the number of sequences + 1, minimum number of sequences for a flank position = 85% of the number of sequences, maximum number of contiguous nonconserved positions = 8, minimum length of a block = 10, not allowed gap positions, use similarity matrices. Third, the distance matrix (percent divergence) of each KOG protein was calculated between all pairs of sequences from a multiple alignment of each KOG. Finally, each KOG protein distance matrix was compared to each other (70 &#215; 70) by Pearson's correlation.</p>
         </sec>
         <sec>
            <st>
               <p>Selection of the reference KOG distance matrix</p>
            </st>
            <p>The distance matrices of the 531 KOGs used by Kuramae et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> were calculated. Then, the correlation matrix values between distance matrices were determined by Pearson's correlation as described. To find the KOG distance matrix to be used as reference we selected the single copy KOG protein with the highest correlation value. This reference distance matrix was then compared to the distance matrices of the remaining 69 KOGs selected.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>KOG distance matrices with correlation values higher than 0.50 when compared to the reference KOG distance matrix were concatenated, aligned, the poorly aligned regions removed, and a phylogenetic analysis was done by Maximum Likelihood <b>(</b>PHYML) <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. The amino acid model substitution used was JTT <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>. The number of substitution rate categories was 2. The model of rate heterogeneity was Gamma distribution rates with 4 categories. We used <it>Caenorhabditis elegans </it>as outgroup for all phylogenetic trees reconstructions. Groups of 10, 20, 30, 40, 50, 60 and 64 KOGs protein according to decreasing cophenetic correlation values were selected, subsequently used to build phylogenetic trees, and their support values assessed using 100 replicates.</p>
         </sec>
         <sec>
            <st>
               <p>Comparison KOG reference and AFTOL combined genes</p>
            </st>
            <p>For this comparison we used 24 genomes present in AFTOL for which entire genome data are available to calculate the distance matrix of the alignment from AFTOL <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. The six combined genes distance matrix from AFTOL and the distance matrix of our reference KOG2671 were compared by Pearson's correlation.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Abbreviations</p>
         </st>
         <p>KOG: Clusters of orthologous groups for eukaryotic complete genomes.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>EEK participated in the design of the study, analyses and drafted the manuscript.</p>
         <p>VR participated in the design of the study, analyses and drafted the manuscript.</p>
         <p>CEE participated in the sequence alignment and drafted the manuscript.</p>
         <p>TB participated in the design of the study, analyses and drafted the manuscript.</p>
         <p>All authors contributed to the final manuscript preparation.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the Renewal Fund of the Royal Netherlands Academy of Arts and Sciences (RNAAS &#8211; KNAW) (EEK), and the European Union Marie Curie International Reintegration Grant (MIRG-CT-2006-036584) (CAA).</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Molecular evidence for an ancient duplication of the entire yeast genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Shields</snm>
                  <fnm>DC</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>1997</pubdate>
            <volume>387</volume>
            <fpage>708</fpage>
            <lpage>713</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/42711</pubid>
                  <pubid idtype="pmpid" link="fulltext">9192896</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Parallel inactivation of multiple GAL pathway genes and ecological diversification in yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Hittinger</snm>
                  <fnm>CT</fnm>
               </au>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci USA</source>
            <pubdate>2004</pubdate>
            <volume>101</volume>
            <fpage>14144</fpage>
            <lpage>14149</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">521130</pubid>
                  <pubid idtype="pmpid" link="fulltext">15381776</pubid>
                  <pubid idtype="doi">10.1073/pnas.0404319101</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>The RNA polymerase III-dependent family of genes in hemiascomycetes: comparative RNomics, decoding strategies, transcription and evolutionary implications</p>
            </title>
            <aug>
               <au>
                  <snm>Marck</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Kachouri-Lafond</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lafontaine</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Westhof</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Dujon</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Grosjean</snm>
                  <fnm>HT</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2006</pubdate>
            <volume>34</volume>
            <fpage>1816</fpage>
            <lpage>1835</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1447645</pubid>
                  <pubid idtype="pmpid" link="fulltext">16600899</pubid>
                  <pubid idtype="doi">10.1093/nar/gkl085</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Comparative genomics and genome evolution in yeasts</p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>KH</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>2006</pubdate>
            <volume>361</volume>
            <fpage>403</fpage>
            <lpage>412</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rstb.2005.1799</pubid>
                  <pubid idtype="pmpid" link="fulltext">16524829</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Comparative genomics of <it>Pneumocystis carinii </it>with other protists: implications for life style</p>
            </title>
            <aug>
               <au>
                  <snm>Cushion</snm>
                  <fnm>MT</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>51</volume>
            <fpage>30</fpage>
            <lpage>37</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2004.tb00158.x</pubid>
                  <pubid idtype="pmpid">15068263</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The dawn of fungal pathogen genomics</p>
            </title>
            <aug>
               <au>
                  <snm>Xu</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>YL</fnm>
               </au>
               <au>
                  <snm>Dickman</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Sharon</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Annu Rev Phytopathol</source>
            <pubdate>2006</pubdate>
            <volume>44</volume>
            <fpage>337</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.phyto.44.070505.143412</pubid>
                  <pubid idtype="pmpid" link="fulltext">16704358</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The full-length phylogenetic tree from 1551 ribosomal sequences of chitinous fungi, fungi</p>
            </title>
            <aug>
               <au>
                  <snm>Tehler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Little</snm>
                  <fnm>DP</fnm>
               </au>
               <au>
                  <snm>Farris</snm>
                  <fnm>JS</fnm>
               </au>
            </aug>
            <source>Mycol Res</source>
            <pubdate>2003</pubdate>
            <volume>107</volume>
            <fpage>901</fpage>
            <lpage>916</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0953756203008128</pubid>
                  <pubid idtype="pmpid">14531615</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Phylogenetic relationships among yeasts of the <it>Saccharomyces </it>complex determined from multigene sequence analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Kurtzman</snm>
                  <fnm>CP</fnm>
               </au>
               <au>
                  <snm>Robnett</snm>
                  <fnm>CJ</fnm>
               </au>
            </aug>
            <source>FEMS Yeast Res</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>417</fpage>
            <lpage>432</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1567-1356(03)00012-6</pubid>
                  <pubid idtype="pmpid" link="fulltext">12748053</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Assembling the fungal tree of life:Progress, classification and evolution of subcellular traits</p>
            </title>
            <aug>
               <au>
                  <snm>Lutzoni</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Kauff</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>McLaughlin</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Celio</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Dentinger</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Padamsee</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hibbett</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Baloch</snm>
                  <fnm>E</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Bot</source>
            <pubdate>2004</pubdate>
            <volume>91</volume>
            <fpage>1446</fpage>
            <lpage>1480</lpage>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Reconstructing the early evolution of Fungi using a six-gene phylogeny</p>
            </title>
            <aug>
               <au>
                  <snm>James</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Kauff</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Schoch</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Matheny</snm>
                  <fnm>PB</fnm>
               </au>
               <au>
                  <snm>Hofstetter</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Cox</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Celio</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Gueidan</snm>
                  <fnm>C</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2006</pubdate>
            <volume>443</volume>
            <fpage>818</fpage>
            <lpage>822</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05110</pubid>
                  <pubid idtype="pmpid" link="fulltext">17051209</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Genome trees constructed using five different approaches suggest new major bacterial clades</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>IB</fnm>
               </au>
               <au>
                  <snm>Grishin</snm>
                  <fnm>NV</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2001</pubdate>
            <volume>1</volume>
            <fpage>8</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">60490</pubid>
                  <pubid idtype="pmpid" link="fulltext">11734060</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-1-8</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genome-scale approaches to resolving incongruence in molecular phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Rokas</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>King</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Carroll</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2003</pubdate>
            <volume>425</volume>
            <fpage>798</fpage>
            <lpage>804</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02053</pubid>
                  <pubid idtype="pmpid" link="fulltext">14574403</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Coelomata and not Ecdysozoa: evidence from genome-wide phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Wolf</snm>
                  <fnm>YI</fnm>
               </au>
               <au>
                  <snm>Rogozin</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>29</fpage>
            <lpage>36</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">314272</pubid>
                  <pubid idtype="pmpid" link="fulltext">14707168</pubid>
                  <pubid idtype="doi">10.1101/gr.1347404</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Phylogenomics reveal a robust fungal tree of life</p>
            </title>
            <aug>
               <au>
                  <snm>Kuramae</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Wei&#223;</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Boekhout</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>FEMS Yeast Res</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>1213</fpage>
            <lpage>1220</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1567-1364.2006.00119.x</pubid>
                  <pubid idtype="pmpid">17156018</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Conflicting phylogenetic position of <it>Schizosaccharomyces pombe</it></p>
            </title>
            <aug>
               <au>
                  <snm>Kuramae</snm>
                  <fnm>EE</fnm>
               </au>
               <au>
                  <snm>Robert</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Boekhout</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Genomics</source>
            <pubdate>2006</pubdate>
            <volume>88</volume>
            <fpage>387</fpage>
            <lpage>393</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ygeno.2006.07.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">16904286</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>A fungal phylogeny based on 42 complete genomes derived from supertree and combined gene analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Fitzpatrick</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Logue</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Stajich</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Butler</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1679813</pubid>
                  <pubid idtype="pmpid" link="fulltext">17121679</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-6-99</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A phylogenomic analysis of the Ascomycota</p>
            </title>
            <aug>
               <au>
                  <snm>Robbertse</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Reeves</snm>
                  <fnm>JB</fnm>
               </au>
               <au>
                  <snm>Schoch</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Spatafora</snm>
                  <fnm>JW</fnm>
               </au>
            </aug>
            <source>Fungal Genet Biol</source>
            <pubdate>2006</pubdate>
            <volume>43</volume>
            <fpage>715</fpage>
            <lpage>725</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.fgb.2006.05.001</pubid>
                  <pubid idtype="pmpid" link="fulltext">16781175</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>The magnitude of fungal diversity: the 1.5 million species estimate revised</p>
            </title>
            <aug>
               <au>
                  <snm>Hawksworth</snm>
                  <fnm>DL</fnm>
               </au>
            </aug>
            <source>Mycol Res</source>
            <pubdate>2001</pubdate>
            <volume>109</volume>
            <fpage>1422</fpage>
            <lpage>1432</lpage>
         </bibl>
         <bibl id="B19">
            <url>http://www.yeastgenome.org/</url>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The COG database: an updated version includes eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Fedorova</snm>
                  <fnm>ND</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Jacobs</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Kiryutin</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Krylov</snm>
                  <fnm>DM</fnm>
               </au>
               <au>
                  <snm>Mazumder</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mekhedov</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Nikolskaya</snm>
                  <fnm>AN</fnm>
               </au>
               <etal/>
            </aug>
            <source>BMC Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <fpage>41</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">222959</pubid>
                  <pubid idtype="pmpid" link="fulltext">12969510</pubid>
                  <pubid idtype="doi">10.1186/1471-2105-4-41</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Assessment of phylogenomic and orthology approaches for phylogenetic inference</p>
            </title>
            <aug>
               <au>
                  <snm>Dutihl</snm>
                  <fnm>BE</fnm>
               </au>
               <au>
                  <snm>van Noort</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>van der Heijden</snm>
                  <fnm>RTJM</fnm>
               </au>
               <au>
                  <snm>Boekhout</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <fpage>815</fpage>
            <lpage>224</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm015</pubid>
                  <pubid idtype="pmpid" link="fulltext">17237036</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Comparative genomic hybridization provides new insights into the molecular taxonomy of the <it>Saccharomyces sensu stricto </it>complex</p>
            </title>
            <aug>
               <au>
                  <snm>Edwards-Ingram</snm>
                  <fnm>LC</fnm>
               </au>
               <au>
                  <snm>Gent</snm>
                  <fnm>ME</fnm>
               </au>
               <au>
                  <snm>Hoyle</snm>
                  <fnm>DC</fnm>
               </au>
               <au>
                  <snm>Hayes</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Stateva</snm>
                  <fnm>LI</fnm>
               </au>
               <au>
                  <snm>Oliver</snm>
                  <fnm>SG</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <fpage>1043</fpage>
            <lpage>1051</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">419782</pubid>
                  <pubid idtype="pmpid" link="fulltext">15173111</pubid>
                  <pubid idtype="doi">10.1101/gr.2114704</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>Performance of four ribosomal DNA regions to infer higher- level phylogenetic relationships of inoperculate euascomycetes (Leotiomyceta)</p>
            </title>
            <aug>
               <au>
                  <snm>Lumbsch</snm>
                  <fnm>HT</fnm>
               </au>
               <au>
                  <snm>Schmitt</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Lindemuth</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Mangold</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Fernandez</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huhndorf</snm>
                  <fnm>S</fnm>
               </au>
            </aug>
            <source>Mol Phylogenet Evol</source>
            <pubdate>2005</pubdate>
            <volume>34</volume>
            <fpage>512</fpage>
            <lpage>24</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ympev.2004.11.007</pubid>
                  <pubid idtype="pmpid" link="fulltext">15683926</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <url>ftp://ftp.ncbi.nih.gov/pub/COG/KOG/</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>STRING: a web-server to retrieve and display the repeatedly occurring neighbourhood of a gene</p>
            </title>
            <aug>
               <au>
                  <snm>Snel</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Lehmann</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bork</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <fpage>3442</fpage>
            <lpage>3444</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">110752</pubid>
                  <pubid idtype="pmpid" link="fulltext">10982861</pubid>
                  <pubid idtype="doi">10.1093/nar/28.18.3442</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>The ClustalX windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Plewniak</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jeanmougin</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1997</pubdate>
            <volume>25</volume>
            <fpage>4876</fpage>
            <lpage>4882</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">147148</pubid>
                  <pubid idtype="pmpid" link="fulltext">9396791</pubid>
                  <pubid idtype="doi">10.1093/nar/25.24.4876</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Castresana</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <fpage>540</fpage>
            <lpage>552</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10742046</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2003</pubdate>
            <volume>52</volume>
            <fpage>696</fpage>
            <lpage>704</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150390235520</pubid>
                  <pubid idtype="pmpid">14530136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The rapid generation of mutation data matrices from protein sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Jones</snm>
                  <fnm>DT</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>Thornton</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>CABIOS</source>
            <pubdate>1992</pubdate>
            <volume>8</volume>
            <fpage>275</fpage>
            <lpage>282</lpage>
            <xrefbib>
               <pubid idtype="pmpid">1633570</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <url>http://www.aftol.org/alignments/comb_prot_ex.nex</url>
         </bibl>
      </refgrp>
   </bm>
</art>
