<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2105-8-405</ui>
   <ji>1471-2105</ji>
   <fm>
      <dochead>Software</dochead>
      <bibl>
         <title>
            <p>AxPcoords &amp; parallel AxParafit: statistical co-phylogenetic analyses on thousands of taxa</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Stamatakis</snm>
               <fnm>Alexandros</fnm>
               <insr iid="I1"/>
               <insr iid="I2"/>
               <email>Alexandros.Stamatakis@epfl.ch</email>
            </au>
            <au id="A2">
               <snm>Auch</snm>
               <mi>F</mi>
               <fnm>Alexander</fnm>
               <insr iid="I3"/>
               <email>auch@informatik.uni-tuebingen.de</email>
            </au>
            <au id="A3">
               <snm>Meier-Kolthoff</snm>
               <fnm>Jan</fnm>
               <insr iid="I3"/>
               <email>jan.mk@gmx.de</email>
            </au>
            <au id="A4">
               <snm>G&#246;ker</snm>
               <fnm>Markus</fnm>
               <insr iid="I4"/>
               <email>markus.goeker@uni-tuebingen.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>&#201;cole Polytechnique F&#233;d&#233;rale de Lausanne, School of Computer &amp; Communication Sciences, Laboratory for Computational Biology and Bioinformatics STATION 14, CH-1015 Lausanne, Switzerland</p>
            </ins>
            <ins id="I2">
               <p>Swiss Institute of Bioinformatics</p>
            </ins>
            <ins id="I3">
               <p>Center for Bioinformatics (ZBIT), Sand 14, T&#252;bingen, University of T&#252;bingen, Germany</p>
            </ins>
            <ins id="I4">
               <p>Organismic Botany/Mycology, Auf der Morgenstelle 1, T&#252;bingen, University of T&#252;bingen, Germany</p>
            </ins>
         </insg>
         <source>BMC Bioinformatics</source>
         <issn>1471-2105</issn>
         <pubdate>2007</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>405</fpage>
         <url>http://www.biomedcentral.com/1471-2105/8/405</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">17953748</pubid>
               <pubid idtype="doi">10.1186/1471-2105-8-405</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>26</day>
               <month>6</month>
               <year>2007</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>22</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>22</day>
               <month>10</month>
               <year>2007</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2007</year>
         <collab>Stamatakis et al.; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Current tools for Co-phylogenetic analyses are not able to cope with the continuous accumulation of phylogenetic data. The sophisticated statistical test for host-parasite co-phylogenetic analyses implemented in Parafit does not allow it to handle large datasets in reasonable times. The Parafit and DistPCoA programs are the by far most compute-intensive components of the Parafit analysis pipeline. We present AxParafit and AxPcoords (Ax stands for Accelerated) which are highly optimized versions of Parafit and DistPCoA respectively.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>Both programs have been entirely re-written in C. Via optimization of the algorithm and the C code as well as integration of highly tuned BLAS and LAPACK methods AxParafit runs 5&#8211;61 times faster than Parafit with a lower memory footprint (up to 35% reduction) while the performance benefit increases with growing dataset size. The MPI-based parallel implementation of AxParafit shows good scalability on up to 128 processors, even on medium-sized datasets. The parallel analysis with AxParafit on 128 CPUs for a medium-sized dataset with an 512 by 512 association matrix is more than 1,200/128 times faster per processor than the sequential Parafit run. AxPcoords is 8&#8211;26 times faster than DistPCoA and numerically stable on large datasets. We outline the substantial benefits of using parallel AxParafit by example of a large-scale empirical study on smut fungi and their host plants. To the best of our knowledge, this study represents the largest co-phylogenetic analysis to date.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>The highly efficient AxPcoords and AxParafit programs allow for large-scale co-phylogenetic analyses on several thousands of taxa for the first time. In addition, AxParafit and AxPcoords have been integrated into the easy-to-use CopyCat tool.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>One of the basic questions in evolutionary analyses <abbrgrp><abbr bid="B1">1</abbr></abbrgrp> is whether parasites (e.g., lice or Papillomaviruses) or mutualists have co-speciated with their respective hosts (e.g., mammals). The constant accumulation of DNA and AA sequence data coupled with recent advances in tree building software, such as TNT <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>, MrBayes <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, GARLI <abbrgrp><abbr bid="B4">4</abbr></abbrgrp> or RAxML <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, allow for large-scale phylogenetic analyses with several hundred or thousand taxa <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr><abbr bid="B8">8</abbr><abbr bid="B9">9</abbr><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr><abbr bid="B12">12</abbr></abbrgrp>. Thus, large-scale co-phylogenetic studies have also potentially become feasible. However, most common co-phylogenetic tools or methods such as BPA, TreeMap or TreeFitter (see review in <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>) are not able to handle datasets with a large number of taxa or have not been tested in this regard with respect to their statistical properties. Therefore, there is a performance and scalability gap between tools for phylogenetic analysis and meta-analysis. The capability to analyze large datasets is important to infer "deep co-phylogenetic" relationships which could otherwise not be assessed <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>.</p>
         <p>Parafit <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> implements statistical tests for both overall phylogenetic congruence as well as for the significance of individual associations. Extensive simulations have shown that the Parafit tests are statistically well-behaved and yield acceptable error rates. The method has been successfully applied in a number of biological studies <abbrgrp><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr><abbr bid="B18">18</abbr><abbr bid="B19">19</abbr></abbrgrp>. In addition, the Type-II statistical error of Parafit decreases with the size of the dataset (see <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>), i.e., this approach scales well on large phylogenies of hosts and associates. Due to these desirable properties, recent work on CopyCat <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> focused on improving the usability of Parafit via a Graphical User Interface (GUI) and automation of the analysis pipeline which transforms phylogenetic trees to patristic (tree-based) distance matrices, converts distance matrices to matrices of eigenvectors using DistPCoA <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, invokes Parafit, and parses input, intermediate, as well as output files. However, co-phylogenetic analyses with CopyCat can not be conducted on large datasets due to the excessive run time requirements of Parafit and DistPCoA, which represent the by far most compute-intensive part of the CopyCat analysis pipeline.</p>
         <p>Here we present AxParafit and AxPcoords which are highly optimized and parallelized versions of Parafit and DistPCoA respectively. As outlined by the case-study on smut fungi on page 6 these accelerated programs allow for more thorough large-scale co-phylogenetic analyses and extend the applicability of the approach by 1&#8211;2 orders of magnitude, thus closing the aforementioned performance gap concerning current phylogenetic meta-analysis tools. Coupled with the easy-to-use CopyCat tool AxParafit/AxPcoords facilitate statistical co-phylogenetic analyses on the largest trees that can currently be computed.</p>
      </sec>
      <sec>
         <st>
            <p>Implementation</p>
         </st>
         <p>For programming convenience and portability as well as due to the structure of the original Fortran code we re-implemented Parafit and DistPCoA in C from scratch.</p>
         <sec>
            <st>
               <p>Sequential Optimization</p>
            </st>
            <p>The sequential C code was optimized by reducing unnecessary memory allocations for matrices in AxPcoords/AxParafit and using a faster method to permute matrices in AxParafit.</p>
            <p>Thereafter the compute-intensive for-loops in AxParafit/AxPcoords were manually tuned. After those initial optimizations we profiled both programs and found that the run-times were now largely dominated (over 90% of total execution time) by a dense matrix-matrix multiplication in AxParafit and the computation of eigenvectors/eigenvalues in AxPcoords respectively. To further accelerate the programs we integrated function calls to the highly optimized matrix multiplication of the BLAS (Basic Linear Algebra Package <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>) package and eigenvector/eigenvalue decomposition in LAPACK (Linear Algebra PACKage <abbrgrp><abbr bid="B22">22</abbr></abbrgrp>).</p>
            <p>For BLAS we assessed the usage of ATLAS BLAS (Automatically Tuned Linear Algebra Software, math-atlas.sourceforge.net) as well as the ACML BLAS (AMD Core Math Library <abbrgrp><abbr bid="B23">23</abbr></abbrgrp>) libraries on a 2.4 GHz AMD Opteron CPU. The ACML package showed slightly faster speeds (&#8776; 7&#8211;9%). However, AxParafit also provides an interface to the INTEL MKL (Math Kernel Library) and ATLAS BLAS implementations. AMD ACML, INTEL MKL, and ATLAS are all freely available for academic use. AxParafit can also be compiled without BLAS and rely on a manually tuned matrix multiplication which is approximately 4 times slower.</p>
            <p>AxPcoords can use either the LAPACK functions implemented in the AMD ACML or INTEL MKL libraries. In addition, AxPcoords can also make use of the GNU scientific library <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> for eigenvector/eigenvalue computations.</p>
            <p>The tuned programs were designed to yield <it>exactly </it>the same results as Parafit and DistPCoA. Note however, that in contrast to AxPcoords we observed numerically unstable results for DistPCoA on datasets with large association matrices, containing more than 4,096 entries. This is due to some well-known problems with the stability of eigenvector/eigenvalue decomposition <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> on large datasets and due to the fact that the original Parafit code uses the algorithm from <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>. Therefore, the integration of the thoroughly tested LAPACK routines, apart from speed benefits, also yields increased numerical stability. We integrated AxPcoords and AxParafit into CopyCat <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Figure <figr fid="F1">1</figr> provides a screen-shot of CopyCat whit a drop-down menu that allows the user to select AxParafit/AxPcoords for executing the analyses.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Screen-shot of AxParafit/AxPcoords Option in CopyCat</p>
               </caption>
               <text>
                  <p><b>Screen-shot of AxParafit/AxPcoords Option in CopyCat</b>. This screen-shot shows the CopyCat drop-down menu that allows the user to select AxParafit/AxPcoords for executing the analyses and to switch between the U and W modes of branch length computation.</p>
               </text>
               <graphic file="1471-2105-8-405-1"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Parallelization</p>
            </st>
            <p>AxPcoords requires less than 24 hours of run-time on a single CPU, even for distance matrices with several thousands of taxa. Therefore, we exclusively focused on the parallelization of AxParafit which requires run-times of several days or weeks on large datasets.</p>
            <p>The execution time of Parafit depends on the sizes of input matrices <it>A</it>, <it>B</it>, and <it>C </it>with dimensions <it>n</it><sub>1</sub><it>n</it><sub>2</sub>, <it>n</it><sub>4</sub><it>n</it><sub>1</sub>, and <it>n</it><sub>3</sub><it>n</it><sub>2 </sub>respectively (for details see <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>). The complexity is roughly <it>O</it>(<it>nonZero</it>(<it>A</it>)<it>n</it><sub>3</sub><it>n</it><sub>4</sub><it>n</it><sub>1</sub><it>p</it>). The term <it>n</it><sub>3</sub><it>n</it><sub>4</sub><it>n</it><sub>1 </sub>is the complexity of the dense matrix multiplication in AxParafit. The variable <it>p </it>is the user-specified number of permutations that shall be executed (typically 99&#8211;9,999, not counting the original permutation) and <it>nonZero</it>(<it>A</it>) is the number of non-zero elements in the binary association matrix <it>A</it>. The program executes two main steps: the global test of co-speciation with complexity <it>O</it>(<it>n</it><sub>3</sub><it>n</it><sub>4</sub><it>n</it><sub>1</sub><it>p</it>) and the individual tests with complexity <it>O</it>(<it>nonZero</it>(<it>A</it>)<it>n</it><sub>3</sub><it>n</it><sub>4</sub><it>n</it><sub>1</sub><it>p</it>). Since in real-world analyses <it>nonZero</it>(<it>A</it>) &#8811; 1 we only parallelized all individual tests of co-speciation which typically generate over 99% of the total computational load. Our approach represents a trade-off between the amount of programming effort required for the parallelization and the expected performance gains. Thus, initially the global test of co-speciation must be executed using the sequential version of AxParafit. The sequential program provides an option to conduct the global test, write a binary output file that can be used to start the parallel computation of individual host-parasite links, and then exit.</p>
            <p>The statistical test of individual associations has been parallelized with MPI (Message Passing Interface) via a master-worker scheme. The parallelization is straight-forward since all tests of individual associations are independent from one another and can thus be computed independently on individual workers. Moreover, each individual test has approximately the same execution time, such that there are no problems due to load imbalance. The maximum number of CPUs that can be used by our parallelization is thus <it>nonZero</it>(<it>A</it>). However, this can be improved by using the ACML or MKL BLAS implementations that exploit fine-grained loop level parallelism on SMP (Symmetric Multi-Processing) architectures. This allows for a more efficient utilization of hybrid supercomputer architecture. Moreover, it might help to improve performance on huge datasets where SMP implementations can profit from super-linear speedups due to increased cache efficiency.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results and Discussion</p>
         </st>
         <p>The current Section is split into two parts: Part 1 describes the computational results while Part 2 outlines the substantial benefits of using AxParafit for large-scale empirical co-phylogenetic studies.</p>
         <sec>
            <st>
               <p>Computational Performance</p>
            </st>
            <p>Here we provide performance data regarding the purely computational aspects of AxParafit.</p>
            <sec>
               <st>
                  <p>Experimental Setup</p>
               </st>
               <p>To conduct computational experiments we used an unloaded system of 36 4-way AMD 2.4 GHz Opteron processors with 8 GB of main memory per node which are interconnected by an Infiniband switch. Parafit and DistPCoA were compiled using g77 -ffixed-line-length-0 -ff90-intrinsics-delete -03. AxParafit and AxPcoords were compiled with -03 -fomit-frame-pointer -funroll-loops and linked with the AMD ACML library. We also assessed additional compiler optimizations (-fomit-frame-pointer, -funroll-loops, -m64, -march = k8) with g77 for Fortran, which actually lead to performance decrease of Parafit and DistPCoA (data not shown).</p>
               <p>In order to assess performance of AxParafit we extracted subsets from a large empirical dataset with more than 30,000 host-associate links (collected from entries in the EMBL database <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>), which we are currently analyzing with our tools. We sampled square association matrices <it>A</it>, i.e., <it>n</it><sub>1 </sub>= <it>n</it><sub>2 </sub>of dimensions 128, 256, 512, 1,024, and 2,048. The number <it>nonZero</it>(<it>A</it>) was 128, 256, 512, 1,024, and 2,048 respectively. The number of permutations <it>p </it>was set to 99, 99, 9, 2, and 2 respectively. A complete test on the dataset of size 4,096 was not conducted with Parafit due to the extremely long run-times on <it>n</it><sub>1 </sub>= <it>n</it><sub>2 </sub>= 2, 048 which already amounts to 19.9 days compared to 7.7 hours required by AxParafit.</p>
               <p>To test AxPcoords we used the same compiler switches as indicated above and a subset of the square association matrices with <it>nonZero</it>(<it>A</it>) amounting to 512, 1,024, 2,048, and 4,096 respectively.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <p>In Figure <figr fid="F2">2</figr> we provide the sequential run-time improvement of AxParafit over Parafit. The acceleration obtained by AxParafit increases with growing dataset size and attains a factor of 61.86 on the association matrix of size 2,048. The increase of the performance improvement with growing dataset size is mainly due the larger efficiency of both our own optimizations as well as the cache blocking strategies used in the BLAS implementations.</p>
         <fig id="F2">
            <title>
               <p>Figure 2</p>
            </title>
            <caption>
               <p>Run Time Improvement Sequential AxParafit versus Parafit</p>
            </caption>
            <text>
               <p><b>Run Time Improvement Sequential AxParafit versus Parafit</b>. Run-time improvement of AxParafit versus Parafit for quadratic association matrices of dimensions 128, 256, 512, 1,024, and 2,048.</p>
            </text>
            <graphic file="1471-2105-8-405-2"/>
         </fig>
         <p>Figure <figr fid="F3">3</figr> provides the memory use of AxParafit and Parafit in MB for quadratic <it>A</it>-matrices of sizes 128, 256, 512, 1,024, 2,048, and 4,096 (note that the dataset of size 4,096 was not run to completion). To test AxPcoords we used distance matrices of sizes 512, 1,024, 2,048, and 4,096. Run-time improvements range from 8.8 to 25.74. The run on 4,096 with DistPCoA apparently terminated but did not write a results file, most probably due to numerical instability (Pierre Legendre, personal communication). Figure <figr fid="F4">4</figr> shows the run-time improvement of AxPcoords over DistPCoA for quadratic distance matrices of sizes 512, 1,024, 2,048, and 4,096. As already mentioned, the run on 4,096 with DistPCoA did not write a results file. Tests on smaller distance matrices e.g., of size 128 and 256 were omitted due to the low execution times which were below 10 seconds. On the largest matrix AxPcoords terminated within only 399 seconds as opposed to 10,268 seconds required by DistPCoA.</p>
         <fig id="F3">
            <title>
               <p>Figure 3</p>
            </title>
            <caption>
               <p>Memory Consumption AxParafit versus Parafit</p>
            </caption>
            <text>
               <p><b>Memory Consumption AxParafit versus Parafit</b>. Memory consumption of Parafit and AxParafit for quadratic association matrices of size 128, 256, 512, 1,024, 2,048, and 4,096.</p>
            </text>
            <graphic file="1471-2105-8-405-3"/>
         </fig>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Run Time Improvement Sequential AxPcoords versus DistPCoA</p>
            </caption>
            <text>
               <p><b>Run Time Improvement Sequential AxPcoords versus DistPCoA</b>. Run-time improvement of AxPcoords versus DistPCoA for quadratic distance matrices of dimensions 512, 1,024, 2,048, and 4,096.</p>
            </text>
            <graphic file="1471-2105-8-405-4"/>
         </fig>
         <p>We assessed scalability of parallel AxParafit using the association matrix <it>A </it>of size 512 on 4, 8, 16, 32, 64, and 128 processors with <it>p </it>= 99. Figure <figr fid="F5">5</figr> provides the speedup with respect to the number of worker processes. We indicate speedup values for the parallel part (SpeedupIndividual, computation of individual host-parasite links) as well as for the sequential plus the parallel part of the program (SpeedupWhole), i.e., we added the sequential computation time for the global test to the parallel execution time. On 128 processors the computation took only 50 seconds. An analysis of this dataset with the sequential version of Parafit would take approximately 20 hours.</p>
         <fig id="F5">
            <title>
               <p>Figure 5</p>
            </title>
            <caption>
               <p>Speedup of Parallel AxParafit</p>
            </caption>
            <text>
               <p><b>Speedup of Parallel AxParafit</b>. Speedup of parallelized part and speedup for sequential plus parallel part of AxParParafit for a quadratic association matrix of size 512 on 4, 8, 16, 32, 64 and 128 CPUs.</p>
            </text>
            <graphic file="1471-2105-8-405-5"/>
         </fig>
         <sec>
            <st>
               <p>A Real-World Example</p>
            </st>
            <p>In order to provide an example for the substantial benefits of performing a large-scale co-phylogenetic analysis with AxParafit we provide a real-world study on smut fungi and their host plants.</p>
            <sec>
               <st>
                  <p>Experimental Data</p>
               </st>
               <p>We collected a large sample of associations of smut fungi and their host plants. Smut fungi comprise more than 1,500 species of obligate phytoparasites and are arranged in the taxa <it>Entorrhizomycetes</it>, <it>Microbotryales</it>, and <it>Ustilaginomycotina</it>. These parasites cause syndromes such as dark, powdery appearance of the mature spore masses or may even lead to plant deformation in some cases <abbrgrp><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. The <it>Ustilaginomycotina </it>also comprise obligate plant parasites with distinct morphology <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>.</p>
               <p>With a few exceptions, hosts of smut fungi belong to the Angiosperms <abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. For economically important hosts, such as barley and other cereals, smut fungi may cause considerable yield losses (see e.g., <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>). Phylogeny and taxonomy of genera and higher ranks has been derived from sound molecular and ultrastructural data in recent years (see <abbrgrp><abbr bid="B30">30</abbr></abbrgrp> and references therein). However, apart from the work presented in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, co-phylogenetic analysis of smut fungi have so far been restricted to single genera with comparatively few species <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp>.</p>
               <p>In addition to the host plant index for European smut fungi <abbrgrp><abbr bid="B31">31</abbr><abbr bid="B35">35</abbr></abbrgrp> that has been used in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, information on smut fungus-host plant associations was extracted from the following publications: Bauer et al. <abbrgrp><abbr bid="B36">36</abbr><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp>, Begerow et al. <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B39">39</abbr></abbrgrp>, De Beer et al. <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>, Hendrichs et al. <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, Nannfeldt <abbrgrp><abbr bid="B42">42</abbr></abbrgrp>, Piepenbring <abbrgrp><abbr bid="B43">43</abbr></abbrgrp>, Scholz and Scholz <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, an unpublished manuscript by K. Vanky (Smut fungi of the Indian subcontinent; Vanky, personal communication), and Vanky and McKenzie <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Moreover, we included information contained in the "specific host" entries of the complete collection of core nucleotide sequences for <it>Entorrhizomycetes</it>, <it>Microbotryales</it>, and <it>Ustilaginomycotina </it>downloaded from GenBank <abbrgrp><abbr bid="B46">46</abbr></abbrgrp> on September 01, 2007 (12,815 sequences). Parasite taxon names were corrected using Vanky's synonym-list <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Synonyms for host taxon names were obtained from Palese and Moser <abbrgrp><abbr bid="B47">47</abbr></abbrgrp>.</p>
               <p>Including synonyms, our data set contained 3,912 different fungus-plant associations. In order to retrieve taxon IDs and to construct taxonomy trees for hosts and parasites <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, we used the NCBI taxonomy release of September 01, 2007. For host and parasite species names that were not found in the NCBI taxonomy, the search was repeated after reducing the taxon name to the respective genus. In this way, a total of 2,362 different associations could be identified that covers 413 smut fungi and 1,400 host plants. Thus, the dataset assembled was more than <it>three times </it>larger than the one recently analyzed in <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, which contained 645 associations, corresponding to 140 smut fungi and 437 host plants. The Parafit analysis of this comparatively small dataset took already more than a week. For both hosts and parasites, two trees were constructed, one tree with branch lengths corresponding to the "true" (denoted as W for Weighted) taxonomical distance <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> and one with all branch lengths set to 1 (denoted as U for Un-weighted/Uniform). As outlined on page 4 the computational complexity of AxParafit is <it>O</it>(<it>nonZero</it>(<it>A</it>)<it>n</it><sub>3</sub><it>n</it><sub>4</sub><it>n</it><sub>1</sub><it>p</it>) and thus the execution time requirements for this larger dataset increase significantly.</p>
            </sec>
            <sec>
               <st>
                  <p>Inference with AxParafit</p>
               </st>
               <p>Production runs with Parafit and AxParafit on an initial version of our dataset were started on August 29, 2007. While the Parafit inferences with 99 permutations on this initial dataset were still running at the time of writing this manuscript(September 9, 2007), the parallel AxParafit run with 99 permutations terminated within less than 480 seconds on 128 CPUs of the Infiniband cluster. This made the results available immediately and allowed us to identify a bug in the data collection script. The buggy version of this script did not take the presence of non-unique scientific taxon names, (e.g.,<it>Setaria </it>(<it>Magnoliophyta, Poales</it>) and <it>Setaria </it>(<it>Nematoda, Filarioidea</it>)) into account to identify NCBI taxon IDs. Such errors are unfortunately typical and frequent in Bioinformatics analysis pipelines. As a typical example of such errors consider the retraction of "Measures of Clade Confidence Do Not Correlate with Accuracy of Phylogenetic Trees" by Barry G. Hall due to an error in a perl script <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>.</p>
               <p>In addition to the rapid detection of input data errors, the significant performance gains obtained by sequential optimization and parallelism allow for the assessment of different program parameters and analysis options, such as trees with different patristic distances (U and W trees) as well as the impact of the number of permutations on the results (AxParafit was run with 99/999/9,999 permutations on the U and W data), i.e., a significantly more thorough and detailed analysis.</p>
               <p>The absolute execution times for AxParafit on 128 CPUs for 99/999/9,999 permutations are indicated in Table <tblr tid="T1">1</tblr>. Essentially, 99 permutations could be conducted within 7 minutes, 999 permutations in much less than 2 hours, and 9,999 permutations overnight in about 12 hours such that the whole study, including the detection of the script error and the analysis of the results could be completed in less than a week. As indicated in Table <tblr tid="T2">2</tblr> there are a number of links (max. 48 out of 2,362 &#8776; 2%) that are not uniformly significant or uniformly insignificant at low <it>p</it>-values between analyses with a distinct number of permutations. AxParafit therefore allows for rapid and much more thorough computation and analyses of large co-phylogenetic datasets. The results indicate that U-based analyses are in general more sensitive to the number of permutations than W-based runs. Note that the number of host/parasite eigenvectors for U (1,390/411) was higher than for W (1,200/372), which explains the longer execution times and potentially the larger differences in significance values.</p>
               <tbl id="T1">
                  <title>
                     <p>Table 1</p>
                  </title>
                  <caption>
                     <p>Empirical Data Study: Parallel AxParafit Execution Times</p>
                  </caption>
                  <tblbdy cols="4">
                     <r>
                        <c ca="center">
                           <p># Permutations</p>
                        </c>
                        <c ca="center">
                           <p>99</p>
                        </c>
                        <c ca="center">
                           <p>999</p>
                        </c>
                        <c ca="center">
                           <p>9,999</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="4">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>W</p>
                        </c>
                        <c ca="center">
                           <p>355 secs</p>
                        </c>
                        <c ca="center">
                           <p>3,759 secs</p>
                        </c>
                        <c ca="center">
                           <p>39,170 secs</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>U</p>
                        </c>
                        <c ca="center">
                           <p>451 secs</p>
                        </c>
                        <c ca="center">
                           <p>4,441 secs</p>
                        </c>
                        <c ca="center">
                           <p>47,221 secs</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>Parallel execution times in seconds for AxParafit on 128 CPUs for 99/999/9,999 permutations.</p>
                  </tblfn>
               </tbl>
               <tbl id="T2">
                  <title>
                     <p>Table 2</p>
                  </title>
                  <caption>
                     <p>Empirical Data Study: Impact of the Number of Permutations</p>
                  </caption>
                  <tblbdy cols="9">
                     <r>
                        <c ca="center">
                           <p># Permutations</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>99/999/9,999</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>99/999</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>99/9,999</p>
                        </c>
                        <c cspan="2" ca="center">
                           <p>999/9,999</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>Tree</p>
                        </c>
                        <c ca="center">
                           <p>W</p>
                        </c>
                        <c ca="center">
                           <p>U</p>
                        </c>
                        <c ca="center">
                           <p>W</p>
                        </c>
                        <c ca="center">
                           <p>U</p>
                        </c>
                        <c ca="center">
                           <p>W</p>
                        </c>
                        <c ca="center">
                           <p>U</p>
                        </c>
                        <c ca="center">
                           <p>W</p>
                        </c>
                        <c ca="center">
                           <p>U</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="9">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>p = 0.01</p>
                        </c>
                        <c ca="center">
                           <p>16</p>
                        </c>
                        <c ca="center">
                           <p>48</p>
                        </c>
                        <c ca="center">
                           <p>14</p>
                        </c>
                        <c ca="center">
                           <p>35</p>
                        </c>
                        <c ca="center">
                           <p>13</p>
                        </c>
                        <c ca="center">
                           <p>36</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>25</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>p = 0.02</p>
                        </c>
                        <c ca="center">
                           <p>7</p>
                        </c>
                        <c ca="center">
                           <p>27</p>
                        </c>
                        <c ca="center">
                           <p>5</p>
                        </c>
                        <c ca="center">
                           <p>27</p>
                        </c>
                        <c ca="center">
                           <p>6</p>
                        </c>
                        <c ca="center">
                           <p>27</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>p = 0.03</p>
                        </c>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>22</p>
                        </c>
                        <c ca="center">
                           <p>3</p>
                        </c>
                        <c ca="center">
                           <p>17</p>
                        </c>
                        <c ca="center">
                           <p>4</p>
                        </c>
                        <c ca="center">
                           <p>19</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>8</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>p = 0.04</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>18</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>17</p>
                        </c>
                        <c ca="center">
                           <p>2</p>
                        </c>
                        <c ca="center">
                           <p>18</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>p = 0.05</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>8</p>
                        </c>
                        <c ca="center">
                           <p>0</p>
                        </c>
                        <c ca="center">
                           <p>8</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>7</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                        <c ca="center">
                           <p>1</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The table outlines the impact of the number of permutations on the distribution of significant and insignificant links for distinct <it>p</it>-values. Column (99/999/9,999) indicates the number of links that have a different significance than at least one of the other runs.</p>
                  </tblfn>
               </tbl>
               <p>Table <tblr tid="T3">3</tblr> indicates the number of different significant links between the U- and W-based analyses for various <it>p</it>-values. The table indicates that there is no clear tendency for differences to decrease with increasing number of permutations.</p>
               <tbl id="T3">
                  <title>
                     <p>Table 3</p>
                  </title>
                  <caption>
                     <p>Empirical Data Study: Differences between U and W-based Analyses</p>
                  </caption>
                  <tblbdy cols="6">
                     <r>
                        <c ca="center">
                           <p>p-value</p>
                        </c>
                        <c ca="center">
                           <p>0.01</p>
                        </c>
                        <c ca="center">
                           <p>0.02</p>
                        </c>
                        <c ca="center">
                           <p>0.03</p>
                        </c>
                        <c ca="center">
                           <p>0.04</p>
                        </c>
                        <c ca="center">
                           <p>0.05</p>
                        </c>
                     </r>
                     <r>
                        <c cspan="6">
                           <hr/>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>99</p>
                        </c>
                        <c ca="center">
                           <p>91</p>
                        </c>
                        <c ca="center">
                           <p>60</p>
                        </c>
                        <c ca="center">
                           <p>42</p>
                        </c>
                        <c ca="center">
                           <p>29</p>
                        </c>
                        <c ca="center">
                           <p>16</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>999</p>
                        </c>
                        <c ca="center">
                           <p>76</p>
                        </c>
                        <c ca="center">
                           <p>54</p>
                        </c>
                        <c ca="center">
                           <p>44</p>
                        </c>
                        <c ca="center">
                           <p>15</p>
                        </c>
                        <c ca="center">
                           <p>10</p>
                        </c>
                     </r>
                     <r>
                        <c ca="center">
                           <p>9,999</p>
                        </c>
                        <c ca="center">
                           <p>84</p>
                        </c>
                        <c ca="center">
                           <p>51</p>
                        </c>
                        <c ca="center">
                           <p>51</p>
                        </c>
                        <c ca="center">
                           <p>13</p>
                        </c>
                        <c ca="center">
                           <p>8</p>
                        </c>
                     </r>
                  </tblbdy>
                  <tblfn>
                     <p>The table shows the differences between U and W-based analyses of individual associations for different <it>p</it>-values and numbers of permutations.</p>
                  </tblfn>
               </tbl>
            </sec>
            <sec>
               <st>
                  <p>Biological Interpretation of Results</p>
               </st>
               <p>In the following, we focus on the results obtained with 9,999 permutations and branch lengths scaled in terms of taxonomical distances (W-labeled results). The global test indicates a highly significant co-phylogenetic relationship (<it>p </it>= 0.0001). An overview of the results for individual host-parasite links based on the smut fungi genera is provided in Figure <figr fid="F6">6</figr>. Major taxonomic groups of host and parasites are indicated according to the NCBI taxonomy release used. Based on a significance threshold of <it>p </it>= 0.05 and the ParafitLink1 statistics <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, a total of 578 insignificant and 1,784 significant associations is obtained. As in our earlier study <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, genera of smut fungi are rather uniform with respect to their significance values, which facilitates the identification of a general distribution pattern with respect to significant and insignificant links, i.e., the "deep co-phylogeny" of smut fungi.</p>
               <fig id="F6">
                  <title>
                     <p>Figure 6</p>
                  </title>
                  <caption>
                     <p>Overview of the Results for Individual Host-Parasite Links based on the Smut Fungi Genera</p>
                  </caption>
                  <text>
                     <p><b>Overview of the Results for Individual Host-Parasite Links based on the Smut Fungi Genera</b>. Major taxonomic groups of host and parasites are indicated according to the NCBI taxonomy release used. Significant and insignificant associations are indicated as [S] or [I] respectively. Stars denote to doubtful associations.</p>
                  </text>
                  <graphic file="1471-2105-8-405-6"/>
               </fig>
               <p>The single most important factor appears to be whether the hosts belong to the monocots (i.e.,<it>Liliopsida</it>) or not. <it>Entorrhiza </it>species, which are taxonomically isolated, mostly are linked with monocots (<it>Poales</it>) and do thus not contribute significantly to the overall fit between host and parasite phylogenies. In the case of <it>Microbotryales</it>, the majority of taxa are pathogenic of core eudicots, resulting in significant links. Fewer associations with monocots (mostly <it>Poales</it>) are present, which are considered insignificant. The same pattern can be observed in the class <it>Exobasidiomycetes </it>within <it>Ustilaginomycotina</it>: A minority of host-parasite links is within monocots (<it>Poales</it>, but also other orders), which are considered insignificant, whereas the associations with other hosts (<it>Selaginellales</it>, basal <it>Magnoliophyta</it>, magnoliids, and stem and core eudicots) are significant. Inverse relationships are present in the class <it>Ustilaginomycetes </it>within <it>Ustilaginomycotina</it>. Here, most species infect monocots, mainly <it>Poales</it>, significantly increasing the congruence between host and parasite taxonomy trees, whereas the associations with core eudicots appear to be insignificant.</p>
               <p>Accordingly, the current analysis that is based on a considerably larger empirical sample (e.g., 66 instead of 25 included genera of smut fungi) confirms earlier results <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Therefore, we can generalize the observation that the difference between <it>Poales </it>and non-<it>Poales </it>hosts is crucial for the distribution of significance values to the distinction between monocot and non-monocot hosts. We also observe a small number of exceptions from this general pattern. For instance, in <it>Urocystis </it>(<it>Ustilaginomycetes</it>), which occurs on a variety of host groups, the links with stem eudicots (species of <it>Ranunculaceae</it>) are significant, and a single link with monocots (PACCAD clade within <it>Poaceae</it>) is judged as insignificant. Thus, rather subtle details of the host-parasite relationships, such as the presence of <it>Urocystis </it>on several closely related Ranunculaceae hosts and its presence on distantly related hosts within <it>Poaceae</it>, are recognized by the AxParafit algorithm, and the uniform overall pattern does not merely reflect the relatively low topological resolution present in the taxonomy trees.</p>
               <p>Some of the results obtained may also be due to flaws in the taxonomy of the species included, particularly in the nomenclature of the parasites. For instance, <it>Entorrhiza isoetis </it>is most likely conspecific with <it>Ustilago isoetis </it><abbrgrp><abbr bid="B31">31</abbr></abbrgrp>. At present it is even doubtful whether this species belongs to smut fungi (R. Bauer, personal communication). Thus, the associations with Isoetes (<it>Lycopodiophyta</it>) mentioned in Scholz and Scholz <abbrgrp><abbr bid="B44">44</abbr></abbrgrp>, which show different significance values than the majority of hosts links in either <it>Ustilago </it>or <it>Entorrhiza</it>, are dubious. Likewise, the exceptional associations of <it>Entyloma </it>with monocots are probably due to species names that would need to be recombined into genera of the <it>Georgefischeriales </it><abbrgrp><abbr bid="B37">37</abbr></abbrgrp>. Whereas these flaws have to be corrected by considering more comprehensive lists of species and synonyms in monographs and in future releases of the NCBI database, it is apparent that neither the highly significant overall co-phylogenetic relationship nor the general pattern regarding individual host-smut fungus links would be affected by the removal of the doubtful associations. Rather, their influence is overcome by the large total sample size; for each parasite genus dubious links are few relative to the total number of links or not present at all. Likewise, there are few differences in the significance between analyses with a distinct number of permutations (see Table <tblr tid="T2">2</tblr>). Discrepancies between U and W are also comparatively small (see Table <tblr tid="T3">3</tblr>). With 9,999 permutations, they are restricted to four genera of smut fungi and only affect hosts, such as <it>Urocystis </it>on monocots in <it>Asparagales </it>and <it>Dioscoreales </it>(details not shown), with an intermediate taxonomic position.</p>
               <p>The analysis process presented here underlines the advantage of the large-scale approach to co-phylogenetic tests, that is enabled by AxPcoords/AxParafit. Furthermore, because many problems are more easily recognized after conducting preliminary runs, re-analysis after applying corrective measures may be necessary for many empirical datasets. Thus, efficient implementations and parallelism are of great practical importance for the analysis pipeline.</p>
            </sec>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We have produced highly optimized and efficient implementations of the two most compute-intensive components for P. Legendre's statistical test of host-parasite co-speciation. The parallel implementation of AxParafit scales well up to 128 CPUs on a medium-size dataset. AxParafit and AxPcoords have been integrated into the CopyCat tool and are freely available for download as open source code.</p>
         <p>Future work will mainly cover large-scale production runs with AxParafit.</p>
      </sec>
      <sec>
         <st>
            <p>Availability and Requirements</p>
         </st>
         <p>The source code and some of the test datasets are available at ic <url>http://www.epfl.ch/~stamatak/AxParafit.html</url>.</p>
         <p>The datasets and results of the empirical study on smut fungi are also available at this site. It also provides several pre-compiled binaries for Windows, MAC, and Linux/Unix platforms.</p>
         <p>AxParafit can be compiled as stand-alone application without making use of either ATLAS, MKL or ACML. AxPcoords requires either MKL, ACML, or the GNU scientific library.</p>
         <p>The new CopyCat version that uses AxParafit and AxPcoords is available at <url>http://www-ab.informatik.uni-tuebingen.de/software/copycat/review</url>.</p>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>AS ported the programs from Fortran to C, optimized the C code, integrated the BLAS and LAPACK packages, parallelized the program and performed the computational experiments. AFA and JMK carried out the integration into CopyCat. AFA, JMK, and MG assembled the Binaries for various platforms and provided scripts to conduct the computational experiments. MG assembled the test datasets. AS and MG conducted the empirical study on smut fungi and their hosts. AS, AFA, JMK, and MG wrote the manuscript. All authors read and approved the final manuscript</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We would like to thank Pierre Legendre for his kind and very useful feedback on this manuscript. We would also like to thank Daniel Huson for useful comments on this manuscript. Cordial thanks are addressed to K. Vanky, M. Piepenbring, D. Begerow, R. Bauer, and M. Hendrichs for providing published hosts lists of smut fungi stored electronically and particularly to K. Vanky for giving access to the list of parasites from India. R. Bauer provided helpful additional advice on smut fungi.</p>
            <p>AS is funded by Swiss Confederation Funds. Financial support provided by the Deutsche Forschungsgemeinschaft for MG and AFA is gratefully acknowledged.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <aug>
               <au>
                  <snm>Page</snm>
                  <fnm>RDM</fnm>
               </au>
            </aug>
            <source>Tangled Trees. Phylogeny, Cospeciation and Coevolution</source>
            <publisher>The University of Chicago Press</publisher>
            <pubdate>2002</pubdate>
            <note>chap. Introduction</note>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Analyzing large data sets in reasonable times: solution for composite optima</p>
            </title>
            <aug>
               <au>
                  <snm>Goloboff</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Cladistics</source>
            <pubdate>1999</pubdate>
            <volume>15</volume>
            <fpage>415</fpage>
            <lpage>428</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1111/j.1096-0031.1999.tb00278.x</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>MrBayes 3: Bayesian phylogenetic inference under mixed models</p>
            </title>
            <aug>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>12</issue>
            <fpage>1572</fpage>
            <lpage>1574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg180</pubid>
                  <pubid idtype="pmpid" link="fulltext">12912839</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Genetic Algorithm Approaches for the Phylogenetic Analysis of Large Biological Sequence Datasets under the Maximum Likelihood Criterion</p>
            </title>
            <aug>
               <au>
                  <snm>Zwickl</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>PhD thesis</source>
            <publisher>University of Texas at Austin</publisher>
            <pubdate>2006</pubdate>
         </bibl>
         <bibl id="B5">
            <title>
               <p>RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models</p>
            </title>
            <aug>
               <au>
                  <snm>Stamatakis</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>21</issue>
            <fpage>2688</fpage>
            <lpage>2690</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btl446</pubid>
                  <pubid idtype="pmpid" link="fulltext">16928733</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>A Nuclear Ribosomal DNA Phylogeny of <it>Acer </it>Inferred with Maximum Likelihood, Splits Graphs, and Motif Analyses of 606 Sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Grimm</snm>
                  <fnm>GW</fnm>
               </au>
               <au>
                  <snm>Renner</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Stamatakis</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hemleben</snm>
                  <fnm>V</fnm>
               </au>
            </aug>
            <source>Evolutionary Bioinformatics Online</source>
            <pubdate>2006</pubdate>
            <volume>2</volume>
            <fpage>279</fpage>
            <lpage>294</lpage>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Phylogenetic Supermatrix Analysis of GenBank Sequences from 2228 Papilionoid Legumes</p>
            </title>
            <aug>
               <au>
                  <snm>McMahon</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2006</pubdate>
            <volume>55</volume>
            <issue>5</issue>
            <fpage>818</fpage>
            <lpage>836</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150600999150</pubid>
                  <pubid idtype="pmpid" link="fulltext">17060202</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>The delayed rise of present-day mammals</p>
            </title>
            <aug>
               <au>
                  <snm>Bininda-Emonds</snm>
                  <fnm>ORP</fnm>
               </au>
               <au>
                  <snm>Cardillo</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>MacPhee</snm>
                  <fnm>RDE</fnm>
               </au>
               <au>
                  <snm>Beck</snm>
                  <fnm>RMD</fnm>
               </au>
               <au>
                  <snm>Grenyer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Vos</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Gittleman</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Purvis</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2007</pubdate>
            <volume>446</volume>
            <fpage>507</fpage>
            <lpage>512</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature05634</pubid>
                  <pubid idtype="pmpid" link="fulltext">17392779</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Greengenes, a Chimera-Checked 16S rRNA Gene Database and Workbench Compatible with ARB</p>
            </title>
            <aug>
               <au>
                  <snm>DeSantis</snm>
                  <fnm>TZ</fnm>
               </au>
               <au>
                  <snm>Hugenholtz</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Larsen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Rojas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Brodie</snm>
                  <fnm>EL</fnm>
               </au>
               <au>
                  <snm>Keller</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Huber</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Dalevi</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>GL</fnm>
               </au>
            </aug>
            <source>Applied Environmental Microbiology</source>
            <pubdate>2006</pubdate>
            <volume>72</volume>
            <issue>7</issue>
            <fpage>5069</fpage>
            <lpage>5072</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/AEM.03006-05</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>A 567-taxon data set for angiosperms: The challenges posed by bayesian analyses of large data sets</p>
            </title>
            <aug>
               <au>
                  <snm>Soltis</snm>
                  <fnm>DE</fnm>
               </au>
               <au>
                  <snm>Gitzendanner</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Soltis</snm>
                  <fnm>PS</fnm>
               </au>
            </aug>
            <source>International Journal of Plant Sciences</source>
            <pubdate>2007</pubdate>
            <volume>168</volume>
            <issue>2</issue>
            <fpage>137</fpage>
            <lpage>157</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1086/509788</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Phylogenetic diversity and ecology of environmental <it>Archaea</it></p>
            </title>
            <aug>
               <au>
                  <snm>Robertson</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Spear</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Pace</snm>
                  <fnm>NR</fnm>
               </au>
            </aug>
            <source>Current Opinion in Microbiology</source>
            <pubdate>2005</pubdate>
            <volume>8</volume>
            <fpage>638</fpage>
            <lpage>642</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16236543</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Unexpected Diversity and Complexity of the Guerrero Negro Hypersaline Microbial Mat</p>
            </title>
            <aug>
               <au>
                  <snm>Ley</snm>
                  <fnm>RE</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>JK</fnm>
               </au>
               <au>
                  <snm>Wilcox</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Spear</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Miller</snm>
                  <fnm>SR</fnm>
               </au>
               <au>
                  <snm>Bebout</snm>
                  <fnm>BM</fnm>
               </au>
               <au>
                  <snm>Maresca</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Bryant</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Sogin</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Pace</snm>
                  <fnm>NR</fnm>
               </au>
            </aug>
            <source>Applied Environmental Microbiology</source>
            <pubdate>2006</pubdate>
            <volume>72</volume>
            <issue>5</issue>
            <fpage>3685</fpage>
            <lpage>3695</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1128/AEM.72.5.3685-3695.2006</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Computational aspects of host-parasite phylogenies</p>
            </title>
            <aug>
               <au>
                  <snm>Stevens</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Briefings in Bioinformatics</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>4</issue>
            <fpage>339</fpage>
            <lpage>349</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bib/5.4.339</pubid>
                  <pubid idtype="pmpid" link="fulltext">15606970</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>COPYCAT: Co-phylogenetic analysis tool</p>
            </title>
            <aug>
               <au>
                  <snm>Meier-Kolthoff</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Auch</snm>
                  <fnm>AF</fnm>
               </au>
               <au>
                  <snm>Huson</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>G&#246;ker</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>23</volume>
            <issue>7</issue>
            <fpage>898</fpage>
            <lpage>900</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btm027</pubid>
                  <pubid idtype="pmpid" link="fulltext">17267434</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>A statistical test for host-parasite coevolution</p>
            </title>
            <aug>
               <au>
                  <snm>Legendre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Desdevises</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Bazin</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2002</pubdate>
            <volume>51</volume>
            <issue>2</issue>
            <fpage>217</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150252899734</pubid>
                  <pubid idtype="pmpid" link="fulltext">12028729</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Evolutionary relationships, cospeciation, and host switching in avian malaria parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Ricklefs</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Fallon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Birmingham</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Systematic Biology</source>
            <pubdate>2004</pubdate>
            <volume>53</volume>
            <fpage>111</fpage>
            <lpage>119</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150490264987</pubid>
                  <pubid idtype="pmpid">14965906</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Mitochondrial DNA variation of <it>Gyrodactylus </it>spp. (<it>Monogenea, Gyrodactylidae</it>) populations infecting Atlantic salmon, grayling, and rainbow trout in Norway and Sweden</p>
            </title>
            <aug>
               <au>
                  <snm>Hansen</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Bachmann</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Bakke</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>International Journal of Parasitology</source>
            <pubdate>2003</pubdate>
            <volume>33</volume>
            <issue>13</issue>
            <fpage>1471</fpage>
            <lpage>1478</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0020-7519(03)00200-5</pubid>
                  <pubid idtype="pmpid" link="fulltext">14572510</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Initial steps of speciation by geographic isolation and host switch in salmonid pathogen <it>Gyrodactylus salaris </it>(<it>Monogenea: Gyrodactylidae</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Meinil&#228;</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Kuusela</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Zietara</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Lumme</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>International Journal of Parasitology</source>
            <pubdate>2004</pubdate>
            <volume>34</volume>
            <issue>4</issue>
            <fpage>515</fpage>
            <lpage>526</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.ijpara.2003.12.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15013741</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Speciation and host-parasite relationships in the parasite genus <it>Gyrodactylus </it>(<it>Monogenea, Platyhelminthes</it>) infecting gobies of the genus <it>Pomatoschistus </it>(<it>Gobiidae, Teleostei</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Huyse</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Audenart</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Volckaert</snm>
                  <fnm>FA</fnm>
               </au>
            </aug>
            <source>International Journal Parasitology</source>
            <pubdate>2003</pubdate>
            <volume>33</volume>
            <issue>14</issue>
            <fpage>1679</fpage>
            <lpage>1689</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0020-7519(03)00253-4</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Program DistPCoA</p>
            </title>
            <aug>
               <au>
                  <snm>Legendre</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>MJ</fnm>
               </au>
            </aug>
            <publisher>D&#233;partement de sciences biologiques, Universit&#233; de Montr&#233;al</publisher>
            <pubdate>1998</pubdate>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Basic Linear Algebra Package</p>
            </title>
            <url>http://www.netlib.org/blas</url>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Linear Algebra PACKage</p>
            </title>
            <url>http://www.netlib.org/lapack</url>
         </bibl>
         <bibl id="B23">
            <title>
               <p>AMD Core Math Library</p>
            </title>
            <url>http://www.amd.com/acml</url>
         </bibl>
         <bibl id="B24">
            <title>
               <p>GNU scientific library</p>
            </title>
            <url>http://www.gnu.org/software/gsl</url>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Some large-scale matrix computation problems</p>
            </title>
            <aug>
               <au>
                  <snm>Bai</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Fahey</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Golub</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Journal of Computational and Applied Mathematics</source>
            <pubdate>1996</pubdate>
            <volume>74</volume>
            <fpage>71</fpage>
            <lpage>89</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/0377-0427(96)00018-0</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Eigenfunction properties and approximations of selected incidence matrices employed in spatial analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Griffith</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>Linear Algebra and its Applications</source>
            <pubdate>2000</pubdate>
            <volume>321</volume>
            <fpage>95</fpage>
            <lpage>112</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1016/S0024-3795(00)00031-8</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>JADAMILU: a code for computing selected eigenvalues of large sparse symmetric matrices</p>
            </title>
            <aug>
               <au>
                  <snm>Bollh&#246;fer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Notay</snm>
                  <fnm>Y</fnm>
               </au>
            </aug>
            <source>Tech rep</source>
            <publisher>Universit&#233; Libre de Bruxelles, Brussels</publisher>
            <pubdate>2006</pubdate>
            <url>http://homepages.ulb.ac.be/~jadamilu/</url>
         </bibl>
         <bibl id="B28">
            <aug>
               <au>
                  <snm>Press</snm>
                  <fnm>WH</fnm>
               </au>
               <au>
                  <snm>Teukolsky</snm>
                  <fnm>SA</fnm>
               </au>
               <au>
                  <snm>Vetterling</snm>
                  <fnm>WT</fnm>
               </au>
               <au>
                  <snm>Flannery</snm>
                  <fnm>BP</fnm>
               </au>
            </aug>
            <source>Numerical Recipes in C</source>
            <publisher>Cambridge University Press</publisher>
            <pubdate>1992</pubdate>
         </bibl>
         <bibl id="B29">
            <title>
               <p>EMBL database</p>
            </title>
            <url>http://www.ebi.ac.uk/embl</url>
         </bibl>
         <bibl id="B30">
            <title>
               <p>
                  <it>Ustilaginomycetes</it>
               </p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Begerow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Piepenbring</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Berbee</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>The Mycota</source>
            <pubdate>2001</pubdate>
            <volume>7</volume>
            <issue>Part B</issue>
            <fpage>57</fpage>
            <lpage>83</lpage>
         </bibl>
         <bibl id="B31">
            <aug>
               <au>
                  <snm>Vanky</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>European Smut Fungi</source>
            <publisher>G. Fischer</publisher>
            <pubdate>1994</pubdate>
         </bibl>
         <bibl id="B32">
            <title>
               <p>Cereal smuts in Manitoba and Saskatchewan, 1989&#8211;95</p>
            </title>
            <aug>
               <au>
                  <snm>Thomas</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Menzies</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Canadian Journal of Plant Pathology</source>
            <pubdate>1997</pubdate>
            <volume>19</volume>
            <issue>2</issue>
            <fpage>161</fpage>
            <lpage>165</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>The <it>Exobasidiales</it>: An evolutionary hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Begerow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mycological Progress</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>187</fpage>
            <lpage>199</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1007/s11557-006-0018-7</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A Reconciliation Analysis of Host Switching in Plant-Fungal Symbioses</p>
            </title>
            <aug>
               <au>
                  <snm>Jackson</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Evolution</source>
            <pubdate>2004</pubdate>
            <volume>58</volume>
            <issue>9</issue>
            <fpage>1909</fpage>
            <lpage>1923</lpage>
            <xrefbib>
               <pubid idtype="pmpid">15521451</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>European Smut Fungi (<it>Ustilaginomycetes </it>p.p. and <it>Microbotryales</it>) according to recent nomenclature</p>
            </title>
            <aug>
               <au>
                  <snm>Vanky</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Mycologia Balcanica</source>
            <pubdate>2005</pubdate>
            <volume>2</volume>
            <fpage>169</fpage>
            <lpage>178</lpage>
         </bibl>
         <bibl id="B36">
            <title>
               <p><it>Ustilaginomycetes </it>on <it>Selaginella</it></p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Vanky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Begerow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mycologia</source>
            <pubdate>1999</pubdate>
            <volume>91</volume>
            <issue>3</issue>
            <fpage>475</fpage>
            <lpage>484</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/3761348</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>The <it>Georgefischeriales</it>: A Phylogenetic Hypothesis</p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Begerow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Nagler</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mycological Research</source>
            <pubdate>2001</pubdate>
            <volume>105</volume>
            <issue>04</issue>
            <fpage>416</fpage>
            <lpage>424</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1017/S0953756201003690</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p><it>Gjaerumia</it>, a new genus in the <it>Georgefischeriales </it>(<it>Ustilaginomycetes</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Lutz</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mycological Research</source>
            <pubdate>2005</pubdate>
            <volume>109</volume>
            <issue>11</issue>
            <fpage>1250</fpage>
            <lpage>1258</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1017/S0953756205003783</pubid>
                  <pubid idtype="pmpid">16279418</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p><it>Muribasidiospora: Microstromatales or Exobasidiales</it>?</p>
            </title>
            <aug>
               <au>
                  <snm>Begerow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Mycological Research</source>
            <pubdate>2001</pubdate>
            <volume>105</volume>
            <issue>07</issue>
            <fpage>798</fpage>
            <lpage>810</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1017/S0953756201004208</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p>Phylogeny of the <it>Quambalariaceae </it>fam. nov., including important <it>Eucalyptus </it>pathogens in South Africa and Australia</p>
            </title>
            <aug>
               <au>
                  <snm>de Beer</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Begerow</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Pegg</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Crous</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Wingfield</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Studies in Mycology</source>
            <pubdate>2006</pubdate>
            <volume>55</volume>
            <fpage>289</fpage>
            <lpage>298</lpage>
         </bibl>
         <bibl id="B41">
            <title>
               <p>The <it>Cryptobasidiaceae </it>of tropical Central and South America</p>
            </title>
            <aug>
               <au>
                  <snm>Hendrichs</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bauer</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Oberwinkler</snm>
                  <fnm>F</fnm>
               </au>
            </aug>
            <source>Sydowia</source>
            <pubdate>2003</pubdate>
            <volume>55</volume>
            <fpage>33</fpage>
            <lpage>64</lpage>
         </bibl>
         <bibl id="B42">
            <title>
               <p><it>Exobasidium</it>, a taxonomic reassessment applied to the European species</p>
            </title>
            <aug>
               <au>
                  <snm>Nannfeldt</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Symbolae Botanicae Upsaliensis</source>
            <pubdate>1981</pubdate>
            <volume>23</volume>
            <fpage>1</fpage>
            <lpage>72</lpage>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Smut fungi: (<it>Ustilaginomycetes </it>p.p. and <it>Microbotryales, Basidiomycota</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Piepenbring</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Flora Neotropica Monograph</source>
            <pubdate>2003</pubdate>
            <volume>86</volume>
            <fpage>1</fpage>
            <lpage>291</lpage>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Die Brandpilze Deutschlands (<it>Ustilaginales</it>)</p>
            </title>
            <aug>
               <au>
                  <snm>Scholz</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Scholz</snm>
                  <fnm>I</fnm>
               </au>
            </aug>
            <source>Englera</source>
            <pubdate>1988</pubdate>
            <issue>8</issue>
            <fpage>1</fpage>
            <lpage>691</lpage>
            <xrefbib>
               <pubid idtype="doi">10.2307/3776736</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B45">
            <aug>
               <au>
                  <snm>Vanky</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>McKenzie</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Smut fungi of New Zealand</source>
            <publisher>Fungal Diversity Press, Centre for Research in Fungal Diversity, Dept. of Ecology &amp; Biodiversity, University of Hong Kong Hong Kong</publisher>
            <pubdate>2002</pubdate>
         </bibl>
         <bibl id="B46">
            <title>
               <p>GenBank</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov</url>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Synonymie-Index der Schweizer Flora und der angrenzenden Gebiete</p>
            </title>
            <aug>
               <au>
                  <snm>Palese</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Moser</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Tech rep</source>
            <publisher>Zentrum des Datenverbundnetzes der Schweizer Flora</publisher>
            <pubdate>1997</pubdate>
         </bibl>
         <bibl id="B48">
            <title>
               <p>PLOS retraction</p>
            </title>
            <url>http://compbiol.plosjournals.org/perlserv/?request=get-document&amp;doi=10.1371/journal.pcbi.0030158</url>
         </bibl>
      </refgrp>
   </bm>
</art>
