<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-6-27</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Evolution of four gene families with patchy phylogenetic distributions: influx of genes into protist genomes</p>
         </title>
         <aug>
            <au id="A1" ca="yes">
               <snm>Andersson</snm>
               <mi>O</mi>
               <fnm>Jan</fnm>
               <insr iid="I1"/>
               <email>jan.andersson@icm.uu.se</email>
            </au>
            <au id="A2">
               <snm>Hirt</snm>
               <mi>P</mi>
               <fnm>Robert</fnm>
               <insr iid="I2"/>
               <email>r.p.hirt@ncl.ac.uk</email>
            </au>
            <au id="A3">
               <snm>Foster</snm>
               <mi>G</mi>
               <fnm>Peter</fnm>
               <insr iid="I3"/>
               <email>p.foster@nhm.ac.uk</email>
            </au>
            <au id="A4">
               <snm>Roger</snm>
               <mi>J</mi>
               <fnm>Andrew</fnm>
               <insr iid="I4"/>
               <email>andrew.roger@dal.ca</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Institute of Cell and Molecular Biology, Uppsala University, Biomedical Center, Box 596, S-751 24 Uppsala, Sweden</p>
            </ins>
            <ins id="I2">
               <p>School of Biology, The Devonshire Building, The University of Newcastle upon Tyne, NE1 7RU, UK</p>
            </ins>
            <ins id="I3">
               <p>Department of Zoology, The Natural History Museum, Cromwell Road, London SW7 5BD, UK</p>
            </ins>
            <ins id="I4">
               <p>The Canadian Institute for Advanced Research, Program in Evolutionary Biology, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia B3H 1X5, Canada</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2006</pubdate>
         <volume>6</volume>
         <issue>1</issue>
         <fpage>27</fpage>
         <url>http://www.biomedcentral.com/1471-2148/6/27</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">16551352</pubid>
               <pubid idtype="doi">10.1186/1471-2148-6-27</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>25</day>
               <month>8</month>
               <year>2005</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>21</day>
               <month>3</month>
               <year>2006</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>21</day>
               <month>3</month>
               <year>2006</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2006</year>
         <collab>Andersson et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Lateral gene transfer (LGT) in eukaryotes from non-organellar sources is a controversial subject in need of further study. Here we present gene distribution and phylogenetic analyses of the genes encoding the hybrid-cluster protein, A-type flavoprotein, glucosamine-6-phosphate isomerase, and alcohol dehydrogenase E. These four genes have a limited distribution among sequenced prokaryotic and eukaryotic genomes and were previously implicated in gene transfer events affecting eukaryotes. If our previous contention that these genes were introduced by LGT independently into the diplomonad and <it>Entamoeba </it>lineages were true, we expect that the number of putative transfers and the phylogenetic signal supporting LGT should be stable or increase, rather than decrease, when novel eukaryotic and prokaryotic homologs are added to the analyses.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>The addition of homologs from phagotrophic protists, including several <it>Entamoeba </it>species, the pelobiont <it>Mastigamoeba balamuthi</it>, and the parabasalid <it>Trichomonas vaginalis</it>, and a large quantity of sequences from genome projects resulted in an apparent increase in the number of putative transfer events affecting all three domains of life. Some of the eukaryotic transfers affect a wide range of protists, such as three divergent lineages of Amoebozoa, represented by <it>Entamoeba</it>, <it>Mastigamoeba</it>, and <it>Dictyostelium</it>, while other transfers only affect a limited diversity, for example only the <it>Entamoeba </it>lineage. These observations are consistent with a model where these genes have been introduced into protist genomes independently from various sources over a long evolutionary time.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>Phylogenetic analyses of the updated datasets using more sophisticated phylogenetic methods, in combination with the gene distribution analyses, strengthened, rather than weakened, the support for LGT as an important mechanism affecting the evolution of these gene families. Thus, gene transfer seems to be an on-going evolutionary mechanism by which genes are spread between unrelated lineages of all three domains of life, further indicating the importance of LGT from non-organellar sources into eukaryotic genomes.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>During the past five years a number of reports have appeared indicating that protists acquire genes via LGT <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. Recently, phylogenomic analyses of the complete genome sequences of <it>Entamoeba histolytica </it>and <it>Cryptosporidium parvum </it>indicated that several genes of these human parasites, including some key metabolic enzymes, most likely had been acquired from prokaryotes. 96 cases of relatively recent LGT from prokaryotic sources were reported for the former and 24 for the latter <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. There are reasons to believe that LGT actually does influence protist genome evolution, since foreign genetic material is constantly entering the cell via food organisms. In addition, many protists harbour prokaryotes or eukaryotes (such as those that gave rise to secondary and tertiary plastids <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>) as endosymbionts. As a result, the occasional incorporation of genes from engulfed cells into the nucleus may facilitate a process of directional transfer of genes from the food organisms to phagotrophic eukaryotes over evolutionary time <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B8">8</abbr></abbrgrp>. There is a growing amount of data that are consistent with this hypothesis. For instance, LGT has mostly been detected in phagotrophic lineages <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B9">9</abbr></abbrgrp>. Moreover, the introduced genes in these lineages seem to have originated from organisms sharing the same environment with the recipient organisms &#8211; the anaerobic diplomonad lineage was found to have acquired genes from anaerobic prokaryotes in most cases <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, 22% of candidate donors lineages in LGT cases for <it>Entamoeba histolytica </it>involve relatives of the Bacteroides group which are abundant in human digestive tract <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, while the alga <it>Bigelowiella natans </it>has acquired genes mostly from other algae <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. These observations are consistent with the idea that physical proximity in the environment of the donor and recipient lineages may greatly enhance the probability of a successful gene transfer event <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>, a notion recently supported by phylogenetic analyses of 144 prokaryotic proteomes identifying gene pools shared between organisms (including distantly related one) occupying the same ecological niche <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>.</p>
         <p>Most of the claims of LGT in protists are based on unexpected phylogenetic relationships between protist and prokaryotic sequences <abbrgrp><abbr bid="B2">2</abbr><abbr bid="B4">4</abbr></abbrgrp>. However, phylogenetic methods are susceptible to systematic error that could lead to false interpretations of transfer events <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. For example, a recent phylogenetic analysis indicated that the hydrogenosomal NuoF protein from <it>Trichomonas vaginalis </it>(a subunit of respiratory chain complex I) branched outside of a clade of mitochondrial homologs <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>, leading the authors to propose a separate (non-mitochondrial) origin for this protein. However, these analyses failed to take into account the heterogeneity of amino acid (aa) composition displayed by sequences in this dataset <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. In contrast, when the dataset is analysed with methods designed to avoid this potential artefact, the <it>T. vaginalis </it>sequence branched within the mitochondrial cluster <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, in agreement with the well-supported hypothesis that <it>Trichomonas </it>hydrogenosomes share an evolutionary origin with mitochondria <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Similarly, Cpn60 phylogenies with different taxonomic samplings led to important differences in the phylogenetic relationships amongst anaerobic protists including <it>E. histolytica </it>and two diplomonads (<it>Giardia </it>and <it>Spironucleus</it>) eliminating the possibility of an LGT event between <it>Entamoeba </it>and <it>Giardia </it>lineages<abbrgrp><abbr bid="B2">2</abbr><abbr bid="B17">17</abbr></abbrgrp>. In both these cases, extreme divergence coupled with compositional biases in these sequences suggested, correctly, their unexpected branching patterns were due to phylogenetic artefacts. In contrast, the phylogenetic analyses of the alanyl and prolyl tRNA synthetases show the expected phylogenetic relationships amongst prokaryotes and eukaryotes with the exception that several protist sequences were found nested within Archaea as sisters to the Nanoarchaeota sequences. In this case, the observations could not be attributed to any known phylogenetic artefacts and were most easily explained in <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> as gene transfer events from the archaeal lineage to the protists.</p>
         <p>Interpretations of phylogenetic analyses of proteins with a more patchy distribution in the tree of life are more challenging than the cases described above. For example, gene duplications followed by differential gene loss may also yield the unexpected phylogenetic relationships that are hallmarks of LGT. In addition, genes with a patchy distribution may only be present in one or a few lineages in each organismal group making it potentially more difficult to identify donor and recipient lineages of gene transfer events since such assignments require that recipient lineages are nested within the donor group. Fortunately, the number of complete genome sequences is steadily growing, and should clarify the patterns of gene distribution within the tree of life. In combination with thorough phylogenetic studies, analyses of the presence and absence of genes in completely sequenced genomes should be very able to differentiate putative cases of gene transfers in gene families with a patchy phylogenetic distribution from other scenarios <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>.</p>
         <p>To investigate whether phylogenetic artefacts, and/or unappreciated gene duplication and loss events, have influenced previous interpretations of LGT, we have broadened the taxon sampling of four gene families with patchy phylogenetic distributions, previously implicated in gene transfer events in diplomonads and <it>E. histolytica </it><abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. The updated datasets &#8211; <it>priS </it>(encoding a hybrid-cluster protein), <it>fprA </it>(A-type flavoprotein), <it>nagB </it>(glucosamine-6-phosphate isomerase), and <it>adhE </it>(alcohol dehydrogenase E) &#8211; were also analysed using more sophisticated phylogenetic methods. We have previously argued that these four genes were introduced into the genomes of diplomonads and <it>Entamoeba </it>from different sources based on phylogenetic analyses <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. If these previous observations were really indicative of LGT, increased sampling of eukaryotic and prokaryotic taxa should result in an equal or increased number of distinct eukaryotic groups in the phylogenetic analyses (i.e. eukaryotes would be polyphyletic) and stronger support for tree topologies consistent with LGT. Alternatively, a different pattern is expected if the interpretation of gene transfers were based on phylogenetic artefacts and/or differential losses. In the former case, increased taxonomic sampling should, if anything, provide evidence for a common ancestry for the diplomonad and <it>Entamoeba </it>sequences &#8211; as improved within-clade taxonomic sampling tends to improve phylogenetic accuracy <abbrgrp><abbr bid="B20">20</abbr></abbrgrp> &#8211; reducing the number of independent eukaryote groups observed. Alternatively, if the 'polyphyletic eukaryotes' pattern was due to ancient duplications and poor paralog sampling, we would expect newly sampled sequences to cluster in the different eukaryotic clades and recover mirror eukaryotic phylogenies.</p>
         <p>To test these alternative hypotheses, we focused our active sampling of taxa to relatives of <it>Entamoeba</it>; the amphizoic <it>E. moshkovskii</it>, the turtle parasite <it>E. terrapinae</it>, the snake parasite <it>E. invadens</it><abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>, and the more distantly related free-living amoeboflagellate <it>Mastigamoeba balamuthi </it><abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>, and a putative relative of diplomonads; the parabasalid <it>Trichomonas vaginalis </it><abbrgrp><abbr bid="B18">18</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp> (the cause of trichomoniasis, a sexually transmitted disease in humans <abbrgrp><abbr bid="B28">28</abbr></abbrgrp>). In addition, we updated our datasets with all currently available homologous sequences in the public databases as well as from a number of ongoing genome sequencing projects of eukaryotes. Our updated phylogenies using more sophisticated models of aa substitutions in combination with analyses of the distribution pattern of the genes indicate that gene transfer hypotheses currently best explain the data.</p>
      </sec>
      <sec>
         <st>
            <p>Results and discussion</p>
         </st>
         <sec>
            <st>
               <p>Patchy distribution in eukaryotes</p>
            </st>
            <p>In a previous study we identified diplomonad genes potentially derived from LGT <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Here we have tested if alternative hypotheses and/or phylogenetic artefacts could account for these observations. We have broadened the eukaryotic taxon sampling of four genes with a limited distribution among eukaryotes, both by cloning and sequencing new genes and mining of the available sequence databases. By using this approach we are also able to refine the timing of putative LGT events with respect to organismal divergences and to gain insights into the evolution of gene families with a patchy distribution in general. All four genes were obtained from <it>Entamoeba invadens</it>, <it>Entamoeba moshkovskii </it>and <it>Entamoeba terrapinae</it>, and <it>priS</it>, <it>fprA </it>and <it>nagB </it>(partial) sequences were obtained from <it>Mastigamoeba balamuthi </it>with PCR using genomic DNA as template. Two <it>T. vaginalis </it>cDNA clones (<it>nagB </it>and <it>fprA</it>) were also completely sequenced. A <it>T. vaginalis priS </it>sequence and a <it>M. balamuthi adhE </it>sequence had appeared in the databases since the previous analysis. Furthermore, a <it>N. gruberi priS </it>cDNA clone was completely sequenced (see <supplr sid="S1">Additional File 1</supplr> for complete listing of the datasets). To further investigate the distribution of these genes in complete or nearly complete genome sequences, we performed similarity searches against available data from ongoing eukaryotic genome projects and retrieved the significant BlastP hits. We also combined these results with the information from published genomes and mapped the occurrences of the genes onto the current hypothesis of organismal relationships among eukaryotes (Figure <figr fid="F1">1</figr>) <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr><abbr bid="B31">31</abbr></abbrgrp>. The four genes show a very patchy distribution often with both presences and absences within the same eukaryotic "super group". Two extreme alternative explanations may be invoked to explain these distribution pattern within eukaryotes; (i) presence of all four genes in the last common eukaryotic ancestor followed by many differential losses within the "super groups", or (ii) absence of the genes in the ancestor followed by independent gene acquisitions in all divergent lineages that possess the genes. The duplication and gene loss scenario becomes less likely the more independent convergent gene loss events need to be postulated. Therefore, phylogenetic analyses of the individual genes should help to distinguish between these hypotheses.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Distribution of the four genes in the taxa sampled in this study</p>
               </caption>
               <text>
                  <p><b>Distribution of the four genes in the taxa sampled in this study. </b>A hypothetical tree of eukaryotes for which genomes have been fully sampled and published, *; is close to completion, **; or only partially sampled (genome sequence survey or expressed sequence tags), ***; indicating their classification into "super-groups" [29-31], showing the presence or absence of the four genes in the study. Please notice that the gene absences in the genomes that are close to completion are unconfirmed, they may turn into presences upon publication. A and B refer to strongly separated groups in the phylogenetic analyses, as indicated in Figures 2-4 &amp; 6. The <it>priS </it>genes encode the hybrid-cluster proteins, <it>fprA </it>genes encode the A-type flavoproteins, <it>nagB </it>genes encode glucosamine-6-phosphate isomerase proteins and the <it>adhE </it>genes encode the alcohol dehydrogenase E proteins.</p>
               </text>
               <graphic file="1471-2148-6-27-1"/>
            </fig>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Lists accession numbers, keys to short names used in alignment files, taxonomic descriptions, and the basis for exclusion from the phylogenetic analyses for all datasets used in the study.</p>
               </text>
               <file name="1471-2148-6-27-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analyses</p>
            </st>
            <p>In our previous study we excluded sequences that showed indications of a biased aa composition to reduce the impact of phylogenetic artefacts due to compositional heterogeneity where possible &#8211; the available methods at the time assumed aa compositional homogeneity <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Here we approach this potential problem by including analyses with methods and models that are designed to mitigate the potential misleading effects of compositional heterogeneity. Each aa in the alignments was recoded to the six groups of chemically related aa that commonly replace one another <abbrgrp><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>, an approach identical to the recent analyses of the NuoF protein <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Previously, we were also limited by the size of the datasets <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, since the maximum likelihood (ML) methods were very computationally demanding at the time. The release of the PHYML software solves this problem since it is able to perform bootstrap analyses of a large number of sequences (>100) in a reasonable computational time <abbrgrp><abbr bid="B34">34</abbr></abbrgrp>. The recently released ModelGenerator software also ensures the usage of the optimal available model for aa substitutions in the ML analyses <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. These advances in the field of phylogenetics enabled us to perform more detailed analyses that include all available members of each gene family. Information about the datasets and parameters for the phylogenetic analyses are listed in <supplr sid="S2">Additional File 2</supplr>, and the phylogenetic trees with support values from the two methods are shown in Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>.</p>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Protein maximum likelihood tree of hybrid-cluster protein (<it>priS </it>gene)</p>
               </caption>
               <text>
                  <p><b>Protein maximum likelihood tree of hybrid-cluster protein (<it>priS </it>gene). </b>ML tree based on 417 unambiguously aligned aa positions of the hybrid-cluster protein. Bootstrap support values >50% from ML analyses are shown above the branches. Posterior probabilities for the Bayesian consensus tree of the grouped aa analysis are shown below the branches. When no space is available a line indicates the position of the support values. Absence of a posterior probability value at a node indicates that this node was lacking in the Bayesian consensus tree. Details about the phylogenetic analyses are found in the Methods section and Additional<supplr sid="S2">Additional File 2</supplr>. The grey boxes A and B indicate strongly separated groups which include eukaryotic sequences. The tree is arbitrarily rooted. Eubacteria are labelled black, Archaea are labelled blue, and the Eukaryotes are labelled according to their classification into "super-groups" [29, 30]: opisthokonts (orange), amoebozoa (purple), chromalveolates (red), plants (green) and excavates (brown) (see Figure 1).</p>
               </text>
               <graphic file="1471-2148-6-27-2"/>
            </fig>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Protein maximum likelihood trees of A-type flavoprotein (<it>fprA </it>gene)</p>
               </caption>
               <text>
                  <p><b>Protein maximum likelihood trees of A-type flavoprotein (<it>fprA </it>gene). </b>ML trees based 269 unambiguously aligned aa positions of the A-type flavoprotein. The boxes indicate sequences that have an approximately 450 aa long conserved C-terminal extension of the flavoprotein which is absent from all other sequences in the alignment (see <supplr sid="S4">Additional File 4</supplr> for further analyses and discussion). The grey boxes A and B indicate strongly separated groups which include eukaryotic sequences. The tree is arbitrarily rooted. Details about the phylogenetic analyses are found in the Methods section and <supplr sid="S2">Additional File 2</supplr>. Labelling as in Figure 2.</p>
               </text>
               <graphic file="1471-2148-6-27-3"/>
            </fig>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>Protein maximum likelihood trees of the short and long versions of glucosamine-6-phosphate isomerase (<it>nagB </it>gene)</p>
               </caption>
               <text>
                  <p><b>Protein maximum likelihood trees of the short and long versions of glucosamine-6-phosphate isomerase (<it>nagB </it>gene)</b>. ML tree based on 229 unambiguously aligned aa positions from the N-terminal part of the alignment of the glucosamine-6-phosphate isomerase protein. The grey boxes A and B indicate strongly separated groups which include eukaryotic sequences. The sequences in the B box (with the exception of the <it>R. baltica </it>3 sequence) have an approximately 500 aa long conserved C-terminal extension of the protein which is absent from all other sequences in the alignment. The sequences in box B, together with the sequences indicated with asterisks were excluded in a separate analysis shown in <supplr sid="S5">Additional File 5</supplr>, to test the influence of the removal of the long version of the protein and long branches on the relative positions of eukaryotic sequences. The tree is arbitrarily rooted. Details about the phylogenetic analyses are found in the Methods section and <supplr sid="S2">Additional File 2</supplr>. Labelling as in Figure 2.</p>
               </text>
               <graphic file="1471-2148-6-27-4"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>Protein maximum likelihood trees of the long version of glucosamine-6-phosphate isomerase (<it>nagB </it>gene)</p>
               </caption>
               <text>
                  <p><b>Protein maximum likelihood trees of the long version of glucosamine-6-phosphate isomerase (<it>nagB </it>gene)</b>. Phylogenetic tree based on 560 unambiguously aligned aa positions from the glucosamine-6-phosphate isomerase sequences that have the long C-terminal extension (box B in Figure 4). In a separate analysis the partial <it>Mastigamoeba balamuthi </it>sequence was included and its position is indicated with an arrow with the bootstrap support value in parenthesis. The tree is arbitrarily rooted. Details about the phylogenetic analyses are found in the Methods section and <supplr sid="S2">Additional File 2</supplr>. Labelling as in Figure 2.</p>
               </text>
               <graphic file="1471-2148-6-27-5"/>
            </fig>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Protein maximum likelihood tree of alcohol dehydrogenase E (<it>adhE </it>gene)</p>
               </caption>
               <text>
                  <p><b>Protein maximum likelihood tree of alcohol dehydrogenase E (<it>adhE </it>gene). </b>Phylogenetic tree based on 796 unambiguously aligned aa positions of the alcohol dehydrogenase E protein sequences. The grey boxes A and B indicate strongly separated groups which include eukaryotic sequences. The tree is arbitrarily rooted. Details about the phylogenetic analyses are found in the Methods section and <supplr sid="S2">Additional File 2</supplr>. Labelling as in Figure 2.</p>
               </text>
               <graphic file="1471-2148-6-27-6"/>
            </fig>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>Information about the datasets and parameters of the phylogenetic analyses.</p>
               </text>
               <file name="1471-2148-6-27-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>All datasets in the phylogenetic analyses with grouped aa using the Metropolis-coupled Markov Chain Monte Carlo (MCMC) strategy showed convergence, indicated by good agreements between the split support values of the duplicate runs (<supplr sid="S3">Additional File 3</supplr>). Two of the alignments (the long version of glucosamine-6-phosphate isomerase and the prismane protein) also showed a good model composition fit indicated by both posterior predictive simulations and tests for homogeneity using <it>X</it><sup>2 </sup>statistics and simulations to get the null distribution (<it>p</it><sub><it>t </it></sub>> 0.05 and P<sub>sim </sub>> 0.05, respectively) <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>, while the original datasets did not (P<sub>sim </sub>&lt; 0.05) (<supplr sid="S2">Additional File 2</supplr>). This indicates that the recoding procedure has reduced the potential misleading effects of compositional heterogeneity in these two analyses. The other three datasets (A-type flavoprotein, the short version of glucosamine-6-phosphate isomerase, and alcohol dehydrogenase E) showed low <it>p</it><sub><it>t </it></sub>and P<sub>sim </sub>values (&lt;0.05), suggesting that compositional heterogeneity might still represent a source of artefactual results in these datasets (<supplr sid="S2">Additional File 2</supplr>). Nevertheless, none of these grouped aa datasets failed the tests of the model composition when the &#967;<sup>2 </sup>curve was used to get the null distribution, while two of the these three original datasets did, suggesting that the recoding procedure had improved the model fit (<supplr sid="S2">Additional File 2</supplr>), reducing the potential for estimation biases. At the very least, these analyses complement the more "standard" ML analyses by showing what aspects of the phylogenies are robust to aa recoding and reducing any potential effects of saturation.</p>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p>Figures showing the split support for the two runs in the grouped aa analyses plotted one against the other as indicators of convergence.</p>
               </text>
               <file name="1471-2148-6-27-S3.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The updated phylogenetic analyses show sequences highly scrambled with respect to expected organismal relationships (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>), as previously observed for these genes <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. Thus, the earlier finding that these proteins produce phylogenetic trees that are incompatible with organismal phylogenies is robust with respect to improved taxon sampling and more detailed phylogenetic analyses &#8211; the number of eukaryotic groups (polyphyly) in the trees have increased, rather than decreased. In all analyses the eukaryotic sequences are found in at least two distinct regions of the trees nested with prokaryotic sequences (A and B boxes in Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr> &amp;<figr fid="F6">6</figr>), which are separated with strong support values. These strong separations could, in principle, be due to ancient duplication events followed by a large number of differential losses. Indeed, the presence of the same prokaryotic group in both regions in several of the phylogenetic analyses &#8211; low G+C Gram positives are for example found in both box A and B in Figures <figr fid="F2">2</figr> and <figr fid="F6">6</figr> &#8211; superficially supports ancient duplications. Such scenarios are expected to result in phylogenetic relationships for each paralog that mirror the organismal relationships. This is not observed in our analyses (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr> &amp;<figr fid="F6">6</figr>). Furthermore, duplication and loss scenarios require that the gene was present in multiple copies in the last common universal ancestor and retained for a long evolutionary time. Thus, to explain the patterns we observe in the phylogenies (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>) a eukaryotic ancestral genome that encoded a larger number of distantly related paralogs of the four genes than present in any of the extant eukaryote genomes would have to be inferred (Figure <figr fid="F1">1</figr>). To our knowledge, no data exist supporting a universal trend for drastic genome shrinkage in a relative recent evolutionary time. Therefore, gene duplication and differential losses alone do not seem sufficient to explain the unexpected phylogenetic relationships observed in our analyses (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>).</p>
            <p>However, the number of independent gene losses has to be weighed against the possibility of a later introduction of the genes into eukaryotes by LGT events. Yet, as none of the eukaryotic groups are found nested within a natural prokaryotic group with strong bootstrap support (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>), it is difficult to identify donor and recipient lineages involved in the putative LGT events. Thus, the presence of these genes in a subset of the sampled eukaryotes are neither easily explained by vertical inheritance of the genes from the common ancestor of all eukaryotes, nor by a distinct number of easily identified gene transfer events. These phylogenies need to be carefully interpreted in combination with analysis of gene distribution patterns, as well as in the context of the biology of the available organisms.</p>
         </sec>
         <sec>
            <st>
               <p>Hybrid-cluster protein</p>
            </st>
            <p>Genes for the hybrid-cluster protein (<it>priS</it>) have been identified from a large number of prokaryotes, as well as several eukaryotes (Figures <figr fid="F1">1</figr> &amp;<figr fid="F2">2</figr>). However, the cellular function of the protein is not well established; potential roles in the biological nitrogen cycle <abbrgrp><abbr bid="B37">37</abbr><abbr bid="B38">38</abbr></abbrgrp> and the adaptive response to oxidative stress <abbrgrp><abbr bid="B39">39</abbr></abbrgrp> have been suggested. Although the gene is found in all three domains of life, its distribution within the domains are patchy; for example, it is relatively widespread among proteobacterial genomes, while it has only been found in a single high G+C Gram positive species and a single cyanobacterium (Figure <figr fid="F2">2</figr>). The occurrence of the gene in a large number of unrelated lineages in combination with the absence from more closely related species is most simply explained by cross-species transmission via gene transfer. Indeed, the phylogeny of the hybrid-cluster protein strongly suggests a number of intra- and inter-domain prokaryotic LGT events, with sequences from organismal groups such as proteobacteria, low G+C Gram positives, and euryarchaeota branching in several distinct regions of the tree, often branching with unrelated lineages with strong support values (Figure <figr fid="F2">2</figr>). The eukaryotes are found within two large groups of sequences including both archaeal and bacterial homologs, separated by a long and strongly supported branch (box A and B in Figure <figr fid="F2">2</figr>). One clade contains two <it>Trichomonas </it>sequences that are the sole eukaryotes in one of these groups (box B in Figure <figr fid="F2">2</figr>). A prokaryote-to-eukaryote LGT event affecting the parabasalid lineage after the divergence from other eukaryotes, including diplomonads, is a more parsimonious explanation for the position of the <it>T. vaginalis </it>sequences than loss of this version of the gene in all other eukaryotic species, provided <it>T. vaginalis </it>is not basal to all the other eukaryotes included in this analysis <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. However, the prokaryotic donor lineage for the <it>T. vaginalis </it>sequences is difficult to determine from the current data and analyses.</p>
            <p>The eukaryotic sequences found in the second group within the hybrid-cluster protein phylogeny are found in four polyphyletic groups (box A in Figure <figr fid="F2">2</figr>). However, only one of these groups is separated from the other with a significant statistical support; the <it>Entamoeba </it>sequences form a weakly supported group with a cyanobacterial sequence which are found as a sister group of three &#948;-proteobacterial sequences with a posterior probability of 1.00 in the grouped aa analysis. This is suggestive of a eubacteria-to-<it>Entamoeba </it>LGT event, perhaps with cyanobacteria or &#948;-proteobacteria as the donor lineage (Figure <figr fid="F2">2</figr>). Thus, at the very least the phylogeny of the hybrid-cluster protein suggests two transfer events from prokaryotic donors into protists. Taken at face value, the tree also supports additional transfers into various protist lineages. Indeed, the diplomonad lineages are nested within proteobacterial sequences in both the ML and grouped aa Bayesian analyses, although with weak statistical support in both cases. At any rate, this observation is suggestive of a LGT event from a proteobacterium to the diplomonad lineage. Two &#945;-proteobacterial sequences are nested within the other eukaryotic <it>priS </it>sequences in box A with weak bootstrap support (Figure <figr fid="F2">2</figr>), which could indicate an origin via endosymbiotic gene transfer. However, the absence of the gene in mitochondrial genomes in combination with its absence from the nuclear genome of most eukaryotes related to pelobionts, diatoms, heterolobosea, and green algae (Figure <figr fid="F1">1</figr>), makes such an origin doubtful. Still, the weakly supported separation of diplomonad and these eukaryotic sequences may be artefactual &#8211; in reality they could represent a monophyletic group that inherited this gene from their common ancestor. If so, at least eight independent losses of <it>priS </it>in the apicomplexan/ciliate, oomycete, land plant, parabasalid, kinetoplastid, opisthokont, mycetozoan, and <it>Entamoeba </it>lineages would have to be invoked (Figure <figr fid="F1">1</figr>). Since such widespread and relatively recent independent losses appear unlikely, we favour a scenario where also the <it>N. gruberi</it>, <it>M. balamuthi</it>, <it>T. pseudonana</it>, and <it>C. reinhardtii </it>sequences have been distributed by an unknown number of gene transfer events from unsampled prokaryotic lineages or between microbial eukaryotic lineages. So far the <it>priS </it>gene has only been found in microbial eukaryotes. This circumstantially supports the hypothesis that the absence of a germ/soma separation in unicellular organisms increases their chance of acquiring genes by LGT <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>. We predict that additional taxon sampling will confirm the current trend of preferential presence in unicellular eukaryotes and will further clarify the origins of the eukaryotic <it>priS </it>genes.</p>
         </sec>
         <sec>
            <st>
               <p>A-type flavoprotein</p>
            </st>
            <p>The <it>fprA </it>gene encodes A-type flavoprotein, a protein recently inferred to play a role in the detoxification of nitric oxide and/or oxygen in <it>E. histolytica </it>and was suggested to derive from a relatively recent LGT event from a prokaryotic donor <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Furthermore, it has been demonstrated that <it>T. vaginalis </it>is able to degrade nitric oxide under microaerophilic conditions, an activity proposed to be associated with the presence of A-type flavoproteins in these parasites <abbrgrp><abbr bid="B40">40</abbr></abbrgrp>. Again, the gene is only found in a subset of the sequenced prokaryotes &#8211; mostly species able to grow in oxygen-poor environments (Figure <figr fid="F3">3</figr>). One exception is the widespread presence of the A-type flavoprotein within cyanobacteria, possibly indicating that the protein has evolved a different function within this group. Consistent with this hypothesis the cyanobacterial sequences are well separated from the other sequences in the tree and have a unique alignment feature; they all share a ~160 aa highly conserved C-terminal extension that is absent from all other sequences. The phylogenetic analyses of the A-type flavoprotein strongly indicate that the <it>fprA </it>gene has been distributed between the prokaryotic groups via LGT, rather than by vertical inheritance, since many groupings of unrelated prokaryotic taxa are observed and supported by strong support values from both analyses (Figure <figr fid="F3">3</figr>).</p>
            <p>The eukaryotes are found in two clearly separated clusters. The two diplomonad sequences are found together with a <it>Trichomonas </it>sequence among a mixture of eubacterial and archaeal species, indicating a prokaryote-to-eukaryote gene transfer event to a hypothetical uniquely shared ancestor of diplomonads and parabasalids (box B in Figure <figr fid="F3">3</figr>) &#8211; unless several independent gene losses are inferred among a broad range of eukaryotic lineages (Figure <figr fid="F1">1</figr>). In the ML analysis, three additional <it>Trichomonas </it>homologs are found weakly associated with the strongly supported grouping of the <it>Mastigamoeba </it>and <it>Entamoeba </it>sequences (box A in Figure <figr fid="F3">3</figr>), while the <it>Entamoeba</it>/<it>Mastigamoeba </it>clade is found a sister clade to the <it>Clostridium perfringens </it>sequences in the grouped aa analysis with a posterior probability of 0.97 (data not shown). Thus, the relationships between amoebozoan, parabasalid and clostridial sequences are uncertain. However, <it>Trichomonas </it>does not share a recent common ancestor with amoebozoa (Figure <figr fid="F1">1</figr>) suggesting that the <it>fprA </it>gene has been acquired by separate gene transfer events in these two eukaryotic lineages. Alternatively, following a prokaryotic LGT to one of the two eukaryotic lineages, a second LGT took place between an ancestor of <it>Entamoeba </it>and a parabasalid. Interestingly, the three <it>T. vaginalis </it>sequences share a ~450 aa C-terminal extension of about 39% identity with the <it>Clostridium perfringens </it>3 <it>fprA </it>homolog (Figure <figr fid="F3">3</figr> and <supplr sid="S4">Additional File 4</supplr>). This sequence, and a 433 aa long <it>Clostridium tetani </it>sequence (an FAD-dependent pyridine nucleotide-disulphide oxidoreductase:Rubredoxin-type 38% identical to the <it>T. vaginalis </it>sequences) are the most similar prokaryotic sequences in the public databases, while the most similar eukaryotic sequence, an NADH dehydrogenase from <it>E. histolytica </it>previously identified to be of prokaryotic origin <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>, is only 25% identical (<supplr sid="S4">Additional File 4</supplr>). Such a taxonomic distribution of this protein domain links the <it>Trichomonas </it>C-terminal extensions with the <it>Clostridium </it>sequences rather than to the eukaryotic sequences, suggesting that they originated via a gene transfer event from a prokaryote donor. Thus, both the N-terminal and C-terminal domains of the <it>T. vaginalis </it>A-type flavoproteins likely have prokaryotic origins (see <supplr sid="S4">Additional File 4</supplr> for discussion of plausible scenarios).</p>
            <suppl id="S4">
               <title>
                  <p>Additional File 4</p>
               </title>
               <text>
                  <p>Figure showing the structural organization of the <it>T. vaginalis </it>A-type flavoproteins together with additional discussion.</p>
               </text>
               <file name="1471-2148-6-27-S4.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The eukaryotic lineages that encode <it>fprA </it>are micro-aerophilic organisms that most likely have evolved from aerobic eukaryotes <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, and the prokaryotes found closest to the eukaryotic sequences in the tree are found in oxygen-poor environments. These observations indicate that the transfer of the gene occurred in such an environment. The putative functional role of <it>fprA </it>in nitric oxide detoxification <abbrgrp><abbr bid="B40">40</abbr></abbrgrp> indicate that these gene transfers might represent metabolic adaptations that allowed these different eukaryotes to better survive in anoxic environments. <it>fprA </it>could be part of the gene pool shared between distantly-related organisms (prokaryotic or eukaryotic) that occupy the same ecological niche.</p>
         </sec>
         <sec>
            <st>
               <p>Glucosamine-6-phosphate Isomerase</p>
            </st>
            <p>The <it>nagB </it>gene encodes glucosamine-6-phosphate isomerase, an enzyme which is usually about 260 aa residues in length and is required for the biosynthesis of the cyst wall in <it>Giardia </it><abbrgrp><abbr bid="B42">42</abbr></abbrgrp>. Apart from low G+C Gram positives and &#947;-proteobacteria, the <it>nagB </it>gene is only sparsely represented in eubacteria and not yet detected in archaea (Figure <figr fid="F4">4</figr>). It is also absent from several eukaryotic lineages (Figure <figr fid="F1">1</figr>). In the phylogenetic tree of <it>nagB</it>, a strongly supported group including the <it>Entamoeba</it>, ciliate, mycetozoa, pelobiont, parabasalid and several eubacterial sequences was detected (box B in Figure <figr fid="F4">4</figr>). All these sequences, with the exception of one of the <it>Rhodopirellula baltica </it>paralogs, have a roughly 500 aa residue long homologous C-terminal extension of the protein with pair-wise identities above 48%, which confirms the common ancestry of these sequences (Figure <figr fid="F4">4</figr>). To increase the resolution of this group, a separate analysis was performed which only included the sequences of the long version of the protein and therefore was based on a larger number of positions in the alignment &#8211; 560 unambiguously aligned aa residues compared to 229 (Figure <figr fid="F5">5</figr>). Interestingly, in both analyses, pelobiont and <it>Entamoeba </it>sequences form a group with the ciliate sequences. In the ML analyses the mycetozoan <it>Dictyostelium discoideum </it>is found as a sister to these sequences with a bootstrap support of 56% (Figure <figr fid="F5">5</figr>), while the <it>Rhodopirelulla baltica </it>4 sequence is found as the immediate outgroup to the ciliate/<it>Mastigamoeba</it>/<it>Entamoeba </it>sequences with a posterior probability of 0.45 in the grouped aa analysis (data not shown). These weakly supported and partly incongruent phylogenies could be rationalized in the following ways: (i) a phylogenetic artefact splitting the amoebozoa sequences, in combination with differential gene loss in all sampled eukaryotic genomes with only the currently sampled ciliates and amoebozoa retaining <it>nagB </it>(Figure <figr fid="F1">1</figr>), (ii) inter-domain gene transfers events from closely related, but yet unsampled, prokaryotes to the amoebozoa and ciliate lineages, or (iii) the presence of the long version of <it>nagB </it>in the common amoebozoan ancestor followed by a transfer event to the ciliate lineage. Although none of the alternatives can be excluded, we favour the third explanation, since the expected topology within the amoebozoa is recovered, albeit with only weak bootstrap support from the ML analysis (Figure <figr fid="F5">5</figr>), if a single intra-domain LGT event is inferred. Furthermore, ciliates are known to eat other protists <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>, indicating that a gene transfer event from an amoebozoan to a ciliate is feasible at least in principle <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Further taxonomic sampling of eukaryotic and prokaryotic genomes is obviously needed, especially within the two particular eukaryotic groups concerned, to distinguish between the different plausible scenarios. In any case, the <it>Mastigamoeba </it>and <it>Entamoeba </it>sequences form a strongly supported group that indicates the presence of the gene in their common ancestor.</p>
            <p>In contrast, the <it>Trichomonas </it>homolog, that encodes the long version of the enzyme, and diplomonads, that encode the short version, have distinct origins. While the parabasalid gene likely originated via a gene transfer event, possibly from a eubacteria within the Bacteroidetes/Chlorobi group (Figure <figr fid="F5">5</figr>), the source of the diplomonad genes remains uncertain. The separation from other eukaryotes in box A appear robust with strong bootstrap support from both of the analyses with all taxa (Figure <figr fid="F4">4</figr>), as well as an additional analysis where the box B sequences and prokaryotic long branches were excluded (<supplr sid="S5">Additional File 5</supplr>). Thus, the separation of the diplomonad sequences from the eukaryotic sequences in box A is unlikely a result of long-branch attraction; an LGT event from an unsampled eubacterial lineage seems like a more likely explanation (Figure <figr fid="F4">4</figr>).</p>
            <suppl id="S5">
               <title>
                  <p>Additional File 5</p>
               </title>
               <text>
                  <p>Figure showing a phylogenetic analysis of the short version of glucosamine-6-phosphate isomerase with the long version and some prokaryotic long branches excluded.</p>
               </text>
               <file name="1471-2148-6-27-S5.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>The topology of the tree relating the opisthokont sequences to other eukaryotic lineages and prokaryotes is not easy to explain simply by vertical inheritance (box A in Figure <figr fid="F4">4</figr>); the metazoan sequences are grouped together as expected, but the fungi are split into one main group and a smaller group with three budding yeast sequences. The separation between the two fungal groups is supported by both analyses (Figure <figr fid="F4">4</figr>). In fact, the budding yeasts are found with the other fungi only in 1% of the bootstrap replicates in the ML analyses (including the analysis without long branches) and never among the 2000 sampled trees in the grouped aa analysis. Furthermore, the eukaryotes within box A are never found as a monophyletic group among the 500 bootstrap replicates in any of the ML analyses, and with a posterior probability of only 0.02 in the grouped aa analysis (data not shown). Collectively, these results indicate that the fungal <it>nagB </it>genes likely have separate origins; a recent introduction of a <it>nagB </it>gene into a common ancestor of the three budding yeast lineages <it>Debaryomyces</it>, <it>Candida</it>, and <it>Yarrowia </it>seems like a reasonable scenario.</p>
            <p>The <it>Dictyostelium </it>sequence is found as a sister to a <it>Fusobacterium </it>sequence in the two ML analyses (Figure <figr fid="F4">4</figr> and <supplr sid="S5">Additional File 5</supplr>), while the sequence is nested between the three budding yeast sequences and the other sequences in box A in Figure <figr fid="F4">4</figr> in the grouped aa analysis with posterior probabilities for the separations of 0.95 and 0.90, respectively (data not shown). The separation to the metazoan/fungi/euglenozoan group is strong also in the ML analysis; the <it>Dictyostelium </it>sequence indeed never branches with this group in any of the bootstrap replicates in the full analysis or the analysis where long branches were excluded (data not shown). Accordingly, the phylogenetic analyses indicate that a gene acquisition from a prokaryotic lineage is a plausible explanation for the origin of the <it>Dictyostelium </it>sequence, rather than a shared ancestry with the other eukaryotic sequences within box A.</p>
         </sec>
         <sec>
            <st>
               <p>Alcohol dehydrogenase E</p>
            </st>
            <p>Alcohol dehydrogenase E is a key enzyme in the energy metabolism of type I "amitochondriate" protists (i.e. those that lack energy-producing mitochondria or hydrogenosomes) <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>, since it catalyzes the conversion of acetyl-CoA to ethanol in a two-step reaction which oxidizes two molecules of NADH to NAD<sup>+ </sup><abbrgrp><abbr bid="B46">46</abbr></abbrgrp>. We expanded the dataset with <it>adhE </it>genes from three additional <it>Entamoeba</it>species. The failure to detect an <it>adhE </it>homolog in <it>T. vaginalis </it>in the ongoing genome project <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> was expected, since this organism contains hydrogenosomes (type II "amitochondrial" protist), and therefore utilizes a different set of enzymes in their energy metabolism <abbrgrp><abbr bid="B45">45</abbr></abbrgrp>. Interestingly, alcohol dehydrogenase E genes have been detected in the anaerobic chytrid fungus <it>Piromyces </it>sp. E2, which indeed does contain hydrogenosomes <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. However, energy metabolism of chytrids is clearly different from that of type II amitochondriate protists such as <it>Trichomonas </it>&#8211; chytrids exhibit a bacterial-type mixed-acid fermentation <abbrgrp><abbr bid="B48">48</abbr></abbrgrp>. The finding of alcohol dehydrogenase E in two green algal species, where the protein functions in aerobic mitochondria, indicates that the diversity of its functional role in eukaryotes is not fully understood <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>.</p>
            <p>The phylogenetic tree supports our earlier interpretations that LGT has played an important role in the evolution of this gene <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B49">49</abbr></abbrgrp>, with a number of strongly supported prokaryotic relationships that most easily are explained by gene transfer events (Figure <figr fid="F6">6</figr>) &#8211; the gene is only rarely found outside low G+C Gram positives and &#947;-proteobacteria. The additional <it>Entamoeba </it>sequences form a group with the <it>E. histolytica </it>sequence, indicating the presence of the gene in the common ancestor of these <it>Entamoeba </it>species, while the <it>Mastigamoeba </it>sequence clearly has a distinct origin from that of its amoebozoan sisters, as observed previously <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>. The position of the <it>Entamoeba </it>sequences within a eubacterial group strongly suggests a prokaryote-to-eukaryote LGT event. Sequences from two diplomonads, two green algae, a single apicomplexan, and a chytrid fungus are found in the same region of the tree as the <it>Mastigamoeba </it>sequence (box B in Figure <figr fid="F6">6</figr>). However, relatives of these organisms are known to lack the gene (Figure <figr fid="F1">1</figr>), arguing in favour of recent independent introductions of the gene, rather than an ancestral presence followed by differential gene loss. The green algal sequences are found as sisters to the single cyanobacterial sequence (Figure <figr fid="F6">6</figr>) with moderate to strong statistical support from both analyses, indicating a transfer event between the lineages. This seems ecologically reasonable since ancestors of these lineages could have been found in the same environment. Although the observed topology could be explained by endosymbiotic gene transfer from the plastid, the fact that land plants are lacking the gene, the absence of the gene from extant plastid genomes, and the localization of the protein in green algal mitochondria <abbrgrp><abbr bid="B49">49</abbr></abbrgrp> makes a gene transfer independent of the plastid endosymbiosis somewhat more likely. Also the <it>Mastigamoeba </it>sequence is separated from the other eukaryotic sequences in this region with strong and moderate support from the grouped aa and ML analyses, respectively (box B in Figure <figr fid="F6">6</figr>), clearly suggesting an origin via LGT, maybe from an unsampled prokaryotic lineage.</p>
            <p>The diplomonad, fungal, and apicomplexan alcohol dehydrogenases are found in a weakly supported cluster in both analyses. This suggests eukaryote-to-eukaryote gene transfer events, although the donor and recipient lineages are difficult to infer. Indeed, it has earlier been suggested that the green algal <it>adhE </it>could have been acquired by the algae from parasitizing chytrid fungi or from foraminiferan hosts to endosymbiotic algae <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, and similar interactions between these lineages could be invoked to explain the exchange of <it>adhE </it>genes. Interestingly, <it>Cryptosporidium</it>, <it>Piromyces </it>and many diplomonads are all anaerobes or microaerophiles, and many share similar, if not identical, environments; the digestive tract of various mammals. This could have facilitated gene sharing via LGT between these distantly related eukaryotic lineages, although independent acquisitions from unsampled prokaryotes cannot be excluded. Interestingly, these three distantly related eukaryotic lineages most likely have adapted to an anaerobic lifestyle independently <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>, and the putative acquisition of <it>adhE</it>likely represented independent metabolic adaptations to this environment. As for the <it>priS </it>gene (Figure <figr fid="F2">2</figr>), our prediction is that future genome sampling will only uncover <it>adhE </it>genes among microbial taxa since the distribution of <it>adhE </it>is restricted to microbial eukaryotes.</p>
         </sec>
         <sec>
            <st>
               <p>LGT events as phylogenetic markers</p>
            </st>
            <p>LGT is usually expected to confound efforts to reconstruct organismal relationships, since it decouples the historical signals in the gene sequences from organismal lineages <abbrgrp><abbr bid="B50">50</abbr></abbrgrp>. However, gene transfer events can also be informative in a specific case; the shared possession of a transferred gene may indicate a phylogenetic relationship between the lineages that possess the transferred gene to the exclusion of the lineages that lack it. There certainly are limitations for such interpretations; the gene could have been lost in some of the descendants of the recipient lineage and additional transfers can complicate the correct identification of donor and recipient lineages. In any case, gene transfer events are a potentially very important source of information about organismal relationships <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B51">51</abbr></abbrgrp>, especially for protists where the molecular data are scarce and phylogenetic reconstructions are difficult <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>.</p>
            <p>For example, the phylogenetic positions of pelobionts and <it>Entamoeba </it>have been difficult to resolve with molecular markers. Analyses of ribosomal RNA only weakly grouped these together <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>, while more recently, based on a number of protein markers, it was conclusively shown that these two groups share a common ancestor to the exclusion of other eukaryotes <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp>. Interestingly, the <it>Entamoeba </it>sequences strongly group together with the <it>Mastigamoeba </it>sequence in two of the four analyses discussed here, <it>fprA </it>and <it>nagB </it>(Figures <figr fid="F3">3</figr> &amp;<figr fid="F5">5</figr>). This suggests that these genes were present in the ancestor of <it>Mastigamoeba </it>and <it>Entamoeba</it>, providing further support for a specific relationship between these two eukaryotic lineages. Furthermore, the higher eukaryotic taxon Amoebozoa <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> is reflected in the phylogeny of one of these genes, <it>nagB </it>(Figure <figr fid="F5">5</figr>), provided one accepts the recovered gene phylogeny that indicates a possible gene transfer from within this group to a ciliate lineage. If robust, this branching pattern could allow one to make inferences about the relative timing of divergences within Amoebozoa and the Ciliophora. However, improved taxonomic sampling of the <it>nagB </it>gene within both groups of protists will be needed to solidify such inferences. Finally, the absence of <it>fprA </it>in the <it>Dictyostelium discoideum </it>genome suggests that the presence of this eubacterial gene within various Amoebozoa lineages might be used as a synapomorphy for discerning phylogenetic relationships within the group.</p>
            <p>Similarly, diplomonads and parabasalids have been suggested to share a common ancestor, initially mainly based on weak evidence from molecular data <abbrgrp><abbr bid="B53">53</abbr><abbr bid="B54">54</abbr></abbrgrp>. The case for this relationship was recently strengthened by the identification of two aminoacyl-tRNA synthetase genes that appear to have been transferred to a common ancestor of the two lineages <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> and recent phylogenetic analyses of concatenated protein alignments <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>. The observation of a transfer of a gene encoding A-type flavoprotein to a uniquely shared ancestor of the two lineages (Figure <figr fid="F3">3</figr>) further supports their specific relationship. The identification of these three genes of prokaryotic origin shared between diplomonads and parabasalids in other lineages within Excavata should be useful to pinpoint relationships within this poorly resolved and diverse group of eukaryotes.</p>
         </sec>
         <sec>
            <st>
               <p>The timing of transfers relative to eukaryote diversification</p>
            </st>
            <p>The relative timing of the transfers can be addressed in more detail with our increased taxon sampling (Figure <figr fid="F7">7</figr>). For reasons outlined above, probably none of the four genes was present in the last common eukaryotic ancestor indicating that all putative transfers almost certainly happened in a more recent evolutionary time (Figure <figr fid="F7">7</figr>). However, all four genes were transferred to the diplomonad lineage before the split between <it>Giardia </it>and <it>Spironucleus </it>&#8211; they branch together in the phylogenetic reconstructions (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr> &amp;<figr fid="F6">6</figr>). With the sampling of <it>Trichomonas </it>homologs for three of the genes and the absence of the fourth, we now can date the transfers of <it>priS</it>, <it>nagB</it>, and <it>adhE </it>to after the split between diplomonads and parabasalids <abbrgrp><abbr bid="B18">18</abbr><abbr bid="B25">25</abbr><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr></abbrgrp>, but before the divergence of the two major groups of diplomonads <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> (Figure <figr fid="F7">7</figr>). The fourth gene (<it>fprA</it>) was most likely introduced into the diplomonads lineage before the split of parabasalids, but after their divergence to the other eukaryotic lineages.</p>
            <fig id="F7">
               <title>
                  <p>Figure 7</p>
               </title>
               <caption>
                  <p>Summary of putative lateral gene transfers affecting amoebozoa, ciliates, and diplomonads, and parabasalids</p>
               </caption>
               <text>
                  <p><b>Summary of putative lateral gene transfers affecting amoebozoa, ciliates, and diplomonads, and parabasalids</b>. Lateral gene transfers inferred from Figures 2-6, as well as previously published phylogenetic analyses [10, 18] discussed in the text, are indicated on the topology; gene transfers from prokaryotes are indicated by black arrows, intra-eukaryote transfers between the groups are indicated by orange arrows, and gene introduced from uncertain origins are indicated by grey arrow. Please notice that the figure does not delineate the order of individual transfer events on each branch, and that plausible alternative hypotheses do exist to explain some of the unexpected phylogenetic positions of eukaryotes, here indicated as gene transfer events, our currently preferred hypothesis (see text for details).</p>
               </text>
               <graphic file="1471-2148-6-27-7"/>
            </fig>
            <p>Similarly, the putative transfers previously found to be affecting the <it>Entamoeba </it>lineage <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> can be dated in more detail with our updated datasets. The <it>Entamoeba </it>sequences branch together in the phylogenetic reconstructions for all genes, indicating that the genes were present in the common ancestor of the four species included in the analysis (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>). For the other two genes, <it>priS </it>and <it>adhE</it>, the separation of the <it>Entamoeba </it>and <it>Mastigamoeba </it>sequences is strongly supported (Figures <figr fid="F2">2</figr> &amp;<figr fid="F6">6</figr>), indicating that the transfer event to the <it>Entamoeba </it>lineage probably happened after the split between <it>Entamoeba </it>and pelobionts, but before the divergence of the <it>Entamoeba </it>species <abbrgrp><abbr bid="B21">21</abbr></abbrgrp>. A similar pattern was observed for the gene encoding alanyl-tRNA synthetase, where the ancestral eukaryotic version was replaced by a homolog from the parabasalid lineage in <it>Entamoeba </it>after the split from the <it>Mastigamoeba </it><abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The timing of the transfer of the <it>priS </it>gene is more difficult to pinpoint since the separation of the <it>Entamoeba </it>and <it>Mastigamoeba </it>sequences is only weakly supported by the bootstrap analyses (Figure <figr fid="F2">2</figr>). As mentioned above, <it>nagB </it>and <it>fprA </it>most likely were present in the common ancestor of <it>Mastigamoeba </it>and <it>Entamoeba</it>, and the recipient of the <it>nagB </it>gene most likely was a common ancestor also of <it>Dictyostelium</it>, indicating that the transfer of these genes probably were more ancient events than the transfers of <it>priS </it>and <it>adhE </it>(Figure <figr fid="F7">7</figr>). The multiple copies found in one or more of the <it>Entamoeba </it>species for <it>priS</it>, and <it>fprA </it>are most likely due to recent gene duplication events within the <it>Entamoeba </it>lineages (Figures <figr fid="F2">2</figr> &amp;<figr fid="F3">3</figr>), a pattern also observed from the analysis of the partial genome sequence of <it>E. invadens</it><abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. Interestingly, one of the two <it>E. invadens priS </it>sequences has a frameshift due to an eight nucleotide long deletion in the middle of the gene (this is unlikely to be due to a methodological artefact as several different PCR products gave identical sequences). This frameshift probably reflects the dynamics of the evolution of gene families in the <it>Entamoeba </it>lineage with frequent gene duplication followed by inactivation of some of the paralogs by accumulation of deleterious mutations.</p>
            <p>Among the four sampled genes, the absence of gene transfer occurring within the <it>Entamoeba </it>and diplomonad groups is probably an indication that the evolutionary times since the split of these respective groups are short in comparison to the time since the last common eukaryotic ancestor, rather than an indication that the rate of inter-domain transfers have decreased in more recent evolutionary time. Indeed, one of the fifteen genes in the previous analysis, <it>pyrG </it>which encodes CTP synthetase, was probably introduced independently into the <it>Giardia </it>and <it>Spironucleus </it>lineages <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>. However, the data are still scarce, and additional sampling of genes from diverse protist lineages could change the inferences of the timing of the transfers of individual genes presented here (Figure <figr fid="F7">7</figr>). Nevertheless, the data from our four genes, in combination with previously published data <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B18">18</abbr></abbrgrp>, support a scenario where prokaryotic genes from various lineages have been transferred into eukaryotic lineages continuously over time (Figure <figr fid="F7">7</figr>).</p>
         </sec>
         <sec>
            <st>
               <p>A link between gene transfers and feeding habits in phagotrophic protists and their shared ecological niche?</p>
            </st>
            <p>An interesting pattern where the studied protists mostly acquire genes from prokaryotes was observed (Figure <figr fid="F7">7</figr>). This may be explained by a preference for growing in prokaryote rich environments and consuming prokaryotes by the four groups of phagotrophic protists investigated in this study &#8211; diplomonads, parabasalids, pelobionts and <it>Entamoeba</it>, since uptake of DNA from ingested cells is possibly an important mechanism enabling LGT in eukaryotes <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B8">8</abbr></abbrgrp>. Indeed, diplomonads generally feed on prokaryotes <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> and several prokaryote-to-eukaryote gene transfer events have been described for this eukaryotic group (Figure <figr fid="F7">7</figr>) <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B41">41</abbr><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>, while, to our knowledge, no strong case of gene transfer event from a eukaryote lineage to diplomonads has been described yet. <it>Entamoeba</it>, on the other hand, is able to ingest both prokaryotes and eukaryotes; it can be maintained in monoxenic cultures with bacteria as well as trypanosomatid flagellates <abbrgrp><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. The <it>Entamoeba </it>lineage was recently suggested as the recipient lineage in a eukaryote-to-eukaryote gene transfer event of the alanyl-tRNA synthetase gene from the parabasalid lineage <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> (Figure <figr fid="F7">7</figr>), although most gene transfer events affecting <it>Entamoeba </it>seem to involve prokaryotic donor lineages (Figures <figr fid="F2">2</figr>, <figr fid="F3">3</figr>, <figr fid="F4">4</figr>, <figr fid="F5">5</figr>, <figr fid="F6">6</figr>) <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B10">10</abbr></abbrgrp>.</p>
            <p>In this study, the donor and recipient lineages could be inferred in one putative eukaryote-to-eukaryote gene transfer event with reasonable support; ciliates were hypothesized to have acquired a gene from an Amoebozoa lineage (Figures <figr fid="F5">5</figr> and <figr fid="F7">7</figr>). Interestingly, ciliates were also previously shown to represent the recipient lineage in an intra-domain transfer of the alanyl-tRNA synthetase gene <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> (Figure <figr fid="F7">7</figr>). It is possible that the recipient lineage of these two transfers &#8211; an ancestor of <it>Paramecium </it>and <it>Tetrahymena </it>&#8211; tended to preferentially graze on eukaryotic protists rather than bacteria and was therefore exposed to eukaryotic DNA leading to LGT events; ciliates are indeed known to eat both prokaryotes and eukaryotes <abbrgrp><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>. Similarly, dinoflagellates are known to graze on eukaryotes <abbrgrp><abbr bid="B62">62</abbr></abbrgrp> and have been identified as the recipient lineage in eukaryote-to-eukaryote gene transfer events <abbrgrp><abbr bid="B63">63</abbr><abbr bid="B64">64</abbr></abbrgrp>. If this pattern holds up in light of more data, it suggests that there is a link between the genome evolution and the food content in phagotrophic protists &#8211; indicating that an understanding of eating habits is important to our understanding of gene transfer in the evolution of phagotrophic protists, as postulated by Doolittle <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. Global proteome phylogenies from 144 prokaryotes indicate that LGT has created pools of shared genes between distantly related prokaryotes occupying the same niche <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, such as mammalian mucosa. Our present analyses extend these observations to microbial eukaryotes with shared genes between microorganisms thriving on mammalian mucosa such as the trichomonads, diplomonads, apicomplexans, <it>Piromyces </it>and <it>Entamoeba</it>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Sources of DNA</p>
            </st>
            <p><it>Entamoeba invadens </it>(strain IP-1, ATCC 30994), <it>E. moshkovskii </it>(strain FIC, ATCC 30041), <it>E. terrapinae </it>(strain M, ATCC 30043) were cultured in LYI-S-2 medium at room temperature <abbrgrp><abbr bid="B61">61</abbr></abbrgrp>, <it>Mastigamoeba balamuthi </it>(ATCC 30984) were cultured in PYGC medium <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>, and <it>Naegleria gruberi </it>(strain NEG-M, ATCC 30224) were cultured in modified PYNFH medium (ATCC medium 1034, cat. no. 327-X) at room temperature. The cells were harvested, followed by lysis in 0.25% SDS/0.1 M EDTA, pH 8.0, and genomic DNAs were purified using a cetyl trimethylammonium bromide extraction (CTAB) method <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>. <it>Trichomonas vaginalis </it>(strain G3) <it>nagB </it>and <it>fprA </it>cDNA clones were identified in the ongoing EST project (Hirt, R.P., Embley, T.M., and Harriman, N.), and two <it>priS Naegleria gruberi </it>(strain NEG-M, ATCC 30224) cDNA clones were identified in an ongoing EST project (Sj&#246;gren, &#197;.M., Andersson, J.O., Gill, E., Roger, A.J., unpublished).</p>
         </sec>
         <sec>
            <st>
               <p>PCR and sequencing</p>
            </st>
            <p>Exact match PCR primers to obtain <it>Entamoeba </it>genes for independent sequencing were designed based on sequences available from the website of the ongoing genome projects <abbrgrp><abbr bid="B67">67</abbr></abbrgrp> that showed similarity to the studied genes. If no such sequences were available, degenerate primers were designed against conserved regions of the alignments. Using different combinations of these primers the four genes were successfully amplified using genomic DNA in PCR reactions. The <it>Mastigamoeba priS </it>sequence was amplified using degenerate primers and genomic DNA, while the <it>Mastigamoeba nagB </it>and <it>fprA </it>were PCR amplified from genomic DNA using exact-match primers designed from cDNA sequences available from an in-house EST project and published <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> cDNA projects, respectively. The <it>N. gruberi priS </it>sequence was amplified from genomic DNA using exact match PCR primers based on cDNA sequences. The PCR products were purified using the Qiaquick PCR Purification Kit (Qiagen Inc., Valencia, CA) and directly sequenced using the ABI PRISM BigDye Termination Cycle Sequencing Kit (Applied Biosystems, Foster City, CA) using the primers used in the amplification as well as internal primers. Introns were identified and removed before the subsequent phylogenetic analyses from the obtained <it>M. balamuthi nagB</it>, <it>priS</it>, and <it>fprA </it>sequences.</p>
         </sec>
         <sec>
            <st>
               <p>Assembly of the datasets</p>
            </st>
            <p>All available homologs for the four genes were retrieved from the National Center for Biotechnology Information <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>. In addition, similarity searches against sequences released from ongoing eukaryotic genome projects were performed to identify and retrieve additional eukaryotic homologs of the genes from ongoing genome projects at various genome sequences centres. Additional <it>priS</it>, and <it>fprA T. vaginalis </it>sequences, and <it>nagB </it>sequences from <it>Tetrahymena thermophila</it>, <it>Trypanosoma. bruci</it>, and <it>Trypanosoma cruzi </it>were retrieved from the Institute for Genomic Research <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>. <it>Chlamydomonas reinhardtii priS </it>and <it>adhE </it>sequences, and a <it>Thalassiosira pseudonana priS </it>sequence were retrieved from the DOE Joint Genome Institute <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>. Finally, a <it>nagB </it>sequences from <it>Leishmania major </it>and <it>Paramecium tetraurelia </it>were retrieved from the Wellcome Trust Sanger Institute <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>, and Genoscope <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>, respectively. None of the four genes could be detected among the available <it>Phytophthora sojae </it>sequences <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>.</p>
            <p>The aa sequence datasets were aligned using CLUSTALW <abbrgrp><abbr bid="B73">73</abbr></abbrgrp>, manually adjusted, and visually inspected to identify unambiguously aligned regions suitable for phylogenetic reconstructions. Only one sequence among pairs with >95% aa sequence identity within the unambiguously aligned regions were retained for further analyses. Finally, all sequences that covered less than one third of the unambiguously aligned regions were excluded. The accessions numbers and other details of the sequences within the datasets are listed in <supplr sid="S1">Additional File 1</supplr>, and the alignments used in the study are available as Additional Files <supplr sid="S6">6</supplr>, <supplr sid="S7">7</supplr>, <supplr sid="S8">8</supplr>, <supplr sid="S9">9</supplr>. In the previously analyses data from unpublished prokaryotic genome projects were included <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, while only published prokaryotic sequences were included in the present analyses. Therefore, the <it>Carboxydothermus </it>sequence and the single <it>Clostridium </it>sequence, and the <it>Carboxydothermus </it>and <it>Fibrobacter </it>sequences are present in the previous <it>priS </it>and <it>fprA </it>analyses, respectively, but missing from the current datasets.</p>
            <suppl id="S6">
               <title>
                  <p>Additional File 6</p>
               </title>
               <text>
                  <p>Alignment file in nexus format for the hybrid-cluster protein.</p>
               </text>
               <file name="1471-2148-6-27-S6.ande">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional File 7</p>
               </title>
               <text>
                  <p>Alignment file in nexus format for the A-type flavoprotein.</p>
               </text>
               <file name="1471-2148-6-27-S7.ande">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S8">
               <title>
                  <p>Additional File 8</p>
               </title>
               <text>
                  <p>Alignment file in nexus format for the glucosamine-6-phosphate isomerase.</p>
               </text>
               <file name="1471-2148-6-27-S8.ande">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S9">
               <title>
                  <p>Additional File 9</p>
               </title>
               <text>
                  <p>Alignment file in nexus format for the alcohol dehydrogenase E.</p>
               </text>
               <file name="1471-2148-6-27-S9.ande">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analyses</p>
            </st>
            <p>The optimal aa substitution model for each dataset was selected using the program ModelGenerator, recently developed by T. M. Keane, T. J. Naughton, and J. O. McInerney, National University of Ireland, Maynooth, Ireland <abbrgrp><abbr bid="B35">35</abbr></abbrgrp>. Protein maximum likelihood (ML) phylogenies were inferred using PHYML, version 2.4.4 <abbrgrp><abbr bid="B34">34</abbr></abbrgrp> with the optimal substitution model, and bootstrap support values were calculated based on 500 resampled datasets. Most currently available phylogenetic methods cannot deal with strong aa compositional heterogeneity in the data <abbrgrp><abbr bid="B74">74</abbr></abbrgrp>, which may lead to false interpretations of evolutionary events<abbrgrp><abbr bid="B2">2</abbr><abbr bid="B14">14</abbr></abbrgrp>. Therefore, we also performed phylogenetic analyses using an approach that ameliorates (or mitigates) the compositional heterogeneity, as recently described in the analyses of the NuoF protein <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The aa alignments were recoded into six categories corresponding to the PAM matrix (and most other matrices) as follows: (1) ASTGP, (2) DNEQ, (3) RKH, (4) MVIL, (5) FYW and (6) C <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Bayesian phylogenetic analyses were performed on these grouped aa alignments using the program p4 <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. This allowed the use of a 6 &#215; 6 general time-reversible rate matrix with free parameters rather than a fixed empirical matrix. The among-site rate variation (ASRV) was chosen by the Akaike Information Criterion (AIC) based on ML on the neighbor-joining tree. All parameters, including the composition and substitution rate matrix, were free, and the analysis used the Metropolis-coupled MCMC strategy from MrBayes <abbrgrp><abbr bid="B75">75</abbr></abbrgrp>. Runs were done in duplicate for 10<sup>6 </sup>generations. The first halves were discarded as burn-in, while the second halves of both runs were combined for calculating the consensus trees. Convergence was assessed by plotting the split support values >0.10 in the two independent runs against each other (<supplr sid="S3">Additional File 3</supplr>). One dataset (the short version of glucosamine-6-phosophate) did not converge well (data not shown) and was rerun in duplicate for 2 &#215; 10<sup>6 </sup>generations, and the analyses converged under these conditions (<supplr sid="S3">Additional File 3</supplr>). The model fit of the composition was assessed using different approaches. Posterior predictive simulations were performed on the grouped aa datasets <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. If the tail area probabilities (<it>p</it><sub><it>t</it></sub>) is low (&lt;0.05), the model composition does not fit the composition of the dataset. Also, tests for compositional homogeneity using <it>X</it><sup>2 </sup>statistics were performed for both the original non-grouped and the grouped aa datasets using simulations to get the expected null distribution from the obtained trees and preferred substitution models used in the analyses <abbrgrp><abbr bid="B36">36</abbr></abbrgrp>. The more widely used <it>X</it><sup>2</sup>-tests for compositional homogeneity using the &#967;<sup>2 </sup>curve as the null distributions are only included for comparison since they fail to take the tree-based correlation of compositions among taxa into account <abbrgrp><abbr bid="B36">36</abbr></abbrgrp> (<supplr sid="S2">Additional File 2</supplr>).</p>
         </sec>
         <sec>
            <st>
               <p>Nucleotide sequence accession numbers</p>
            </st>
            <p>The sequences reported here were deposited in GenBank under the accession numbers <ext-link ext-link-type="gen" ext-link-id="AJ864541">AJ864541</ext-link>-<ext-link ext-link-type="gen" ext-link-id="AJ864560">AJ864560</ext-link>.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>JOA carried out the molecular biology studies, most bioinformatic analyses, and ML phylogenetic analyses and drafted the manuscript. PGF performed the grouped aa Bayesian phylogenetic analyses. RPH provided two cDNA clones and carried out the analyses for <supplr sid="S4">additional file 4</supplr>. RPH and AJR provided advice on analyses and edited the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>We thank C. Graham Clark (London School of Hygiene and Tropical Medicine) for generous gifts of cell lysates of three <it>Entamoeba </it>species, T. Martin Embley (University of Newcastle upon Tyne) for sharing the <it>Trichomonas </it>cDNA clones, Erin Gill and Lesley Davis (Dalhousie University) for a generous gift of <it>M. balamuthi </it>genomic DNA and a cDNA sequence, and Mikl&#243;s M&#252;ller (Rockerfeller University) for the cDNA library, Stewart Sarchfield (Dalhousie University) for experimental assistance, &#197;sa Sj&#246;gren (Dalhousie University) for gift of <it>N. gruberi </it>cDNA clones and genomic DNA, experimental assistance and critical reading of the manuscript, and I&#241;aki Ruiz-Trillo for critical reading of the manuscript. The availability of preliminary sequences from the various sequencing centers is greatly acknowledged.</p>
            <p>A.J.R. is supported by a fellowship from the Alfred P. Sloan Foundation, the Peter Lougheed/Canadian Institutes for Health Research New Investigator fellowship, and the Canadian Institute for Advanced Research, Program in Evolutionary Biology. R.P.H. was supported by a Wellcome Trust University Award. This work was supported by a Canadian Institutes of Health Research (CIHR) Grant (MOP-62809) awarded to A.J.R and a Swedish Research Council (VR) Grant awarded to J.O.A.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>How big is the iceberg of which organellar genes in nuclear genomes are but the tip?</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
               <au>
                  <snm>Boucher</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Nesb&#248;</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Douady</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>2003</pubdate>
            <volume>358</volume>
            <issue>1429</issue>
            <fpage>39</fpage>
            <lpage>58</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rstb.2002.1185</pubid>
                  <pubid idtype="pmpid" link="fulltext">12594917</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Horizontal gene transfer and the evolution of parasitic protozoa</p>
            </title>
            <aug>
               <au>
                  <snm>Richards</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Williams</snm>
                  <fnm>BA</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Protist</source>
            <pubdate>2003</pubdate>
            <volume>154</volume>
            <issue>1</issue>
            <fpage>17</fpage>
            <lpage>32</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1078/143446103764928468</pubid>
                  <pubid idtype="pmpid">12812367</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Gene transfer: gene swapping craze reaches eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>2</issue>
            <fpage>R53</fpage>
            <lpage>R54</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(02)01426-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">12546803</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Lateral gene transfer in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>2005</pubdate>
            <volume>62</volume>
            <issue>11</issue>
            <fpage>1182</fpage>
            <lpage>1197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00018-005-4539-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">15761667</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>The genome of the protist parasite <it>Entamoeba histolytica</it></p>
            </title>
            <aug>
               <au>
                  <snm>Loftus</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Davies</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Alsmark</snm>
                  <fnm>UCM</fnm>
               </au>
               <au>
                  <snm>Samuelson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Amedeo</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Roncaglia</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Nozaki</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Suh</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Pop</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Duchene</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ackers</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Tannich</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Leippe</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hofer</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Bruchhaus</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Willhoeft</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chillingworth</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Churcher</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Hance</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Harris</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Jagels</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Moule</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ormond</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Squares</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Whitehead</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Quail</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Rabbinowitsch</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Norbertczak</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Price</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Guillen</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Gilchrist</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Stroup</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lohia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Foster</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Sicheritz-Ponten</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Singh</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Mukherjee</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>El-Sayed</snm>
                  <fnm>NM</fnm>
               </au>
               <au>
                  <snm>Petri</snm>
                  <fnm>WAJ</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Barrell</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Fraser</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>433</volume>
            <issue>7028</issue>
            <fpage>865</fpage>
            <lpage>868</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03291</pubid>
                  <pubid idtype="pmpid" link="fulltext">15729342</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>Phylogenomic evidence supports past endosymbiosis, intracellular and horizontal gene transfer in <it>Cryptosporidium parvum</it></p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Mullapudi</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Lancto</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Scott</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Abrahamsen</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Kissinger</snm>
                  <fnm>JC</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2004</pubdate>
            <volume>5</volume>
            <issue>11</issue>
            <fpage>R88</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">545779</pubid>
                  <pubid idtype="pmpid" link="fulltext">15535864</pubid>
                  <pubid idtype="doi">10.1186/gb-2004-5-11-r88</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Photosynthetic eukaryotes unite: endosymbiosis connects the dots</p>
            </title>
            <aug>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Yoon</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Hackett</snm>
                  <fnm>JD</fnm>
               </au>
            </aug>
            <source>Bioessays</source>
            <pubdate>2004</pubdate>
            <volume>26</volume>
            <issue>1</issue>
            <fpage>50</fpage>
            <lpage>60</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1002/bies.10376</pubid>
                  <pubid idtype="pmpid" link="fulltext">14696040</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>You are what you eat: a gene transfer ratchet could account for bacterial genes in eukaryotic nuclear genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>1998</pubdate>
            <volume>14</volume>
            <fpage>307</fpage>
            <lpage>311</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(98)01494-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">9724962</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Lateral gene transfer and the evolution of plastid-targeted proteins in the secondary plastid-containing alga <it>Bigelowiella natans</it></p>
            </title>
            <aug>
               <au>
                  <snm>Archibald</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Rogers</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Toop</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Ishida</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2003</pubdate>
            <volume>100</volume>
            <issue>13</issue>
            <fpage>7678</fpage>
            <lpage>7683</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">164647</pubid>
                  <pubid idtype="pmpid" link="fulltext">12777624</pubid>
                  <pubid idtype="doi">10.1073/pnas.1230951100</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Phylogenetic analyses of diplomonad genes reveal frequent lateral gene transfers affecting eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Sj&#246;gren</snm>
                  <fnm>&#197;M</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>LAM</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2003</pubdate>
            <volume>13</volume>
            <issue>2</issue>
            <fpage>94</fpage>
            <lpage>104</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0960-9822(03)00003-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">12546782</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Evolution by acquisition: the case for horizontal gene transfers</p>
            </title>
            <aug>
               <au>
                  <snm>Smith</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Feng</snm>
                  <fnm>DF</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>RF</fnm>
               </au>
            </aug>
            <source>Trends Biochem Sci</source>
            <pubdate>1992</pubdate>
            <volume>17</volume>
            <issue>12</issue>
            <fpage>489</fpage>
            <lpage>493</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/0968-0004(92)90335-7</pubid>
                  <pubid idtype="pmpid">1471257</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Highways of gene sharing in prokaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Beiko</snm>
                  <fnm>RG</fnm>
               </au>
               <au>
                  <snm>Harlow</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>Ragan</snm>
                  <fnm>MA</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2005</pubdate>
            <volume>102</volume>
            <issue>40</issue>
            <fpage>14332</fpage>
            <lpage>14337</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0504068102</pubid>
                  <pubid idtype="pmpid" link="fulltext">16176988</pubid>
                  <pubid idtype="pmcid">1242295</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Non-mitochondrial complex I proteins in a hydrogenosomal oxidoreductase complex</p>
            </title>
            <aug>
               <au>
                  <snm>Dyall</snm>
                  <fnm>SD</fnm>
               </au>
               <au>
                  <snm>Yan</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Delgadillo-Correa</snm>
                  <fnm>MG</fnm>
               </au>
               <au>
                  <snm>Lunceford</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Loo</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Clarke</snm>
                  <fnm>CF</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>431</volume>
            <issue>7012</issue>
            <fpage>1103</fpage>
            <lpage>1107</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature02990</pubid>
                  <pubid idtype="pmpid" link="fulltext">15510149</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p><it>Trichomonas </it>hydrogenosomes contain the NADH dehydrogenase module of mitochondrial complex I</p>
            </title>
            <aug>
               <au>
                  <snm>Hrdy</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
               <au>
                  <snm>Dolezal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Bardonova</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Foster</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Tachezy</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2004</pubdate>
            <volume>432</volume>
            <issue>7017</issue>
            <fpage>618</fpage>
            <lpage>622</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03149</pubid>
                  <pubid idtype="pmpid" link="fulltext">15577909</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Mitochondria and hydrogenosomes are two forms of the same fundamental organelle</p>
            </title>
            <aug>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>van der Giezen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horner</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Dyal</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Foster</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>2003</pubdate>
            <volume>358</volume>
            <issue>1429</issue>
            <fpage>191</fpage>
            <lpage>201</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rstb.2002.1190</pubid>
                  <pubid idtype="pmpid" link="fulltext">12594927</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>The missing link between hydrogenosomes and mitochondria</p>
            </title>
            <aug>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
            </aug>
            <source>Trends Microbiol</source>
            <pubdate>2005</pubdate>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16109488</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>A mitochondrial-like chaperonin 60 gene in <it>Giardia lamblia</it>: evidence that diplomonads once harbored an endosymbiont related to the progenitor of mitochondria</p>
            </title>
            <aug>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
               <au>
                  <snm>Sv&#228;rd</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Tovar</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Smith</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Gillin</snm>
                  <fnm>FD</fnm>
               </au>
               <au>
                  <snm>Sogin</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>1998</pubdate>
            <volume>95</volume>
            <fpage>229</fpage>
            <lpage>234</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">18184</pubid>
                  <pubid idtype="pmpid" link="fulltext">9419358</pubid>
                  <pubid idtype="doi">10.1073/pnas.95.1.229</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Gene transfers from Nanoarchaeota to an ancestor of diplomonads and parabasalids</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Sarchfield</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>1</issue>
            <fpage>85</fpage>
            <lpage>90</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msh254</pubid>
                  <pubid idtype="pmpid" link="fulltext">15356278</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>Evolution of glutamate dehydrogenase genes: evidence for lateral gene transfer within and between prokaryotes and eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2003</pubdate>
            <volume>3</volume>
            <fpage>14</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">166173</pubid>
                  <pubid idtype="pmpid" link="fulltext">12820901</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-3-14</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>Increased taxon sampling is advantageous for phylogenetic inference</p>
            </title>
            <aug>
               <au>
                  <snm>Pollock</snm>
                  <fnm>DD</fnm>
               </au>
               <au>
                  <snm>Zwickl</snm>
                  <fnm>DJ</fnm>
               </au>
               <au>
                  <snm>McGuire</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Hillis</snm>
                  <fnm>DM</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2002</pubdate>
            <volume>51</volume>
            <issue>4</issue>
            <fpage>664</fpage>
            <lpage>671</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150290102357</pubid>
                  <pubid idtype="pmpid" link="fulltext">12228008</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Phylogeny of the genera <it>Entamoeba </it>and <it>Endolimax </it>as deduced from small-subunit ribosomal RNA sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Silberman</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Diamond</snm>
                  <fnm>LS</fnm>
               </au>
               <au>
                  <snm>Sogin</snm>
                  <fnm>ML</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>1999</pubdate>
            <volume>16</volume>
            <issue>12</issue>
            <fpage>1740</fpage>
            <lpage>1751</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10605115</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>Intraspecific variation and phylogenetic relationships in the genus <it>Entamoeba </it>as revealed by riboprinting</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Diamond</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>1997</pubdate>
            <volume>44</volume>
            <issue>2</issue>
            <fpage>142</fpage>
            <lpage>154</lpage>
            <xrefbib>
               <pubid idtype="pmpid">9109261</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The analysis of 100 genes supports the grouping of three highly divergent amoebae: <it>Dictyostelium, Entamoeba, </it>and <it>Mastigamoeba</it></p>
            </title>
            <aug>
               <au>
                  <snm>Bapteste</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Brinkmann</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Sensen</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Durufle</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lopez</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
            </aug>
            <source>Proc Natl Acad Sci U S A</source>
            <pubdate>2002</pubdate>
            <volume>99</volume>
            <issue>3</issue>
            <fpage>1414</fpage>
            <lpage>1419</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">122205</pubid>
                  <pubid idtype="pmpid" link="fulltext">11830664</pubid>
                  <pubid idtype="doi">10.1073/pnas.032662799</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>The phylogenetic position of the pelobiont <it>Mastigamoeba balamuthi </it>based on sequences of rRNA and translation elongation factors EF-1a and EF-2.</p>
            </title>
            <aug>
               <au>
                  <snm>Arisue</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hashimoto</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JA</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>Gordon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Sensen</snm>
                  <fnm>CW</fnm>
               </au>
               <au>
                  <snm>Gaasterland</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2002</pubdate>
            <volume>49</volume>
            <fpage>1</fpage>
            <lpage>10</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2002.tb00332.x</pubid>
                  <pubid idtype="pmpid">11908892</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>Root of the eukaryota tree as inferred from combined maximum likelihood analyses of multiple molecular sequence data</p>
            </title>
            <aug>
               <au>
                  <snm>Arisue</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Hasegawa</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hashimoto</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>3</issue>
            <fpage>409</fpage>
            <lpage>420</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi023</pubid>
                  <pubid idtype="pmpid" link="fulltext">15496553</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Inference of the phylogenetic position of oxymonads based on nine genes: support for Metamonada and Excavata</p>
            </title>
            <aug>
               <au>
                  <snm>Hampl</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Horner</snm>
                  <fnm>DS</fnm>
               </au>
               <au>
                  <snm>Dyal</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Kulda</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Flegr</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Foster</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>12</issue>
            <fpage>2508</fpage>
            <lpage>2518</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi245</pubid>
                  <pubid idtype="pmpid" link="fulltext">16120804</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Comprehensive multi-gene phylogenies of excavate protists reveal the evolutionary positions of 'primitive' eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Inagaki</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2006</pubdate>
            <volume>23</volume>
            <issue>3</issue>
            <fpage>615</fpage>
            <lpage>625</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msj068</pubid>
                  <pubid idtype="pmpid" link="fulltext">16308337</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Trichomoniasis</p>
            </title>
            <aug>
               <au>
                  <snm>Schwebke</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Burgess</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Clin Microbiol Rev</source>
            <pubdate>2004</pubdate>
            <volume>17</volume>
            <issue>4</issue>
            <fpage>794</fpage>
            <lpage>803</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">523559</pubid>
                  <pubid idtype="pmpid" link="fulltext">15489349</pubid>
                  <pubid idtype="doi">10.1128/CMR.17.4.794-803.2004</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>The deep roots of eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Baldauf</snm>
                  <fnm>SL</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2003</pubdate>
            <volume>300</volume>
            <issue>5626</issue>
            <fpage>1703</fpage>
            <lpage>1706</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1085544</pubid>
                  <pubid idtype="pmpid" link="fulltext">12805537</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B30">
            <title>
               <p>The real 'kingdoms' of eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Curr Biol</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>17</issue>
            <fpage>R693</fpage>
            <lpage>6</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.cub.2004.08.038</pubid>
                  <pubid idtype="pmpid" link="fulltext">15341755</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B31">
            <title>
               <p>The new higher level classification of eukaryotes with emphasis on the taxonomy of protists</p>
            </title>
            <aug>
               <au>
                  <snm>Adl</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Farmer</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>OR</fnm>
               </au>
               <au>
                  <snm>Barta</snm>
                  <fnm>JR</fnm>
               </au>
               <au>
                  <snm>Bowser</snm>
                  <fnm>SS</fnm>
               </au>
               <au>
                  <snm>Brugerolle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Fensome</snm>
                  <fnm>RA</fnm>
               </au>
               <au>
                  <snm>Fredericq</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>James</snm>
                  <fnm>TY</fnm>
               </au>
               <au>
                  <snm>Karpov</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Kugrens</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Krug</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lane</snm>
                  <fnm>CE</fnm>
               </au>
               <au>
                  <snm>Lewis</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Lodge</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Lynn</snm>
                  <fnm>DH</fnm>
               </au>
               <au>
                  <snm>Mann</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>McCourt</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Mendoza</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Moestrup</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Mozley-Standridge</snm>
                  <fnm>SE</fnm>
               </au>
               <au>
                  <snm>Nerad</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Shearer</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Smirnov</snm>
                  <fnm>AV</fnm>
               </au>
               <au>
                  <snm>Spiegel</snm>
                  <fnm>FW</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MF</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>52</volume>
            <issue>5</issue>
            <fpage>399</fpage>
            <lpage>451</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2005.00053.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16248873</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B32">
            <title>
               <p>A model of evolutionary change in proteins.</p>
            </title>
            <aug>
               <au>
                  <snm>Dayhoff</snm>
                  <fnm>MO</fnm>
               </au>
               <au>
                  <snm>Schwartz</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Orcutt</snm>
                  <fnm>BC</fnm>
               </au>
            </aug>
            <source>Atlas of Protein Sequences and Structure</source>
            <publisher>Washington DC , National Biomedical Research Foundation</publisher>
            <editor>Dayhoff MO</editor>
            <pubdate>1978</pubdate>
            <fpage>345</fpage>
            <lpage>352</lpage>
         </bibl>
         <bibl id="B33">
            <title>
               <p>A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach</p>
            </title>
            <aug>
               <au>
                  <snm>Whelan</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Goldman</snm>
                  <fnm>N</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2001</pubdate>
            <volume>18</volume>
            <issue>5</issue>
            <fpage>691</fpage>
            <lpage>699</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11319253</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B34">
            <title>
               <p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p>
            </title>
            <aug>
               <au>
                  <snm>Guindon</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Gascuel</snm>
                  <fnm>O</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2003</pubdate>
            <volume>52</volume>
            <issue>5</issue>
            <fpage>696</fpage>
            <lpage>704</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150390235520</pubid>
                  <pubid idtype="pmpid" link="fulltext">14530136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B35">
            <title>
               <p>Assessment of methods for amino acid matrix selection and their use on empirical data shows that ad hoc assumptions for choice of matrix are not justified</p>
            </title>
            <aug>
               <au>
                  <snm>Keane</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Creevey</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Pentony</snm>
                  <fnm>MM</fnm>
               </au>
               <au>
                  <snm>Naughton</snm>
                  <fnm>TJ</fnm>
               </au>
               <au>
                  <snm>McInerney</snm>
                  <fnm>JO</fnm>
               </au>
            </aug>
            <source>BMC Evol Biol</source>
            <pubdate>2006</pubdate>
            <volume>6</volume>
            <fpage>29</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1435933</pubid>
                  <pubid idtype="pmpid" link="fulltext">16563161</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-6-29</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B36">
            <title>
               <p>Modeling compositional heterogeneity</p>
            </title>
            <aug>
               <au>
                  <snm>Foster</snm>
                  <fnm>PG</fnm>
               </au>
            </aug>
            <source>Syst Biol</source>
            <pubdate>2004</pubdate>
            <volume>53</volume>
            <issue>3</issue>
            <fpage>485</fpage>
            <lpage>495</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/10635150490445779</pubid>
                  <pubid idtype="pmpid" link="fulltext">15503675</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B37">
            <title>
               <p>Hydroxylamine reductase activity of the hybrid cluster protein from <it>Escherichia coli</it></p>
            </title>
            <aug>
               <au>
                  <snm>Wolfe</snm>
                  <fnm>MT</fnm>
               </au>
               <au>
                  <snm>Heo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Garavelli</snm>
                  <fnm>JS</fnm>
               </au>
               <au>
                  <snm>Ludden</snm>
                  <fnm>PW</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <issue>21</issue>
            <fpage>5898</fpage>
            <lpage>5902</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">135376</pubid>
                  <pubid idtype="pmpid" link="fulltext">12374823</pubid>
                  <pubid idtype="doi">10.1128/JB.184.21.5898-5902.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B38">
            <title>
               <p>The hybrid-cluster protein ('prismane protein') from <it>Escherichia coli. </it>Characterization of the hybrid-cluster protein, redox properties of the [2Fe-2S] and [4Fe-2S-2O] clusters and identification of an associated NADH oxidoreductase containing FAD and [2Fe-2S]</p>
            </title>
            <aug>
               <au>
                  <snm>van den Berg</snm>
                  <fnm>WA</fnm>
               </au>
               <au>
                  <snm>Hagen</snm>
                  <fnm>WR</fnm>
               </au>
               <au>
                  <snm>van Dongen</snm>
                  <fnm>WM</fnm>
               </au>
            </aug>
            <source>Eur J Biochem</source>
            <pubdate>2000</pubdate>
            <volume>267</volume>
            <issue>3</issue>
            <fpage>666</fpage>
            <lpage>676</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1432-1327.2000.01032.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10651802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B39">
            <title>
               <p>Identification of the <it>Clostridium perfringens </it>genes involved in the adaptive response to oxidative stress</p>
            </title>
            <aug>
               <au>
                  <snm>Briolat</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Reysset</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>J Bacteriol</source>
            <pubdate>2002</pubdate>
            <volume>184</volume>
            <issue>9</issue>
            <fpage>2333</fpage>
            <lpage>2343</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">134984</pubid>
                  <pubid idtype="pmpid" link="fulltext">11948145</pubid>
                  <pubid idtype="doi">10.1128/JB.184.9.2333-2343.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B40">
            <title>
               <p><it>Trichomonas vaginalis </it>degrades nitric oxide and expresses a flavorubredoxin-like protein: a new pathogenic mechanism?</p>
            </title>
            <aug>
               <au>
                  <snm>Sarti</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Fiori</snm>
                  <fnm>PL</fnm>
               </au>
               <au>
                  <snm>Forte</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Rappelli</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Teixeira</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Mastronicola</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Sanciu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Giuffre</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Brunori</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>2004</pubdate>
            <volume>61</volume>
            <issue>5</issue>
            <fpage>618</fpage>
            <lpage>623</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00018-003-3413-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">15004700</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B41">
            <title>
               <p>Evidence for lateral transfer of genes encoding ferredoxins, nitroreductases, NADH oxidase, and alcohol dehydrogenase 3 from anaerobic prokaryotes to <it>Giardia lamblia </it>and <it>Entamoeba histolytica</it></p>
            </title>
            <aug>
               <au>
                  <snm>Nixon</snm>
                  <fnm>JEJ</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Field</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Morrison</snm>
                  <fnm>HG</fnm>
               </au>
               <au>
                  <snm>McArthur</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Sogin</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Loftus</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Samuelson</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>181</fpage>
            <lpage>190</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">118039</pubid>
                  <pubid idtype="pmpid" link="fulltext">12455953</pubid>
                  <pubid idtype="doi">10.1128/EC.1.2.181-190.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B42">
            <title>
               <p>Developmental gene regulation in <it>Giardia lamblia</it>: first evidence for an encystation-specific promoter and differential 5' mRNA processing</p>
            </title>
            <aug>
               <au>
                  <snm>Knodler</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Sv&#228;rd</snm>
                  <fnm>SG</fnm>
               </au>
               <au>
                  <snm>Silberman</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Davids</snm>
                  <fnm>BJ</fnm>
               </au>
               <au>
                  <snm>Gillin</snm>
                  <fnm>FD</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>1999</pubdate>
            <volume>34</volume>
            <issue>2</issue>
            <fpage>327</fpage>
            <lpage>340</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.1999.01602.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">10564476</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B43">
            <title>
               <p>Ecology of planktonic ciliates in marine food webs</p>
            </title>
            <aug>
               <au>
                  <snm>Pierce</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Rev Aquat Sci</source>
            <pubdate>1992</pubdate>
            <volume>6</volume>
            <issue>2</issue>
            <fpage>139</fpage>
            <lpage>181</lpage>
         </bibl>
         <bibl id="B44">
            <title>
               <p>Phagotrophy of ciliates</p>
            </title>
            <aug>
               <au>
                  <snm>Radek</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Hausmann</snm>
                  <fnm>K</fnm>
               </au>
            </aug>
            <source>Ciliates as organisms</source>
            <publisher>Stuttgart , Gustav Fischer Verlag GmbH</publisher>
            <editor>Hausmann K, Bradbury PC</editor>
            <pubdate>1996</pubdate>
            <fpage>197</fpage>
            <lpage>219</lpage>
         </bibl>
         <bibl id="B45">
            <title>
               <p>Enzymes and compartmentation of core energy metabolism of anaerobic protists - a special case in eukaryotic evolution?</p>
            </title>
            <aug>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Evolutionary relationships among protozoa</source>
            <publisher>Dordrecht, the Netherlands , Kluwer</publisher>
            <editor>Coombs GH, Vickerman K, Sleigh MA, Warren A</editor>
            <pubdate>1998</pubdate>
            <fpage>109</fpage>
            <lpage>131</lpage>
         </bibl>
         <bibl id="B46">
            <title>
               <p>Aldehyde dehydrogenase (CoA-acetylating) and the mechanism of ethanol formation in the amitochondriate protist, <it>Giardia lamblia</it></p>
            </title>
            <aug>
               <au>
                  <snm>S&#225;nchez</snm>
                  <fnm>LB</fnm>
               </au>
            </aug>
            <source>Arch Biochem Biophys</source>
            <pubdate>1998</pubdate>
            <volume>354</volume>
            <issue>1</issue>
            <fpage>57</fpage>
            <lpage>64</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1006/abbi.1998.0664</pubid>
                  <pubid idtype="pmpid" link="fulltext">9633598</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B47">
            <title>
               <p>Mind the gap: bridging the divide between clinical and molecular studies of the trichomonads</p>
            </title>
            <aug>
               <au>
                  <snm>Lyons</snm>
                  <fnm>EJ</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
            </aug>
            <source>Trends Parasitol</source>
            <pubdate>2004</pubdate>
            <volume>20</volume>
            <issue>5</issue>
            <fpage>204</fpage>
            <lpage>207</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.pt.2004.03.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">15105016</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B48">
            <title>
               <p>The anaerobic chytridiomycete fungus <it>Piromyces </it>sp. E2 produces ethanol via pyruvate:formate lyase and an alcohol dehydrogenase E</p>
            </title>
            <aug>
               <au>
                  <snm>Boxma</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Voncken</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Jannink</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>van Alen</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Akhmanova</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>van Weelden</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>van Hellemond</snm>
                  <fnm>JJ</fnm>
               </au>
               <au>
                  <snm>Ricard</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Huynen</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Tielens</snm>
                  <fnm>AG</fnm>
               </au>
               <au>
                  <snm>Hackstein</snm>
                  <fnm>JH</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>51</volume>
            <issue>5</issue>
            <fpage>1389</fpage>
            <lpage>1399</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1046/j.1365-2958.2003.03912.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">14982632</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B49">
            <title>
               <p>Bifunctional aldehyde/alcohol dehydrogenase (ADHE) in chlorophyte algal mitochondria</p>
            </title>
            <aug>
               <au>
                  <snm>Atteia</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>van Lis</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Mendoza-Hernandez</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Henze</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Martin</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Riveros-Rosas</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Gonzalez-Halphen</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Plant Mol Biol</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <issue>1-2</issue>
            <fpage>175</fpage>
            <lpage>188</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1023/B:PLAN.0000009274.19340.36</pubid>
                  <pubid idtype="pmpid" link="fulltext">14756315</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B50">
            <title>
               <p>Phylogenetic classification and the universal tree</p>
            </title>
            <aug>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1999</pubdate>
            <volume>284</volume>
            <fpage>2124</fpage>
            <lpage>2129</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.284.5423.2124</pubid>
                  <pubid idtype="pmpid" link="fulltext">10381871</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B51">
            <title>
               <p>The presence of a haloarchaeal type tyrosyl-tRNA synthetase marks the opisthokonts as monophyletic</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Xu</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2005</pubdate>
            <volume>22</volume>
            <issue>11</issue>
            <fpage>2142</fpage>
            <lpage>2146</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msi221</pubid>
                  <pubid idtype="pmpid" link="fulltext">16049196</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B52">
            <title>
               <p>Phylogeny of eukaryotes based on ribosomal RNA: long-branch attraction and models of sequence evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Philippe</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Germot</snm>
                  <fnm>A</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2000</pubdate>
            <volume>17</volume>
            <issue>5</issue>
            <fpage>830</fpage>
            <lpage>834</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">10917801</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B53">
            <title>
               <p>The excavate protozoan phyla Metamonada Grasse emend. (Anaeromonadea, Parabasalia, Carpediemonas, Eopharyngia) and Loukozoa emend. (Jakobea, Malawimonas): their evolutionary affinities and new higher taxa</p>
            </title>
            <aug>
               <au>
                  <snm>Cavalier-Smith</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>53</volume>
            <issue>Pt 6</issue>
            <fpage>1741</fpage>
            <lpage>1758</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.02548-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">14657102</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B54">
            <title>
               <p>Early branching eukaryotes?</p>
            </title>
            <aug>
               <au>
                  <snm>Embley</snm>
                  <fnm>TM</fnm>
               </au>
               <au>
                  <snm>Hirt</snm>
                  <fnm>RP</fnm>
               </au>
            </aug>
            <source>Curr Opin Genet Dev</source>
            <pubdate>1998</pubdate>
            <volume>8</volume>
            <fpage>624</fpage>
            <lpage>629</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0959-437X(98)80029-4</pubid>
                  <pubid idtype="pmpid" link="fulltext">9914207</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B55">
            <title>
               <p>Retortamonad flagellates are closely related to diplomonads - implications for the history of mitochondrial function in eukaryote evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Silberman</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Simpson</snm>
                  <fnm>AGB</fnm>
               </au>
               <au>
                  <snm>Kulda</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Cepicka</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Hampl</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Johnson</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2002</pubdate>
            <volume>19</volume>
            <issue>5</issue>
            <fpage>777</fpage>
            <lpage>786</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">11961110</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B56">
            <title>
               <p>Gene discovery in the <it>Entamoeba invadens </it>genome</p>
            </title>
            <aug>
               <au>
                  <snm>Wang</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Samuelson</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Eichinger</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Paul</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Van Dellen</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Anderson</snm>
                  <fnm>I</fnm>
               </au>
               <au>
                  <snm>Loftus</snm>
                  <fnm>B</fnm>
               </au>
            </aug>
            <source>Mol Biochem Parasitol</source>
            <pubdate>2003</pubdate>
            <volume>129</volume>
            <issue>1</issue>
            <fpage>23</fpage>
            <lpage>31</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0166-6851(03)00073-2</pubid>
                  <pubid idtype="pmpid" link="fulltext">12798503</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B57">
            <title>
               <p>Order Diplomonadida</p>
            </title>
            <aug>
               <au>
                  <snm>Brugerolle</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>JJ</fnm>
               </au>
            </aug>
            <source>An Illustrated Guide to the Protozoa, 2nd edn</source>
            <publisher>Lawrence, Kansas , Society of Protozoologists</publisher>
            <editor>Lee JJ, Leedale GF, Bradbury P</editor>
            <pubdate>2002</pubdate>
            <fpage>1125</fpage>
            <lpage>1135</lpage>
         </bibl>
         <bibl id="B58">
            <title>
               <p>Evolutionary analyses of the small subunit of glutamate synthase: gene order conservation, gene fusions and prokaryote-to-eukaryote lateral gene transfers</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
               <au>
                  <snm>Roger</snm>
                  <fnm>AJ</fnm>
               </au>
            </aug>
            <source>Eukaryot Cell</source>
            <pubdate>2002</pubdate>
            <volume>1</volume>
            <issue>2</issue>
            <fpage>304</fpage>
            <lpage>310</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">118040</pubid>
                  <pubid idtype="pmpid" link="fulltext">12455964</pubid>
                  <pubid idtype="doi">10.1128/EC.1.2.304-310.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B59">
            <title>
               <p>Archaebacterial relationships of the phosphoenolpyruvate carboxykinase gene reveal mosaicism of <it>Giardia intestinalis </it>core metabolism</p>
            </title>
            <aug>
               <au>
                  <snm>Suguri</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Henze</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>S&#225;nchez</snm>
                  <fnm>LB</fnm>
               </au>
               <au>
                  <snm>Moore</snm>
                  <fnm>DV</fnm>
               </au>
               <au>
                  <snm>M&#252;ller</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>48</volume>
            <issue>4</issue>
            <fpage>493</fpage>
            <lpage>497</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2001.tb00184.x</pubid>
                  <pubid idtype="pmpid">11456327</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B60">
            <title>
               <p>Improved method for the monoxenic cultivation of <it>Entamoeba histolytica </it>Schaudinn, 1903 and <it>E. histolytica</it>-like amebae with trypanosomatids</p>
            </title>
            <aug>
               <au>
                  <snm>Diamond</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>J Parasitol</source>
            <pubdate>1968</pubdate>
            <volume>54</volume>
            <issue>4</issue>
            <fpage>715</fpage>
            <lpage>719</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.2307/3277027</pubid>
                  <pubid idtype="pmpid">4319344</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B61">
            <title>
               <p>Methods for cultivation of luminal parasitic protists of clinical importance</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
               <au>
                  <snm>Diamond</snm>
                  <fnm>LS</fnm>
               </au>
            </aug>
            <source>Clin Microbiol Rev</source>
            <pubdate>2002</pubdate>
            <volume>15</volume>
            <issue>3</issue>
            <fpage>329</fpage>
            <lpage>341</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">118080</pubid>
                  <pubid idtype="pmpid" link="fulltext">12097242</pubid>
                  <pubid idtype="doi">10.1128/CMR.15.3.329-341.2002</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B62">
            <title>
               <p>Interactions between planktonic microalgae and protozoan grazers</p>
            </title>
            <aug>
               <au>
                  <snm>Tillmann</snm>
                  <fnm>U</fnm>
               </au>
            </aug>
            <source>J Eukaryot Microbiol</source>
            <pubdate>2004</pubdate>
            <volume>51</volume>
            <issue>2</issue>
            <fpage>156</fpage>
            <lpage>168</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1550-7408.2004.tb00540.x</pubid>
                  <pubid idtype="pmpid">15134250</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B63">
            <title>
               <p>Lateral gene transfer and the complex distribution of insertions in eukaryotic enolase</p>
            </title>
            <aug>
               <au>
                  <snm>Harper</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>Gene</source>
            <pubdate>2004</pubdate>
            <volume>340</volume>
            <issue>2</issue>
            <fpage>227</fpage>
            <lpage>235</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.gene.2004.06.048</pubid>
                  <pubid idtype="pmpid" link="fulltext">15475163</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B64">
            <title>
               <p>An enigmatic GAPDH gene in the symbiotic dinoflagellate genus <it>Symbiodinium </it>and its related species (the order Suessiales): possible lateral gene transfer between two eukaryotic algae, dinoflagellate and euglenophyte</p>
            </title>
            <aug>
               <au>
                  <snm>Takishita</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Ishida</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Maruyama</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>Protist</source>
            <pubdate>2003</pubdate>
            <volume>154</volume>
            <issue>3-4</issue>
            <fpage>443</fpage>
            <lpage>454</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1078/143446103322454176</pubid>
                  <pubid idtype="pmpid">14658500</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B65">
            <title>
               <p>A light and electron microscopical study of a new, polymorphic free-living amoeba, <it>Phreatamoeba balamuthi </it>n. g., n. sp.</p>
            </title>
            <aug>
               <au>
                  <snm>Chavez</snm>
                  <fnm>LA</fnm>
               </au>
               <au>
                  <snm>Balamuth</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Gong</snm>
                  <fnm>T</fnm>
               </au>
            </aug>
            <source>J Protozool</source>
            <pubdate>1986</pubdate>
            <volume>33</volume>
            <issue>3</issue>
            <fpage>397</fpage>
            <lpage>404</lpage>
            <xrefbib>
               <pubid idtype="pmpid">3746722</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B66">
            <title>
               <p>DNA purification from polysaccharide-rich cells</p>
            </title>
            <aug>
               <au>
                  <snm>Clark</snm>
                  <fnm>CG</fnm>
               </au>
            </aug>
            <source>Protocols in Protozoology, Vol 1</source>
            <publisher>Lawrence, Kansas , Allen Press</publisher>
            <editor>Lee JJ, Soldo AT</editor>
            <pubdate>1992</pubdate>
            <fpage>D</fpage>
            <lpage>3.1-D-3.2</lpage>
         </bibl>
         <bibl id="B67">
            <title>
               <p>Protozoan Genomes</p>
            </title>
            <url>http://www.sanger.ac.uk/Projects/Protozoa/</url>
         </bibl>
         <bibl id="B68">
            <title>
               <p>National Center for Biotechnology Information</p>
            </title>
            <url>http://www.ncbi.nlm.nih.gov/</url>
         </bibl>
         <bibl id="B69">
            <title>
               <p>The Institute for Genomic Research</p>
            </title>
            <url>http://www.tigr.org/</url>
         </bibl>
         <bibl id="B70">
            <title>
               <p>DOE Joint Genome Institute</p>
            </title>
            <url>http://www.jgi.doe.gov/</url>
         </bibl>
         <bibl id="B71">
            <title>
               <p>The Wellcome Trust Sanger Institute</p>
            </title>
            <url>http://www.sanger.ac.uk/</url>
         </bibl>
         <bibl id="B72">
            <title>
               <p>Genoscope</p>
            </title>
            <url>http://www.genoscope.cns.fr/</url>
         </bibl>
         <bibl id="B73">
            <title>
               <p>CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice</p>
            </title>
            <aug>
               <au>
                  <snm>Thompson</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Higgins</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gibson</snm>
                  <fnm>TJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>1994</pubdate>
            <volume>22</volume>
            <fpage>4673</fpage>
            <lpage>4680</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">308517</pubid>
                  <pubid idtype="pmpid">7984417</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B74">
            <title>
               <p>Compositional bias may affect both DNA-based and protein-based phylogenetic reconstructions</p>
            </title>
            <aug>
               <au>
                  <snm>Foster</snm>
                  <fnm>PG</fnm>
               </au>
               <au>
                  <snm>Hickey</snm>
                  <fnm>DA</fnm>
               </au>
            </aug>
            <source>J Mol Evol</source>
            <pubdate>1999</pubdate>
            <volume>48</volume>
            <fpage>284</fpage>
            <lpage>290</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/PL00006471</pubid>
                  <pubid idtype="pmpid" link="fulltext">10093217</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B75">
            <title>
               <p>MrBayes 3: Bayesian phylogenetic inference under mixed models</p>
            </title>
            <aug>
               <au>
                  <snm>Ronquist</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Huelsenbeck</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2003</pubdate>
            <volume>19</volume>
            <issue>12</issue>
            <fpage>1572</fpage>
            <lpage>1574</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btg180</pubid>
                  <pubid idtype="pmpid" link="fulltext">12912839</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>
