<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2148-8-203</ui>
   <ji>1471-2148</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Chlamydial genes shed light on the evolution of photoautotrophic eukaryotes</p>
         </title>
         <aug>
            <au id="A1" ca="yes" ce="yes">
               <snm>Becker</snm>
               <fnm>Burkhard</fnm>
               <insr iid="I1"/>
               <email>b.becker@uni-koeln.de</email>
            </au>
            <au id="A2" ce="yes">
               <snm>Hoef-Emden</snm>
               <fnm>Kerstin</fnm>
               <insr iid="I1"/>
               <email>kerstin.hoef-emden@uni-koeln.de</email>
            </au>
            <au id="A3" ca="yes">
               <snm>Melkonian</snm>
               <fnm>Michael</fnm>
               <insr iid="I1"/>
               <email>michael.melkonian@uni-koeln.de</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Botanisches Institut, Universit&#228;t zu K&#246;ln, Gyrhofstr. 15, 50931 K&#246;ln, Germany</p>
            </ins>
         </insg>
         <source>BMC Evolutionary Biology</source>
         <issn>1471-2148</issn>
         <pubdate>2008</pubdate>
         <volume>8</volume>
         <issue>1</issue>
         <fpage>203</fpage>
         <url>http://www.biomedcentral.com/1471-2148/8/203</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="pmpid">18627593</pubid>
               <pubid idtype="doi">10.1186/1471-2148-8-203</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>22</day>
               <month>4</month>
               <year>2008</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>15</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>15</day>
               <month>7</month>
               <year>2008</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2008</year>
         <collab>Becker et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>Chlamydiae are obligate intracellular bacteria of protists, invertebrates and vertebrates, but have not been found to date in photosynthetic eukaryotes (algae and embryophytes). Genes of putative chlamydial origin, however, are present in significant numbers in sequenced genomes of photosynthetic eukaryotes. It has been suggested that such genes were acquired by an ancient horizontal gene transfer from Chlamydiae to the ancestor of photosynthetic eukaryotes. To further test this hypothesis, an extensive search for proteins of chlamydial origin was performed using several recently sequenced algal genomes and EST databases, and the proteins subjected to phylogenetic analyses.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>A total of 39 proteins of chlamydial origin were retrieved from the photosynthetic eukaryotes analyzed and their identity verified through phylogenetic analyses. The distribution of the chlamydial proteins among four groups of photosynthetic eukaryotes (Viridiplantae, Rhodoplantae, Glaucoplantae, Bacillariophyta) was complex suggesting multiple acquisitions and losses. Evidence is presented that all except one of the chlamydial genes originated from an ancient endosymbiosis of a chlamydial bacterium into the ancestor of the Plantae before their divergence into Viridiplantae, Rhodoplantae and Glaucoplantae, i.e. more than 1.1 BYA. The chlamydial proteins subsequently spread through secondary plastid endosymbioses to other eukaryotes. Of 20 chlamydial proteins recovered from the genomes of two Bacillariophyta, 10 were of rhodoplant, and 10 of viridiplant origin suggesting that they were acquired by two different secondary endosymbioses. Phylogenetic analyses of concatenated sequences demonstrated that the viridiplant secondary endosymbiosis likely occurred before the divergence of Chlorophyta and Streptophyta.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusion</p>
               </st>
               <p>We identified 39 proteins of chlamydial origin in photosynthetic eukaryotes signaling an ancient invasion of the ancestor of the Plantae by a chlamydial bacterium accompanied by horizontal gene transfer. Subsequently, chlamydial proteins spread through secondary endosymbioses to other eukaryotes. We conclude that intracellular chlamydiae likely persisted throughout the early history of the Plantae donating genes to their hosts that replaced their cyanobacterial/plastid homologs thus shaping early algal/plant evolution before they eventually vanished.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification type="bmc" subtype="user_supplied_xml" id="endnote"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Transfer of genes between different genomes is now recognized as widespread and has been postulated to play an important role during evolution <abbrgrp><abbr bid="B1">1</abbr><abbr bid="B2">2</abbr></abbrgrp>. Gene transfer between different species is generally termed horizontal gene transfer (HGT) or lateral gene transfer (LGT). HGT is especially common in prokaryotes <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>, however, it has become recently clear it also occurs frequently in phagotrophic protists <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>. In addition, in eukaryotes a large number of genes were transferred from the evolving plastid and mitochondrion to the nuclear genome. This type of gene transfer has been named intracellular gene transfer (IGT, <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>) or endocytotic gene transfer (EGT, <abbrgrp><abbr bid="B4">4</abbr></abbrgrp>). HGT or EGT leads to an exchange of genes between distantly related organisms creating genetic chimeras. It allows the recipient to acquire new characters de novo and therefore challenges the traditional evolutionary concept that evolution proceeds by modification of existing genetic information. As is evident most clearly in EGT no gene appears to be immune to transfer, however, most of the recently transferred genes (HGT) appear not to include housekeeping genes <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B6">6</abbr></abbrgrp>. HGT is generally detected by phylogenetic analyses (reviewed in <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>) and is generally considered as a form of noise, obscuring phylogenetic signal and therefore interfering with the reconstruction of the evolution or a group of organisms <abbrgrp><abbr bid="B5">5</abbr><abbr bid="B8">8</abbr></abbrgrp>. However, Huang and Gogarten <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> suggested that ancient horizontal gene transfers are helpful for elucidating the evolutionary history of a group of organisms.</p>
         <p>Chlamydiae are well known donors for horizontal gene transfer events <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. Chlamydiae are an ancient group of obligate intracellular bacteria <abbrgrp><abbr bid="B10">10</abbr></abbrgrp>, probably most closely related to the Verrucomicrobia <abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. The presence of <it>Chlamydia</it>-type genes in plants (or plant-like genes in Chlamydiae) has been reported repeatedly <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr></abbrgrp>. Three different explanations have been forwarded: 1) HGT of plant genes to a chlamydial ancestor <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>, 2) HGT of chlamydial genes to plants <abbrgrp><abbr bid="B15">15</abbr></abbrgrp> and 3) an ancient phylogenetic relationship between Chlamydiae, Cyanobacteria, and chloroplasts <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. A list of chlamydial proteins with high similarity to plant proteins by Brinkman et al. <abbrgrp><abbr bid="B13">13</abbr></abbrgrp> included 37 proteins involved in diverse functions such as protein and fatty acid synthesis, glycolysis, and nucleotide metabolism. Whereas a list by Horn et al. <abbrgrp><abbr bid="B14">14</abbr></abbrgrp> using proteins of the <it>Parachlamydia</it>-related symbiont of acanthamoebae (recently proposed as Candidatus <it>Protochlamydia amoebophila </it><abbrgrp><abbr bid="B17">17</abbr></abbrgrp>) as query in BLAST searches found 137 proteins from <it>Protochlamydia </it>with high similarity to plant proteins. Recently, a list of 14 chlamydial genes present in Viridiplantae and <it>Cyanidioschyzon merolae </it>(Rhodoplantae) has been published by Huang and Gogarten <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. These authors postulated an ancient EGT event to explain the large number of genes transferred <abbrgrp><abbr bid="B16">16</abbr></abbrgrp>. Several additional genomes (including those of two diatoms, three chlorophytes and several additional cyanobacteria), and a large number of additional ESTs (including those of two glaucophytes and several red and green algae) have been recently sequenced. Therefore, we have reinvestigated the phylogenetic relationships between plastids, Cyanobacteria and Chlamydiae using an extended taxon sampling. We show that 39 chlamydial type genes are present in Glaucoplantae, Rhodoplantae, Viridiplantae and Bacillariophyta indicating an ancient origin for these chlamydial genes. However, the distribution of chlamydial proteins among different photoautotrophic eukaryotes is complex requiring an in depth phylogenetic analysis.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>A significant number of chlamydial genes occur in eukaryotic algae</p>
            </st>
            <p>To search for genes of putative chlamydial origin in eukaryotic algae, the genomes of two diatoms (<it>Thalassiosira pseudonana</it>, <it>Phaeodactylum tricornutum</it>), three green algae (<it>Ostreococcus tauri</it>, <it>Ostreococcus lucimarinus, Chlamydomonas reinhardtii</it>), and the red alga <it>Cyanidioschyzon merolae </it>were screened for the presence of protein-coding genes most similar to genes in the genome of Candidatus <it>Protochlamydia amoebophila </it>(see Methods). The initial screen identified 89 putative chlamydial proteins in these algae (Additional File <supplr sid="S1">1</supplr>). Maximum likelihood phylogenetic analyses were performed with each of the 89 proteins using a taxon sampling that included other eukaryotic algae and chlamydiae, embryophytes, cyanobacteria, and other bacteria (an average of 30 taxa for each protein); 37 proteins revealed a monophyletic lineage (with 70&#8211;100% bootstrap support (BS) for 33 proteins, and 57&#8211;68% BS for 4 proteins) consisting of chlamydiae and at least one of the three plastid-containing eukaryotic lineages represented by genomes (diatoms, green algae/embryophytes, and red algae; Table <tblr tid="T1">1</tblr>) to the exclusion of all other proteins. For two additional proteins (putative glycerol-3-phosphate acyltransferase (Fig. <figr fid="F1">1A</figr>), hypothetical protein pc0324 (no 32, Table <tblr tid="T1">1</tblr>)), the phylogenetic trees contained only chlamydiae and plastid-containing eukaryotes because proteins with significant similarities to the chlamydial proteins could not be retrieved from any other taxa. Among the remaining 50 phylogenetic trees, several trees also displayed a topological sister group relation between chlamydiae and plastid-containing eukaryotes, however, without bootstrap support (not shown).</p>
            <suppl id="S1">
               <title>
                  <p>Additional File 1</p>
               </title>
               <text>
                  <p>Additional Table 1. Proteins from Candidatus <it>Protochlamydia amoebophila </it>showing significant similarity to algal proteins.</p>
               </text>
               <file name="1471-2148-8-203-S1.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>Phylogenetic analysis of chlamydial genes in photoautotrophic eukaryotes</p>
               </caption>
               <text>
                  <p><b>Phylogenetic analysis of chlamydial genes in photoautotrophic eukaryotes</b>. Unrooted maximum likelihood trees of single-gene data sets. Evolutionary models of all data sets: WAG+I+&#915;, except for Fig. 1E: RtREV+I+&#915; (<abbrgrp><abbr bid="B85">85</abbr></abbrgrp>. Support values: maximum likelihood bootstrap/posterior probability; branches in bold: ML bootstrap > 95% and posterior probability of 1.0. Scale bars = substitutions per site. For enlarged trees with taxon names, see Additional File <supplr sid="S4">4</supplr>. (A) Glycerol-3-phosphate acyltransferase (EC 2.3.1.15; 21 taxa, 281 amino acid positions). (B) tRNA delta(2)-isopentenylpyrophosphate transferase (EC 2.5.2.8; 38 taxa, 240 positions). (C) 2-C-methyl-D-erythritol 4-phosphate cytidyltransferase (<it>isp</it>D; EC 2.7.7.60; 38 taxa, 198 positions). Rhodoplantae, Bacillariophyta and Viridiplantae group with the Chlamydiae. (D) Putative ribosome release/recycling factor (COG0233; 30 taxa, 171 positions). (E) Ribosomal large subunit pseudouridine synthase (EC 4.2.1.70; 38 taxa, 262 positions). (F) Folylpolyglutamate synthase (EC 6.3.2.17; 26 taxa, 208 positions).</p>
               </text>
               <graphic file="1471-2148-8-203-1"/>
            </fig>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Proteins of putative chlamydial origin in plastid-containing eukaryotes.</p>
               </caption>
               <tblbdy cols="4">
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>Gene<sup>1)</sup></p>
                     </c>
                     <c ca="center">
                        <p>Bootstrap support</p>
                     </c>
                     <c ca="center">
                        <p>Comments</p>
                     </c>
                  </r>
                  <r>
                     <c cspan="4">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3" ca="left">
                        <p>
                           <b>(Chlamydiae, Plantae)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>1</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Asparaginyl-tRNA synthetase</b>
                        </p>
                        <p>gi|46445980</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>2</p>
                     </c>
                     <c ca="left">
                        <p>Aspartate aminotransferase</p>
                        <p>gi|46446319</p>
                     </c>
                     <c ca="center">
                        <p>98</p>
                     </c>
                     <c ca="center">
                        <p><b>(CB F) </b>100% ML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>3</p>
                     </c>
                     <c ca="left">
                        <p>ATP/ADP translocase</p>
                     </c>
                     <c ca="center">
                        <p>n. a.</p>
                     </c>
                     <c ca="center">
                        <p>Only in intracellular parasites and plastid-containing eukaryotes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>4</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>tRNA delta(2)-isopentenylpyro-phosphate transferase</b>
                        </p>
                        <p>gi|46446877</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>5</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Diphosphate-fructose-6-phosphate 1-phosphotransferase</b>
                        </p>
                        <p>gi|46446514</p>
                     </c>
                     <c ca="center">
                        <p>58</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c cspan="3" ca="left">
                        <p>
                           <b>(Chlamydiae, Viridiplantae, Rhodoplantae)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>6</p>
                     </c>
                     <c ca="left">
                        <p>Isopentenyl monophosphate kinase (ISPE)</p>
                        <p>gi|46447223</p>
                     </c>
                     <c ca="center">
                        <p>74</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>7</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Queuine tRNA-ribosyltransferase</b>
                        </p>
                        <p>gi|46446428</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p><it>Dictyostelium </it>sister to <b>[P]</b></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>8</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative 7-dehydrocholesterol reductase</b>
                        </p>
                        <p>gi|46446854</p>
                     </c>
                     <c ca="center">
                        <p>94</p>
                     </c>
                     <c ca="center">
                        <p>Only eukaryotes except <it>Protochlamydia </it>and <it>Coxiella, Coxiella </it>sister to <it>Protochlamydia </it>(87% ML)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>9</p>
                     </c>
                     <c ca="left">
                        <p>Putative 23S rRNA (Uracil-5-)-methyltransferase</p>
                        <p>gi|46447632</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>10</p>
                     </c>
                     <c ca="left">
                        <p>Putative 4-diphosphocytidyl-2C-methyl-D-erythritol synthase (ISPD)</p>
                        <p>gi|46445961</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c ca="center">
                        <p><b>(CB F) </b>73% ML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>11</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Hypothetical protein pc1328</b>
                        </p>
                        <p>gi|46446962</p>
                     </c>
                     <c ca="center">
                        <p>82</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>12</p>
                     </c>
                     <c ca="left">
                        <p>Putative glycerol-3-phosphate acyltransferase</p>
                        <p>gi|46446952</p>
                     </c>
                     <c ca="center">
                        <p>n. a.</p>
                     </c>
                     <c ca="center">
                        <p>Only present in Chlamydiae and plastid containing eukaryotes</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>13</p>
                     </c>
                     <c ca="left">
                        <p>Probable polyribonucleotide nucleotidyltransferase</p>
                        <p>gi|46446277</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p><b>(CB F) </b>100% ML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>14</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Probable S-adenosyl-methyltransferase</b>
                        </p>
                        <p>gi|46445945</p>
                     </c>
                     <c ca="center">
                        <p>62</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>15</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative tRNA pseudouridylate synthase I</b>
                        </p>
                        <p>gi|46445962</p>
                     </c>
                     <c ca="center">
                        <p>79</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>16</p>
                     </c>
                     <c ca="left">
                        <p>3-oxoacyl-(acyl carrier protein) synthase (FABB)</p>
                        <p>gi|46446872</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>Bacillariophyceae in eubacterial clade</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>17</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative endopeptidase (ATP-dependent serine protease) La</b>
                        </p>
                        <p>gi|46446096</p>
                     </c>
                     <c ca="center">
                        <p>83</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>18</p>
                     </c>
                     <c ca="left">
                        <p>Probable tyrosine-tRNA ligase</p>
                        <p>gi|46446803</p>
                     </c>
                     <c ca="center">
                        <p>97</p>
                     </c>
                     <c ca="center">
                        <p>Bacillariophyceae in bacterial clade</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>19</p>
                     </c>
                     <c ca="left">
                        <p>Probable isoamylase</p>
                        <p>gi|46446740</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>20</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Probable S-adenosyl-methyltransferase</b>
                        </p>
                        <p>gi|46445945</p>
                     </c>
                     <c ca="center">
                        <p>81</p>
                     </c>
                     <c ca="center">
                        <p>Bacillariophyceae in eukaryotic clade</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>21</p>
                     </c>
                     <c ca="left">
                        <p>Putative oligoendopeptidase F</p>
                        <p>gi|46446812</p>
                     </c>
                     <c ca="center">
                        <p>73</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>(Chlamydiae, Viridiplantae)</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>22</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Phosphate transporter</b>
                        </p>
                        <p>
                           <b>gi|46445733</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>88</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>23</p>
                     </c>
                     <c ca="left">
                        <p>Phosphoglycerate mutase</p>
                        <p>gi|46399436</p>
                     </c>
                     <c ca="center">
                        <p>94</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>24</p>
                     </c>
                     <c ca="left">
                        <p>Probable gcpE protein (ISPG)</p>
                        <p>gi|46446374</p>
                     </c>
                     <c ca="center">
                        <p>91</p>
                     </c>
                     <c ca="center">
                        <p><b>(CB G R) </b>100% ML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>25</p>
                     </c>
                     <c ca="left">
                        <p>Enoyl-(acyl carrier protein) reductase (FABI)</p>
                        <p>gi|46446786</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>(CB R)</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>26</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>DNA mismatch repair protein (MUTS)</b>
                        </p>
                        <p>gi|46446855</p>
                     </c>
                     <c ca="center">
                        <p>54</p>
                     </c>
                     <c ca="center">
                        <p><b>(CB R) </b>95% ML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>27</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative lipoate-protein ligase</b>
                        </p>
                        <p>gi|46447472</p>
                     </c>
                     <c ca="center">
                        <p>98</p>
                     </c>
                     <c ca="center">
                        <p><it>Dictyostelium </it>sister to <b>[B V]</b></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>28</p>
                     </c>
                     <c ca="left">
                        <p>Gut Q protein</p>
                        <p>gi|46447416</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>29</p>
                     </c>
                     <c ca="left">
                        <p>Malate dehydrogenase</p>
                        <p>gi|46447406</p>
                     </c>
                     <c ca="center">
                        <p>92</p>
                     </c>
                     <c ca="center">
                        <p>NADP<sup>+</sup>-dependent plastidial homologue of the Viridiplantae</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>30</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative ribosome recycling factor</b>
                        </p>
                        <p>gi|46447510</p>
                     </c>
                     <c ca="center">
                        <p>68</p>
                     </c>
                     <c ca="center">
                        <p>(<b>CB R G</b>) 88% ML</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>31</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative tyrosine/tryptophan transport protein</b>
                        </p>
                        <p>gi|46445802</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>32</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Hypothetical protein pc0324</b>
                        </p>
                        <p>gi|46445958</p>
                     </c>
                     <c ca="center">
                        <p>n. a.</p>
                     </c>
                     <c ca="center">
                        <p>Only present in Viridiplantae and Protochlamydia</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>33</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Hypothetical protein pc0378</b>
                        </p>
                        <p>gi|46446012|</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>34</p>
                     </c>
                     <c ca="left">
                        <p>Probable 3-deoxy-manno-octulosonate cytidylyltransferase (CMP-KDO synthetase)</p>
                        <p>gi|46400100</p>
                     </c>
                     <c ca="center">
                        <p>86</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>(Chlamydiae, Rhodoplantae)</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>35</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Probable 23S RNA-specific pseudouridine synthase D</b>
                        </p>
                        <p>gi|46445989</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>36</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Hypothetical protein pc0339</b>
                        </p>
                        <p>gi|46445973</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p><it>Geobacter </it>sister to <b>[V]</b></p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>37</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Cysteinyl-tRNA synthetase</b>
                        </p>
                        <p>gi|46446869</p>
                     </c>
                     <c ca="center">
                        <p>100</p>
                     </c>
                     <c ca="center">
                        <p><it>Leptosira </it>sister to <b>[C P]</b></p>
                     </c>
                  </r>
                  <r>
                     <c>
                        <p/>
                     </c>
                     <c ca="left">
                        <p>
                           <b>(Chlamydiae, Bacillariophyceae)</b>
                        </p>
                     </c>
                     <c>
                        <p/>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>38</p>
                     </c>
                     <c ca="left">
                        <p>
                           <b>Putative folylpolyglutamate synthase</b>
                        </p>
                        <p>gi|46447260</p>
                     </c>
                     <c ca="center">
                        <p>99</p>
                     </c>
                     <c>
                        <p/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>39</p>
                     </c>
                     <c ca="left">
                        <p>Transketolase</p>
                        <p>gi|46447148</p>
                     </c>
                     <c ca="center">
                        <p>70</p>
                     </c>
                     <c ca="center">
                        <p><it>Dictyostelium </it>sister to <b>[B], [V G CB]</b></p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>39 proteins were identified by ML analyses. The proteins are grouped according to the observed tree topologies in the maximum likelihood analyses. The bootstrap support (ML) for the indicated clades is given. For 20 genes (in bold letters) this is to our knowledge the first detailed phylogenetic analysis. See Additional File <supplr sid="S2">2</supplr> for an extended version of Table 1 including tree topologies.</p>
                  <p><sup>1) </sup>The accession number and annotation for the gene from <it>Protochlamydia </it>is given. n. a. not applicable, A Apicomplexa, B Bacillariophyta, C Chlamydiae except Protochlamydia, CB Cyanobacteria, CP Chlorophyta, F Firmicutes, G Glaucoplantae, P Protochlamydia, R Rhodoplantae, SP Streptophyta, V Viridiplantae.</p>
               </tblfn>
            </tbl>
            <p>The proteins of putative chlamydial origin in plastid-containing eukaryotes perform a broad spectrum of functions with isoprenoid, fatty acid and carbohydrate metabolism, and phosphate homeostasis featuring prominently (Additional File <supplr sid="S1">1</supplr>). All are encoded on the host's nuclear genome, and most (but not all, e.g. CMP-KDO synthetase) are predicted to be targeted to the plastid. Metabolic pathways involving putative chlamydial proteins reveal a chimerical origin (some genes of the plastidic isoprenoid and fatty acid biosynthesis pathways are of cyanobacterial origin).</p>
         </sec>
         <sec>
            <st>
               <p>Complex distribution of chlamydial genes among plastid-containing eukaryotes</p>
            </st>
            <p>The distribution of the 39 chlamydial proteins among four groups of photosynthetic eukaryotes (Viridiplantae (green algae and embryophytes), Rhodoplantae (red algae), Glaucoplantae (glaucophytes), and Bacillariophyta (diatoms)) is shown in a Venn diagram (Fig. <figr fid="F2">2</figr>). The largest number of chlamydial proteins (34) was found in the Viridiplantae, followed by the Rhodoplantae (24), Bacillariophyta (22) and Glaucoplantae (5). It should be noted that the low number of chlamydial proteins recovered from the Glaucoplantae presumably relates to that fact that no genome of this small, yet phylogenetically important, algal lineage has been sequenced to date and the chlamydial proteins had to be retrieved from the limited EST (cDNA) data available for two Glaucoplantae, <it>Cyanophora paradoxa </it>and <it>Glaucocystis nostochinearum</it>. The Viridiplantae contain the largest number of unique chlamydial proteins (9), followed by the Bacillariophyta (2), whereas no unique chlamydial proteins were recovered from the Rhodoplantae and Glaucoplantae. Viridiplantae and Rhodoplantae share 21, Viridiplantae and Bacillariophyta 17, and Rhodoplantae and Bacillariophyta 16 chlamydial proteins. Both Viridiplantae and Rhodoplantae share the same 5 chlamydial proteins with the Glaucoplantae (four of which are also present in the Bacillariophyta). Seven chlamydial proteins are unique to Viridiplantae and Rhodoplantae, 4 to Viridiplantae and Bacillariophyta, and 3 to Rhodoplantae and Bacillariophyta. Chlamydial proteins involved in the same biosynthetic pathway may be differentially distributed in the different algal lineages. In the plastidic isoprenoid pathway, the chlamydial genes <it>isp</it>D and <it>isp</it>E (nos 10 and 6 respectively, Table <tblr tid="T1">1</tblr>) are present in the Rhodoplantae and Viridiplantae, whereas the chlamydial <it>isp</it>G (gcpE; no 24, Table <tblr tid="T1">1</tblr>) is only present in the Viridiplantae and Bacillariophyta; the Rhodoplantae and Glaucoplantae contain the cyanobacterial homologue of <it>isp</it>G. Similarly, in the fatty acid synthesis pathway, the chlamydial gene <it>fab</it>B (no 16, Table <tblr tid="T1">1</tblr>) is present in both Rhodoplantae and Viridiplantae, whereas the chlamydial <it>fab</it>I (no 25, Table <tblr tid="T1">1</tblr>) is restricted to the Viridiplantae (and Bacillariophyta and Apicomplexa); the Rhodoplantae contain the cyanobacterial <it>fab</it>I. The complex distribution pattern of chlamydial proteins in the different lineages of photosynthetic eukaryotes called for an in depth analysis of the phylogeny of each chlamydial protein using a broader taxonomic sampling that included diverse chlamydiae, cyanobacteria and other groups of bacteria such as proteobacteria and Firmicutes (representative trees in Fig. <figr fid="F1">1A&#8211;F</figr>, summary in Table <tblr tid="T1">1</tblr> and Additional File <supplr sid="S2">2</supplr>).</p>
            <suppl id="S2">
               <title>
                  <p>Additional File 2</p>
               </title>
               <text>
                  <p>Additional Table 2. Proteins of putative chlamydial origin in plastid-containing eukaryotes (Extended version of Table <tblr tid="T1">1</tblr>).</p>
               </text>
               <file name="1471-2148-8-203-S2.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Venn diagram showing the number of chlamydial proteins shared by different photoautotrophic eukaryotes</p>
               </caption>
               <text>
                  <p>
                     <b>Venn diagram showing the number of chlamydial proteins shared by different photoautotrophic eukaryotes.</b>
                  </p>
               </text>
               <graphic file="1471-2148-8-203-2"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Chlamydial genes in photosynthetic eukaryotes signal an ancient, horizontal gene transfer event</p>
            </st>
            <p>The presence of a relatively small number of genes from another domain of life in a genome is usually regarded as signalling horizontal gene transfer (HGT). If a large number of foreign genes that have a single origin are involved, an endosymbiotic gene transfer (EGT) is often the favoured hypothesis. EGT is a special case of HGT, in which the donor (endosymbiont) transfers its genome or part of it to the acceptor (host) and physically persists for an extended period of time within the host. If the donor disappears from its host, it may be difficult, if not impossible, to distinguish between HGT and EGT. As HGT involves two genomes, the direction of gene flow is not <it>a priori </it>known, e.g. the presence of chlamydial proteins in photosynthetic eukaryotes could either signal HGT from chlamydiae to eukaryotes or HGT from eukaryotes to chlamydiae. Phylogenetic analyses can often resolve this conundrum. However, in photosynthetic eukaryotes an additional level of complexity is added by the presence of nuclear-encoded cyanobacterial genes. Since the 39 chlamydial proteins are, with a few notable exceptions (three chlamydial genes in <it>Dictyostelium </it>that may have originated by another HGT; nos 7, 27,39; Table <tblr tid="T1">1</tblr>), confined to plastid-containing eukaryotes, it is also conceivable that the presence of proteins of putative chlamydial origin in these eukaryotes simply reflects their phylogenetic relationship to the respective cyanobacterial homologues rather than an HGT between chlamydiae and eukaryotes, the former perhaps obscured by loss or modification of the respective genes in extant free-living cyanobacteria. Again phylogenetic analyses can help to reach an informed decision. The presence of cyanobacterial genes in the host genome has another connotation; it may indicate that HGT perhaps did not take place between the chlamydial and eukaryotic genomes directly but between chlamydial and cyanobacterial genomes either before or after the cyanobacterial endosymbiosis (note that the latter scenario requires the simultaneous presence of cyanobacteria and chlamydiae within the host). In this case the chlamydial genes were transferred to the nuclear genome by EGT from the genome of the cyanobacterial endosymbiont.</p>
            <p>For 9 of the 39 chlamydial proteins detected in plastid-containing eukaryotes, a cyanobacterial homologue could not be recovered (Fig. <figr fid="F1">1A</figr> and Table <tblr tid="T1">1</tblr>). Analyses of the remaining 30 chlamydial proteins revealed no specific phylogenetic relationships of the cyanobacterial and chlamydial proteins to the exclusion of other bacterial homologues (e.g. Fig. <figr fid="F1">1B&#8211;F</figr>). In fact, the cyanobacterial proteins sometimes formed strongly supported clades with proteins of other bacteria to the exclusion of proteins from chlamydiae (and their associated homologues in plastid-containing eukaryotes). Often, but not always, the cyanobacterial proteins were recovered as sisters to the respective proteins of the Firmicutes (e.g. polyribonucleotide nucleotidyltransferase (bootstrap support: 100/1.0 ML/PP; no 13, Table <tblr tid="T1">1</tblr>) and aspartate aminotransferase (100%/1.0 ML/PP; no 2, Table <tblr tid="T1">1</tblr>)). The cyanobacteria/Firmicutes sister group relationship had previously been inferred from multigene phylogenies (e.g. Battistuzzi et al. <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> using 32 proteins (7,597 positions) from 54 different bacterial strains)). Although this result does not rule out an ancient relationship between cyanobacteria and chlamydiae, it strongly suggests that the chlamydial proteins present in photosynthetic eukaryotes are not simply modified cyanobacterial proteins and thus are not of cyanobacterial but of chlamydial origin. That cyanobacterial homologues of chlamydial proteins can be readily identified as such when present in photosynthetic eukaryotes, is exemplified by three proteins with differential distribution in photosynthetic eukaryotes, the cyanobacterial homologue being associated with Glaucoplantae and/or Rhodoplantae (nos 24, 26, 30; Table <tblr tid="T1">1</tblr>, Fig. <figr fid="F1">1D</figr>), while the chlamydial homologue is present in Bacillariophyta and/or Viridiplantae.</p>
            <p>Phylogenetic analyses of the 39 chlamydial proteins also provide insight about the direction of HGT between chlamydiae and plastid-containing eukaryotes. Although two chlamydial proteins occur only in chlamydiae and plastid-containing eukaryotes (see above), and another chlamydial protein (ATP/ADP translocase, no. 3, Table <tblr tid="T1">1</tblr>) is present only in intracellular bacterial parasites and plastid-containing eukaryotes, all other chlamydial proteins have homologues among several major groups of bacteria and are thus embedded in the bacterial radiation. Conversely, the chlamydial proteins identified here are confined to plastid-containing eukaryotes among the eukaryote radiation (with the exception of the three chlamydial proteins found in <it>Dictyostelium</it>, see above). The presence of chlamydial genes in eukaryotes with secondary plastids (Bacillariophyta and Apicomplexa (Table <tblr tid="T1">1</tblr>)), and chlorarachniophytes (results not shown)) suggests that these genes have spread from Plantae (comprising Glaucoplantae, Rhodoplantae, and Viridiplantae) to other eukaryote supergroups such as heterokonts, alveolates, and Rhizaria together with their secondary, eukaryotic endosymbionts. In conclusion, chlamydial proteins in plastid-containing eukaryotes are most likely derived from chlamydiae by HGT/EGT.</p>
            <p>When and how often did HGT/EGT between chlamydiae and plastid-containing eukaryotes occur? For chlamydial proteins with homologues in all three lineages of the Plantae (5 chlamydial proteins; Table <tblr tid="T1">1</tblr>), a sister group relationship was recovered between chlamydiae and Plantae for each protein (e.g. Fig. <figr fid="F1">1B</figr>). This suggests that HGT from chlamydiae to plastid-containing eukaryotes predated the divergence of the Glaucoplantae, Rhodoplantae and Viridiplantae.</p>
            <p>For 8 additional proteins, a sister group relationship between chlamydiae and Rhodoplantae+Viridiplantae was observed (with BS support ranging from 79&#8211;100%; e.g. Fig. <figr fid="F1">1C</figr>; nos 6, 7, 10, 11, 15&#8211;18, Table <tblr tid="T1">1</tblr>) strongly suggesting that HGT for these proteins predated the split between Rhodoplantae and Viridiplantae. It is anticipated that most, if not all of these 8 proteins will also contain homologues in the Glaucoplantae, which are, however, currently unknown reflecting the paucity of genome information in the Glaucoplantae (see above). For another 7 chlamydial proteins in which a monophyletic origin of chlamydiae, Rhodoplantae, and Viridiplantae was recovered (Table <tblr tid="T1">1</tblr>), either the branching order among chlamydiae, Rhodoplantae and Viridiplantae was unresolved (BS support &#8804; 63%; nos 9, 13, 14, 19&#8211;21; Table <tblr tid="T1">1</tblr>) or the (single) protein occurred only in chlamydiae and plastid-containing eukaryotes (putative glycerol-3-phosphate acyltransferase; Fig. <figr fid="F1">1A</figr>). With the addition of more taxa, in particular from the Glaucoplantae, the branching order between chlamydiae and plastid-containing eukaryotes for these 7 chlamydial proteins may be resolved and a monophyletic origin of the chlamydial proteins in plastid-containing eukaryotes likely be revealed. Only one chlamydial protein (putative 7-dehydrocholesterol reductase: no 8, Table <tblr tid="T1">1</tblr>) deviates from this scheme (for this protein, long branch attraction between Rhodoplantae and chlorophytes, and the absence of any bacterial proteins other than Candidatus <it>Protochlamydia amoebophila </it>and <it>Coxiella </it>may have obscured phylogenetic relationships among taxa; Table <tblr tid="T1">1</tblr>).</p>
            <p>For chlamydial proteins that display a sister group relationship between chlamydiae and Viridiplantae (13 proteins; Table <tblr tid="T1">1</tblr>), either the homologue from Rhodoplantae is missing (7 proteins; nos 23, 27&#8211;29, 31&#8211;33) or the Rhodoplantae homologue is specifically associated (often as a sister group) with cyanobacteria (e.g. Fig. <figr fid="F1">1D</figr>; 6 proteins; nos 22, 24&#8211;26, 30, 34; Table <tblr tid="T1">1</tblr>). The first situation may reflect selective loss of chlamydial genes in the rhodoplant lineage, the second offers several contrasting explanations (see below).</p>
            <p>Chlamydial proteins that reveal a sister group relationship between chlamydiae and Rhodoplantae (3 proteins; Table <tblr tid="T1">1</tblr>) also display homologues in the Viridiplantae, whose positions, however, remained unresolved in the phylogenetic trees (Fig. <figr fid="F1">1E</figr>; nos 35&#8211;37; Table <tblr tid="T1">1</tblr>).</p>
            <p>Finally, chlamydial proteins showing a sister group relationship between chlamydiae and Bacillariophyta to the exclusion of Plantae (putative folylpolyglutamate synthase (Fig. <figr fid="F1">1F</figr>) and transketolase; nos 38 and 39 respectively; Table <tblr tid="T1">1</tblr>) either contain the three lineages of Plantae in a monophylum together with cyanobacteria (transketolase) or two lineages of Plantae (Rhodoplantae and Viridiplantae) together with cyanobacteria, Firmicutes and other bacteria in an unresolved radiation (Fig. <figr fid="F1">1F</figr>). It is likely that at least the transketolase of the Bacillariophyta originated by an HGT involving a chlamydial donor different from the donor(s) that contributed the chlamydial genes in the Plantae (see also <abbrgrp><abbr bid="B19">19</abbr></abbrgrp>).</p>
            <p>The overall conclusion from the phylogenetic analyses is that the vast majority of the 39 chlamydial proteins detected in photosynthetic eukaryotes during this study (a minimum of 31, perhaps all, except one) presumably entered the Plantae by HGT from chlamydiae before the divergence of the Glaucoplantae, Rhodoplantae and Viridiplantae.</p>
            <p>Another important question relates to the possible nature of the donor in the HGT/EGT of the chlamydial genes into plastid-containing eukaryotes. For 7 chlamydial proteins, Candidatus <it>Protochlamydia amoebophila </it>was the only member of the chlamydiae in the trees; proteins from other chlamydiae could not be retrieved from the data bases (Fig. <figr fid="F1">1F</figr>; Table <tblr tid="T1">1</tblr>). The phylogenetic tree for one of these proteins, asparaginyl-tRNA synthetase (no 1; Table <tblr tid="T1">1</tblr>), also included all three lineages of the Plantae, and in this case, <it>P. amoebophila </it>formed a weakly supported sister group to the Plantae + Bacillariophyta (58% BS support for Plantae + Bacillariophyta; Table <tblr tid="T1">1</tblr>). For the other 6 proteins, <it>P. amoebophila </it>was either in an unresolved position among the photosynthetic eukaryotes (3 proteins; nos 8, 9, 20; Table <tblr tid="T1">1</tblr>), was the only bacterial protein in a tree, that, in addition to <it>P. amoebophila</it>, comprised only one lineage of Plantae (Viridiplantae; hypothetical protein pc0324; no 32; Table <tblr tid="T1">1</tblr>) or was a sister group to the Bacillariophyta (e.g. Fig. <figr fid="F1">1F</figr>; <figr fid="F2">2</figr> proteins; nos 36, 38; Table <tblr tid="T1">1</tblr>).</p>
            <p>Four of the 5 trees that contained all three lineages of Plantae, displayed, in addition to <it>P. amoebophila</it>, other chlamydiae (Fig. <figr fid="F1">1B</figr>; nos 2&#8211;5; Table <tblr tid="T1">1</tblr>). In one of the 4 trees (aspartate aminotransferase; no 2; Table <tblr tid="T1">1</tblr>), the chlamydiae were paraphyletic (with <it>P. amoebophila </it>as a sister group to the Plantae (BS 77% for Plantae), in two others (nos 4 and 5; Table <tblr tid="T1">1</tblr>) they were monophyletic, with <it>P. amoebophila </it>as their first divergence, and the chlamydiae in sister group position to the Plantae. In one tree (ATP/ADP translocase; no 3; Table <tblr tid="T1">1</tblr>) relationships among the chlamydiae were unresolved. In the 4 trees, the chlamydiae (except <it>P. amoebophila</it>) exhibited relatively long branches, and in the case of the aspartate aminotransferase, this may have caused artificial attraction of these chlamydiae to a very long branch comprising all other bacteria (except cyanobacteria) rendering the chlamydiae paraphyletic (not shown).</p>
            <p>Extending the comparison to the other 28 phylogenetic trees that contained chlamydial proteins from both <it>P. amoebophila </it>and other chlamydial taxa, again two topologies emerged: a monophyletic chlamydiae (12 proteins; Table <tblr tid="T1">1</tblr>) in which <it>P. amoebophila </it>always represented the first divergence within the chlamydiae (e.g. Additional File <supplr sid="S3">3D</supplr>), or a paraphyletic (sometimes polyphyletic) divergence of the chlamydiae (16 proteins; Table <tblr tid="T1">1</tblr>) with <it>P. amoebophila </it>either in a sister group position (with or without BS support) to the chlamydial proteins of the plastid-containing eukaryotes (e.g. Additional File <supplr sid="S3">3E</supplr>; 9 proteins; nos 6, 7, 18, 22&#8211;24, 27, 31, 35), or positioned (albeit without resolution) within the Plantae (3 proteins; nos 12, 15, 16, Table <tblr tid="T1">1</tblr>). For 4 chlamydial proteins a sister group topology between the chlamydiae (excluding <it>P. amoebophila</it>) and either a single clade of plastid-containing eukaryotes (twice with Viridiplantae (nos 21, 28; Table <tblr tid="T1">1</tblr>), and once with Rhodoplantae (no. 19; Table <tblr tid="T1">1</tblr>); in nos 19 and 21 without bootstrap support; Table <tblr tid="T1">1</tblr>) or with two clades of plastid-containing eukaryotes (Viridiplantae and Bacillariophyta; no 26; Table <tblr tid="T1">1</tblr>) was observed. In all 10 phylogenetic trees in which the chlamydiae were paraphyletic and <it>P. amoebophila </it>was sister to the plastid-containing eukaryotes (see above), the chlamydiae (excluding <it>P. amoebophila</it>) exhibited much longer branches than the branches of both <it>P. amoebophila </it>and the plastid-containing eukaryotes (which may have drawn them closer to the long branches of other bacteria or conversely resulted in short branch attraction of <it>P. amoebophila </it>and the plastid-containing eukaryotes), again likely rendering the chlamydiae paraphyletic (see above). Support for the monophyly of the chlamydiae is strongest (&#8805; 95% BS; 6 proteins; nos 13, 17, 30, 33, 37, 39) in cases in which the branch lengths of <it>P. amoebophila </it>and the other 4&#8211;8 chlamydiae are of comparable lengths (not shown) or in which branch lengths of the plastid-containing eukaryotes and the bacteria (other than chlamydiae) are of similar lengths (RNA delta(2)-isopentenylpyrophosphate transferase; Additional File <supplr sid="S3">3B</supplr>; no 4; Table <tblr tid="T1">1</tblr>). When support for the monophyly of the chlamydiae was strong (&#8805; 95% BS), so was support for the sister group relationship between chlamydiae and plastid-containing eukaryotes (BS &#8805; 96%; 5 proteins; nos 4, 13, 17, 33, 37; Table <tblr tid="T1">1</tblr>).</p>
            <suppl id="S3">
               <title>
                  <p>Additional File 3</p>
               </title>
               <text>
                  <p>Additional Figure 1. Panels A-F.</p>
               </text>
               <file name="1471-2148-8-203-S3.pdf">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>From these data it is concluded (1) that the chlamydiae are monophyletic, (2) that the apparent paraphyly (polyphyly) of chlamydiae seen in some trees is the result of a long branch (or short branch) attraction artifact because of largely differing evolutionary rates between the genes of <it>P. amoebophila </it>and the later diverging intracellular chlamydial parasites of animal hosts, (3) that the later diverging chlamydiae lost several genes that <it>P. amoebophila </it>(and the plastid-containing eukaryotes) retained, (4) that the donor of the chlamydial genes found today in the plastid-containing eukaryotes was related to the common ancestor of <it>P. amoebophila </it>and other chlamydiae, and thus (5) that the HGT/EGT of chlamydial genes into plastid-containing eukaryotes occurred more than 700 MYA, the estimated time of divergence of <it>P</it>.<it>amoebophila </it>from other chlamydiae <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. The apparent ancient origin of the chlamydial genes in plastid-containing eukaryotes is independently supported by the recovered monophyly of five chlamydial proteins found to date in all three lineages of Plantae (with 58 &#8211; 100% BS support for the monophyly of the Plantae; Table <tblr tid="T1">1</tblr>). The divergence time between Rhodoplantae and Viridiplantae has been variously estimated, based on molecular clock analyses, to be at least 1,100 MYA <abbrgrp><abbr bid="B20">20</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp>.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogeny of chlamydial proteins supports the monophyly of the Plantae, but can chlamydial proteins also shed light on subsequent algal evolution?</p>
            </st>
            <p>Because chlamydial genes apparently entered the nuclear genome of plastid-containing eukaryotes before the divergence of the three major lineages of Plantae (see above), their fate can be traced through algal evolution and subsequent secondary endosymbioses in much the same way as that of plastidial proteins of cyanobacterial origin. Whereas the monophyly of the Plantae is rarely questioned today (but see <abbrgrp><abbr bid="B23">23</abbr></abbrgrp> and reviews by <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>), the sequence of evolutionary divergence of the three principal types of plastids that correspond to the three lineages of Plantae, namely cyanelles (Glaucoplantae), rhodoplasts (Rhodoplantae) and chloroplasts (Viridiplantae) still remains unresolved. All possible topologies of the three lineages of plastids (or the corresponding Plantae) have been recovered in phylogenetic analyses, i.e. a sister group of Rhodoplantae and Viridiplantae (R+V; <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp>), a sister group of Glaucoplantae and Rhodoplantae (G+R; <abbrgrp><abbr bid="B29">29</abbr><abbr bid="B30">30</abbr></abbrgrp>), or a sister group of Glaucoplantae and Viridiplantae (G+V; <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B31">31</abbr><abbr bid="B32">32</abbr><abbr bid="B33">33</abbr></abbrgrp>. Often plastid and host trees yielded conflicting results and statistical support was low, taxon sampling was insufficient, especially in analyses of host genes (10&#8211;16 taxa of Plantae; <abbrgrp><abbr bid="B33">33</abbr></abbrgrp>) or apparently long-branch attraction artefacts prevailed.</p>
            <p>Chlamydial proteins do not (yet) shed light on the sequence of divergence of the three lineages of Plantae, likely because too few such proteins are currently known from the Glaucoplantae. For the 5 chlamydial proteins present in all three lineages of the Plantae, none of the three topologies obtained for the Plantae was significantly supported by bootstrap values (Table <tblr tid="T1">1</tblr>). Two proteins supported a G+R sister group (nos 3 and 5, Table <tblr tid="T1">1</tblr>), whereas two other proteins supported a R+V sister group (nos 2 and 4, Table <tblr tid="T1">1</tblr>). The fifth chlamydial protein (Asparaginyl-tRNA synthetase, no. 1, Table <tblr tid="T1">1</tblr>) supported a G+V sister group, however the Viridiplantae were paraphyletic in this analysis (Table <tblr tid="T1">1</tblr>). For two additional proteins, chlamydial homologues were present only in the Viridiplantae, whereas the Glaucoplantae and Rhodoplantae contained the cyanobacterial homologues (nos 24 and 30, Table <tblr tid="T1">1</tblr>). The distribution of the latter two proteins among the three lineages of Plantae is most parsimoniously explained by either a G+R or R+V topology. To further evaluate the different topologies, phylogenetic analyses with concatenated data sets using <it>P. amoebophila </it>and two other chlamydiae as outgroup were performed. In a 4-protein analysis (aspartate aminotransferase, ATP/ADP translocase, tRNA delta(2)-isopentenylpyrophosphate transferase and diphosphate-fructose-6-phosphate 1-phosphotransferase; nos 2&#8211;5, Table <tblr tid="T1">1</tblr>) using 576 aa positions, a moderate (71%) BS support for a sister group relationship between Rhodoplantae and Viridiplantae (R+V) was obtained (data not shown). When more protein sequences of chlamydial origin will become available from the Glaucoplantae, chlamydial proteins may provide a unique opportunity to address the pattern of evolutionary divergence in the three lineages of Plantae.</p>
            <p>The complex distribution pattern of chlamydial proteins in the plastid-containing eukaryotes in combination with the detailed phylogenetic analyses (see above), suggest that chlamydial proteins were differentially lost from individual lineages of Plantae. Of the 13 chlamydial proteins recovered from Viridiplantae only, in seven cases chlamydial homologues were absent from Rhodoplantae suggesting that these were lost in the Rhodoplantae after their divergence from the Viridiplantae. For two of these proteins (nos 23 and 28, Table <tblr tid="T1">1</tblr>), the chlamydial homologues were also absent and thus presumably lost in the chlorophyte sublineage of the Viridiplantae, perhaps indicating that these proteins perform specific functions confined to the streptophyte sublineage of the Viridiplantae. It should be noted that taxon sampling in the Rhodoplantae was mostly limited to <it>Cyanidioschyzon merolae </it>and <it>Galdieria sulphuraria</it>, two members of a specialized group of thermophilic red algae with extremely small genomes (i.e. the Cyanidiophyta). It is thus possible, although not perhaps likely, that the absence of these chlamydial proteins may be confined to this sublineage of the Rhodoplantae and that more "typical" Rhodoplantae will be shown to contain them. For the other 6 chlamydial proteins confined to the Viridiplantae, the Rhodoplantae contain a cyanobacterial homologue (nos 22, 24&#8211;26, 30, 34, Table <tblr tid="T1">1</tblr>). This can be interpreted in different ways. It may be argued that this is indicative of a separate origin of chlamydial proteins in the Viridiplantae (but see results and discussion above). Alternatively, it indicates that both the cyanobacterial and the chlamydial homologues were present in the ancestor of the Plantae and persisted until after the divergence of Rhodoplantae and Viridiplantae, when they were differentially lost (the cyanobacterial homologue in the Viridiplantae and the chlamydial homologue in the Rhodoplantae (and likely also in the Glaucoplantae; nos 24 and 30, Table <tblr tid="T1">1</tblr>)). In case of the phosphate transporter (no. 22, Table <tblr tid="T1">1</tblr>), it is likely that the cyanobacterial homologue was lost only from the streptophyte sublineage of the Viridiplantae as the chlorophyte sublineage contains a cyanobacterial homologue (not shown). It is important to stress that in no case cyanobacterial and chlamydial homologues of the same protein have been found in a single taxon of Plantae (Table <tblr tid="T1">1</tblr>) suggesting that their functions (within the plastid) are redundant and their occurrence thus mutually exclusive. Although several different scenarios may explain the simultaneous presence of cyanobacterial and chlamydial homologues of a larger number of proteins throughout the early history of the Plantae, the most intriquing is certainly the simultaneous presence of two different intracellular bacteria throughout the early history of the Plantae, i.e. cyanobacteria and chlamydiae (see Discussion).</p>
         </sec>
         <sec>
            <st>
               <p>Origin of chlamydial proteins in the Bacillariophyta: a hidden secondary endosymbiosis</p>
            </st>
            <p>For 22 chlamydial proteins putative homologues were identified in the two sequenced genomes of the Bacillariophyta (Fig. <figr fid="F2">2</figr>). Two of these chlamydial proteins apparently occur exclusively in the Bacillariophyta and not in the Plantae (nos 38 and 39, Table <tblr tid="T1">1</tblr>) which instead contain cyanobacterial homologues of these proteins (e.g. Additional File <supplr sid="S3">3F</supplr>). Since it is well established that the Bacillariophyta (and the heterokonts in general) obtained their plastids through a secondary endosymbiosis involving a rhodoplant symbiont (reviews by <abbrgrp><abbr bid="B34">34</abbr><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>), either the rhodoplant symbiont of the heterokonts still contained both homologues (which were later differentially lost in the Rhodoplantae and the heterokonts) or, at least in the case of transketolase (no. 39, Table <tblr tid="T1">1</tblr>) more likely, the chlamydial protein was independently acquired by HGT in the Bacillariophyta. The latter scenario is favored for transketolase because <it>Dictyostelium </it>is a sister to the two Bacillariophyta, and all three lineages of Plantae lack the chlamydial homologue.</p>
            <p>For the remaining 20 chlamydial proteins, the Bacillariophyta formed a sister group with either the Rhodoplantae (10 proteins; RB clade, Table <tblr tid="T1">1</tblr>) or the Viridiplantae (10 proteins, VB clade; Table <tblr tid="T1">1</tblr>). Both topologies received similar support values, however, BS values were sometimes low. The four chlamydial proteins that were detected in all lineages of the Plantae, revealed exclusively an RB clade with variable BS support (94, 66, 80 and 55% BS values; Table <tblr tid="T1">1</tblr>). Of the 16 chlamydial proteins that occurred in both Rhodoplantae and Viridiplantae, 9 were also present in the Bacillariophyta (Fig. <figr fid="F2">2</figr>). Of those 9 proteins, one protein (no. 10; Table <tblr tid="T1">1</tblr>) revealed a strongly supported RB clade (94% BS), four proteins a VB clade (nos 6&#8211;8, 13, supported by BS values of 68, 54, 100, and 86%, respectively; Table <tblr tid="T1">1</tblr>), and the remaining 4 proteins exhibited either one or the other topology, but without BS support (nos 9, 11, 12, 17; Table <tblr tid="T1">1</tblr>). Of the 13 chlamydial proteins that were found only in the Viridiplantae, four were also detected in the Bacillariophyta (Fig. <figr fid="F2">2</figr>), for two of which the Bacillariophyta formed a clade with the Viridiplantae (no. 24; 92% BS; Table <tblr tid="T1">1</tblr>) or a sister group with the streptophyte sublineage of the Viridiplantae (no. 26; 100% BS; Table <tblr tid="T1">1</tblr>), for one protein, the VB topology was unsupported (no. 27; Table <tblr tid="T1">1</tblr>) and for the remaining protein, the Bacillariophyta were sister to the Apicomplexa (no. 25; 64% BS; Table <tblr tid="T1">1</tblr>). All three chlamydial proteins that were detected only in the Rhodoplantae (Fig. <figr fid="F2">2</figr>), also displayed homologues in the Bacillariophyta: for two proteins, a well-supported RB clade was recovered (nos 35 and 37; 97 and 96% BS support, respectively; Table <tblr tid="T1">1</tblr>), for the third protein, the Rhodoplantae and Bacillariophyta formed a clade that also included <it>P. amoebophila </it>with 100% BS support (no. 36; Table <tblr tid="T1">1</tblr>). It should be noted that in two of the chlamydial proteins revealing a well-supported RB clade (nos 3 and 35), the RB clade contained, in addition to the two Cyanidiophyta, also members of the Rhodophyta (the second major lineage of the Rhodoplantae according to <abbrgrp><abbr bid="B37">37</abbr></abbrgrp>).</p>
            <p>To study the phylogeny of the chlamydial proteins from the Bacillariophyta in more detail, unrooted maximum likelihood phylogenetic analyses of concatenated data sets were performed (Fig. <figr fid="F3">3</figr>). In the first analysis (Fig. <figr fid="F3">3A</figr>), 7 chlamydial proteins that individually had revealed an RB topology (with or without BS support; nos 1&#8211;4, 10, 12, 17; Table <tblr tid="T1">1</tblr>) were concatenated and subjected to phylogenetic analysis using 12 taxa (2675 amino acid positions; Fig. <figr fid="F3">3A</figr>). The taxa were selected to maximize taxon congruence with the second data set of proteins displaying a VB topology (Fig. <figr fid="F3">3B</figr>), i.e. two Bacillariophyta, the chlorophyte and streptophyte sublineages of the Viridiplantae (5 sequences), the Rhodoplantae (2 sequences, except for no. 10 for which only one rhodoplant sequence was recovered) and, if possible, 3 sequences of chlamydiae were included. The RB clade was recovered with 95% BS values (ML) and 1.0 posterior probability in the Bayesian analysis (Fig <figr fid="F3">3A</figr>). In a second analysis (Fig. <figr fid="F3">3B</figr>), 5 chlamydial proteins that had individually revealed a VB topology (nos 6, 7, 9, 11, 13; Table <tblr tid="T1">1</tblr>; it should be noted that only protein no. 13 received BS support for a VB clade) were concatenated and subjected to phylogenetic analyses using the same 12 taxa as in the concatenated analysis of the proteins with an RB topology (1736 amino acid positions; Fig <figr fid="F3">3B</figr>). In this analysis, the VB topology was strongly supported (>95% BS values in ML and 1.0 posterior probability in the Bayesian analysis; Fig. <figr fid="F3">3B</figr>). Although in the individual analyses of the 5 chlamydial proteins, the position of the Bacillariophyta with respect to the sublineages of the Viridiplantae was not resolved and BS values for 4 of the 5 proteins were below 50% (Table <tblr tid="T1">1</tblr>), the concatenated data set clearly indicated that the 5 chlamydial proteins of the Bacillariophyta were sister to the respective proteins of the Viridiplantae, i.e. they diverged before the split of the Viridiplantae into its chlorophyte and streptophyte sublineages (Fig. <figr fid="F3">3B</figr>).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>Phylogenetic analyses of chlamydial proteins in the Bacillariophyta support the occurrence of two independent HGT/EGT events</p>
               </caption>
               <text>
                  <p><b>Phylogenetic analyses of chlamydial proteins in the Bacillariophyta support the occurrence of two independent HGT/EGT events</b>. Unrooted maximum likelihood trees of concatenated data sets. Support values: maximum likelihood bootstrap/posterior probability. Scale bars = substitutions per site. (<b>A</b>) Concatenated data set of seven proteins showing relationship of diatoms to rhodoplants (12 taxa; 2675 amino acid positions). (<b>B</b>) Concatenated data set of five proteins showing a relationship of viridiplant and diatom genes (same 12 taxa as in Fig. 3A; 1736 amino acid positions). For a complete list of genes used in the two concatenated analyses see Additional File <supplr sid="S5">5</supplr>.</p>
               </text>
               <graphic file="1471-2148-8-203-3"/>
            </fig>
            <p>From these data the following tentative conclusions are drawn: (1) The Bacillariophyta obtained chlamydial genes from two sources of photosynthetic eukaryotes, the Rhodoplantae and the Viridiplantae, (2) it is very likely that the chlamydial proteins forming RB clades were obtained during the secondary endosymbiosis of a rhodoplant symbiont into a heterokont (or chromalveolate) host, (3) there is preliminary evidence that the endosymbiosis of the rhodoplant occurred before the Rhodoplantae diverged into Cyanidiophyta and Rhodophyta (although this needs to be further studied using a larger taxon sampling in the Rhodoplantae), (4) chlamydial proteins of the Bacillariophyta forming VB clades were obtained by HGT or EGT from a viridiplant alga before the Viridiplantae diverged into Streptophyta and Chlorophyta, (5) since the number of chlamydial proteins forming VB clades in the Bacillariophyta found so far is roughly equivalent to the number of chlamydial proteins forming RB clades, it is suspected that EGT rather than HGT was the source of the viridiplant genes, the endosymbiont (or its plastid) being no longer present in extant heterokonts (or chromalveolates), (6) it cannot currently be decided which of the two secondary endosymbioses preceded the other. If the above conclusions are valid, the two secondary endosymbioses may have taken place during the same time period. Two chlamydial proteins involved in essential plastidic pathways (fatty acid biosynthesis, FABI; isoprenoid biosynthesis, ISPG; nos 24 and 25, Table <tblr tid="T1">1</tblr>) occur in the Bacillariophyta and Viridiplantae, but not in the Rhodoplantae (both Cyanidiophyta and Rhodophyta; tree not shown), which harbor the cyanobacterial homologues (Table <tblr tid="T1">1</tblr>). For ISPG, two Glaucoplantae also contain the cyanobacterial homologue. If the rhodoplant symbiont of the Bacillariophyta also contained the cyanobacterial homologues (the most parsimonious scenario), the latter must have been replaced in the Bacillariophyta by the chlamydial homologue from the viridiplant symbiont, requiring the simultaneous presence of both eukaryotic symbionts in the host.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Chlamydiae are a group of obligate intracellular bacteria that are well-known pathogens of animals and humans and have been studied for decades <abbrgrp><abbr bid="B38">38</abbr><abbr bid="B39">39</abbr></abbrgrp>. More recently, a large diversity of previously unrecognized chlamydiae was discovered in the environment (e.g. <abbrgrp><abbr bid="B40">40</abbr><abbr bid="B41">41</abbr><abbr bid="B42">42</abbr></abbrgrp>), where they have been found in intracellular associations with diverse eukaryotic hosts ranging from amoebae to invertebrates. Phylogenetic and phylogenomic analyses of chlamydiae indicated that "environmental chlamydiae" represent a sister group of present-day chlamydiae pathogenic in animals, that separated from their common ancestor more than 700 million years ago <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>, suggesting that the ancestor already lived intracellularly in eukaryotes. Recently, the Verrucomicrobia, which have been estimated to comprise up to 10% of the soil bacterial flora and have also been found in aquatic systems including lakes, marine sediments and hot springs but also live associated with eukaryotes, were identified as the closest known free-living relatives of chlamydiae <abbrgrp><abbr bid="B11">11</abbr><abbr bid="B43">43</abbr></abbrgrp>.</p>
         <p>While chlamydiae are obligate intracellular pathogens/symbionts in many eukaryotes, they have not been discovered to date in Plantae or secondary plastid-containing eukaryotes. However, when the first chlamydial genomes were sequenced, a surprisingly high proportion of genes with highest similarity to plant genes were discovered <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. This finding, in conjunction with the obligate intracellular lifestyle of chlamydiae, sparked a number of studies that aimed to elucidate the phylogenetic history of the plant-like chlamydial genes. The conclusions ranged from proposals that the chlamydial proteins in plants simply reflected an ancient phylogenetic relationship between chlamydiae and cyanobacteria (and thus plastids) <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B44">44</abbr></abbrgrp> to an HGT between chlamydiae and plants with either the chlamydiae <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp> or plants <abbrgrp><abbr bid="B47">47</abbr><abbr bid="B48">48</abbr></abbrgrp> proposed as donors.</p>
         <p>While the present work was in progress, Huang and Gogarten <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> published data, based on phylogenomic analyses of the rhodoplant <it>Cyanidioschyzon merolae </it>to identify chlamydial homologues, which suggested that at least 21 genes were transferred between chlamydiae and the ancestor of Rhodoplantae and Viridiplantae. They concluded that the donor was most similar to present-day <it>Protochlamydia </it>and that the relatively high number of genes transferred suggested an ancient chlamydial endosymbiosis with the ancestral primary photosynthetic eukaryote. These authors also hypothesized that the chlamydial endosymbiont was perhaps necessary to facilitate the establishment of the cyanobacterial endosymbiont, explaining the apparent uniqueness of primary plastid evolution and providing independent evidence for the monophyly of eukaryotes harboring primary plastids. The results and conclusions by <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> are corroborated by the present analyses (for differences in interpretation of phylogenetic analyses of some putative chlamydial proteins between the two studies, see Additional File <supplr sid="S4">4</supplr>). Using an extended phylogenomic analyses that included three chlorophyte (<it>Chlamydomonas reinhardtii</it>, <it>Ostreococcus tauri </it>and <it>O. lucimarinus</it>), a second rhodoplant (<it>Galdieria sulphuraria</it>), and two diatom genomes (<it>Thalassiosira pseudonana </it>and <it>Phaeodactylum tricornutum</it>), and sequences retrieved from EST databases (in particular from the two glaucoplants, <it>Cyanophora paradoxa </it>and <it>Glaucocystis nostochinearum</it>), as well as model-specified phylogenetic analyses of all proteins recovered, the number of chlamydial proteins now identified in photosynthetic eukaryotes has doubled [39 vs 19 (21) in Huang and Gogarten's analyses (two of their proteins are believed to be false positives; see Additional File <supplr sid="S4">4</supplr>)], which is still likely an underestimate. This is corroborated by a recent study that provided a list of 55 genes in Plantae of putative chlamydial origin <abbrgrp><abbr bid="B49">49</abbr></abbrgrp>, including 24 of the 39 proteins of likely chlamydial origin described in this study. Different data mining strategies including different e-value cut offs in the initial BLAST analyses, different phylogenetic methods used and the overall more conservative approach taken in this study may account for the observed different numbers of genes reported.</p>
         <suppl id="S4">
            <title>
               <p>Additional File 4</p>
            </title>
            <text>
               <p>Additional Table 3. Comparison of the results of this study with Huang and Gogarten 2007.</p>
            </text>
            <file name="1471-2148-8-203-S4.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
         <p>Most significantly, five chlamydial proteins (instead of previously only one, i.e. ATP/ADP translocase) are now represented in the three lineages of Plantae and their phylogenetic analyses all revealed monophyly of the Plantae as well as their sister group relationship to chlamydiae. Although <it>P. amoebophila </it>was often recovered as a sister to the Plantae, in such topologies other chlamydiae were either absent (suggesting loss of the respective genes in such chlamydiae) or the chlamydiae were paraphyletic, the latter likely an artifact of long-branch attraction (see Results). It is concluded that the donor of presumably all (except one) of the chlamydial genes found today in plastid-containing eukaryotes was related to the common ancestor of <it>P. amoebophila </it>and other chlamydiae, and thus that the HGT/EGT of chlamydial genes into plastid-containing eukaryotes was an ancient event that occurred more than 700 MYA, the estimated time since divergence of <it>P</it>.<it>amoebophila </it>from other chlamydiae <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Evidence has also been presented (see Results) that the acceptor of this HGT/EGT event was related to the ancestor of the Plantae, i.e. that HGT/EGT of chlamydial genes occurred before the divergence of the Glaucoplantae, Rhodoplantae and Viridiplantae.</p>
         <p>The simultaneous presence of cyanobacterial and chlamydial homologues of a larger number of proteins throughout the early history of the Plantae can, in principle, be explained by four alternative scenarios (Fig. <figr fid="F4">4A&#8211;C</figr>): (1) the first scenario (multiple HGTs from a <it>Protochlamydia</it>-type donor into different photosynthetic eukaryotes; Fig. <figr fid="F4">4A</figr>) can be rejected (see above), because it is in conflict with the presence of chlamydial genes of the same origin in the common ancestor of the Plantae (see above). (2) The second scenario (Fig. <figr fid="F4">4B</figr>) assumes that chlamydial genes were transferred from a chlamydial donor(s) by a massive single or by multiple HGTs into the cyanobacterial ancestor of plastids before primary endosymbiosis, followed by multiple losses of chlamydial genes in the different lineages of photosynthetic eukaryotes. This scenario fails to explain the complete absence of chlamydial genes in any extant cyanobacterium (unless this gene transfer was a unique event, see below) and cannot provide a rationale for retention of two functional homologues (of cyanobacterial and chlamydial origin) of a protein over extended periods of time in the cyanobacterial symbiont (or plastid). Although HGT among prokaryotes is rampant [eg. <abbrgrp><abbr bid="B50">50</abbr><abbr bid="B51">51</abbr><abbr bid="B52">52</abbr><abbr bid="B53">53</abbr></abbrgrp>] it is difficult to envision what kind of adaptive advantage would be gained in a free-living cyanobacterium upon acquisition of, e.g., the ATP/ADP translocase <abbrgrp><abbr bid="B54">54</abbr></abbrgrp>. (3 &#8211; 4) In the third and fourth scenario (Fig <figr fid="F4">4C</figr>), HGT or EGT of chlamydial genes occurred during the same time period as EGT of the cyanobacterium into a eukaryotic host. Either there was massive bulk transfer of genes from an intracellular chlamydial bacterium to the host or cyanobacterial endosymbiont with subsequent differential loss of these genes in the different lineages of Plantae (HGT) or the chlamydial bacterium persisted as an intracellular symbiont/parasite over extended but varying periods of time in the three lineages of Plantae (EGT) and different chlamydial proteins were successively recruited by the eukaryotic hosts. The latter scenario is favored because the simultaneous presence of chlamydial and cyanobacterial homologues of enzymes with the same function over an extended period of time presumably requires compartmentalization, i. e. residence in their original environment, the respective symbionts. Whether gene transfer occurred first from the intracellular chlamydial bacterium to the host nucleus with later retargeting of the protein to the cyanobacterium (plastid), as hypothesized by <abbrgrp><abbr bid="B16">16</abbr></abbrgrp> or directly between the two types of intracellular bacteria (chlamydial bacterium and cyanobacterium), with gene transfer from the plastid to the host nucleus occurring later (favored here) or perhaps using both routes, remains unknown and possibly depended on the exact timing of the two endosymbioses and the intracellular location of the symbionts relative to each other.</p>
         <fig id="F4">
            <title>
               <p>Figure 4</p>
            </title>
            <caption>
               <p>Scenarios to explain the simultaneous presence of cyanobacterial and chlamydial protein homologues in photosynthetic eukaryotes</p>
            </caption>
            <text>
               <p><b>Scenarios to explain the simultaneous presence of cyanobacterial and chlamydial protein homologues in photosynthetic eukaryotes</b>. (A) Multiple HGTs of chlamydial genes from a single donor into different photosynthetic eukaryotes. (B) Single or multiple HGTs of chlamydial genes into the cyanobacterial ancestor of plastids and group-specific gene losses from different photosynthetic eukaryotes. (C) HGT or EGT from intracellular chlamydiae to the cyanobacterial endosymbiont of a photosynthetic eukaryote and group-specific chlamydial gene losses from different photosynthetic eukaryotes. In a variation of this scenario the intracellular chlamydiae donate genes by EGT or HGT to the eukaryotic host little before or at the time of cyanobacterial endosymbiosis and group-specific multiple gene losses of chlamydial genes. (D) Origin of chlamydial proteins in heterokont algae. Two secondary endosymbioses are shown involving sequentially a viridiplant and a rhodoplant symbiont. The endosymbiosis of a cyanobacterium has been omitted from Figure 4D for clarity.</p>
            </text>
            <graphic file="1471-2148-8-203-4"/>
         </fig>
         <p>A co-existence of a chlamydial bacterium and the evolving plastid in the eukaryotic host over an extended time period may seem unlikely; however, multiple intracellular bacteria occur in extant protists, and recently, Heinz et al. <abbrgrp><abbr bid="B55">55</abbr></abbrgrp> provided evidence for the beneficial association of two intracellular bacteria (a chlamydial bacterium and a proteobacterium) in a free living <it>Acanthamoeba</it>. There is also evidence for mobile DNA in intracellular bacteria of eukaryotic hosts <abbrgrp><abbr bid="B56">56</abbr><abbr bid="B57">57</abbr></abbrgrp> and the recent discoveries of conjugation machineries in intracellular rickettsiae and chlamydiae have been particularly enlightening <abbrgrp><abbr bid="B58">58</abbr><abbr bid="B59">59</abbr></abbrgrp>. Based on phylogenetic and bioinformatic analyses, Ogata et al. <abbrgrp><abbr bid="B59">59</abbr></abbrgrp> concluded that genes involved in conjugative DNA transfer have been exchanged by HGT between the ancestors of rickettsiae and environmental chlamydiae in a eukaryotic host, likely an amoeba. This may have been the same HGT event that resulted in the transfer of ATP/ADP translocase (in total five paralogous <it>tlc </it>genes), the hallmark enzyme of the intracellular ATP-parasitism of rickettsiae and chlamydiae, from the ancestor of chlamydiae to the ancestor of rickettsiae <abbrgrp><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr><abbr bid="B60">60</abbr><abbr bid="B61">61</abbr></abbrgrp>. Although it has recently been suggested that the plastid paralogue (NTT1) of the ATP/ADP translocase might have been derived from the mitochondrial ancestor (and thus from a rickettsia-type &#945;-proteobacterium; <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>), this is unlikely because almost all of the 39 chlamydial proteins identified to date in plastid-containing eukaryotes presumably originated in an intracellular chlamydial symbiont/parasite residing in the Plantae (this study), while the origin of mitochondria occurred much earlier (none of the 39 chlamydial proteins, with three exceptions [<it>Dictyostelium</it>], has been found in other eukaryotes).</p>
         <p>This study also offered the first opportunity to trace the spread of chlamydial proteins through secondary plastid endosymbioses using phylogenomic information from the two diatoms <it>Thalassiosira pseudonana </it>and <it>Phaeodactylum tricornutum</it>. Phylogenetic analyses of the 20 chlamydial proteins recovered from the diatom proteomes led to the conclusion that chlamydial proteins originated by two separate secondary endosymbiotic events, one involving a viridiplant and a second involving a rhodoplant symbiont, only the latter surviving in extant photosynthetic heterokonts. It is also concluded that both secondary endosymbioses were ancient, presumably occurring before the major radiations in both lineages of Plantae took place (see Results and Fig. <figr fid="F4">4D</figr>).</p>
         <p>Analysis of the genome of <it>Thalassiosira pseudonana </it>gave the first indication that the diatom proteome contained a surprisingly large number of proteins that matched only to Viridiplantae (i.e. <it>Arabidopsis thaliana</it>; 865 proteins), more than four times as many as to the rhodoplant <it>Cyanidioschyzon merolae </it><abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. Even when one considers that the <it>A. thaliana </it>proteome is more than four times larger than the <it>C. merolae </it>proteome, there would still be about equal numbers of diatom proteins matching either to Viridiplantae or Rhodoplantae. The numerical ratio of diatom proteins with similarity to either Viridiplantae or Rhodoplantae was even more biased towards Viridiplantae, when animals (<it>Mus musculus</it>) were replaced by the cyanobacterium <it>Nostoc </it>sp. PCC 7120 (2023 "green" proteins vs. 254 "red" proteins) in proteome comparisons, perhaps suggesting that the contribution of viridiplant proteins to the diatom proteome extends well beyond the plastid proteome. These results were corroborated in a comparative proteome approach using more than 5,000 non-redundant EST sequences from the pennate diatom <it>P. tricornutum </it><abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. The genomes of the early diverging heterokonts <it>Phytophthora sojae </it>and <it>P. ramosum </it>displayed a large number of genes (855) supporting a photosynthetic ancestry of these presumably aplastidial protists, and again a significant portion of these genes revealed best matches to Viridiplantae <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>.</p>
         <p>Although the origin of the heterokont plastid from a red alga has been established beyond doubt using phylogenetic and phylogenomic approaches <abbrgrp><abbr bid="B66">66</abbr><abbr bid="B67">67</abbr><abbr bid="B68">68</abbr><abbr bid="B69">69</abbr><abbr bid="B70">70</abbr><abbr bid="B71">71</abbr></abbrgrp>, phylogenomic analyses of the "green" proteins in heterokonts/chromalveolates have only recently been initiated <abbrgrp><abbr bid="B65">65</abbr><abbr bid="B71">71</abbr><abbr bid="B72">72</abbr><abbr bid="B73">73</abbr><abbr bid="B74">74</abbr></abbrgrp>. Li et al. <abbrgrp><abbr bid="B71">71</abbr></abbrgrp> used 5,081 expressed sequence tags of the haptophyte <it>Emiliania huxleyi </it>and a phylogenomic approach including genome and EST data from other algae, animals, plants and bacteria to identify the source of endosymbiotic gene transfer for 19 non-paralogous proteins using maximum-likelihood phylogenetic analyses: 17 genes were of the expected red algal origin, whereas for two genes (chlorophyll a synthase, phosphoribulokinase [PRK]) Viridiplantae were the sister group of the heterokonts/chromalveolates to the exclusion of the Rhodoplantae. Similar results regarding the phylogeny of PRK were obtained by Petersen et al. <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. Whereas Petersen et al. <abbrgrp><abbr bid="B72">72</abbr></abbrgrp> suggested that PRK in heterokonts/chromalveolates originated by a single non-endosymbiotic HGT from a green alga to a rhodoplast-containing protist, Li et al. <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>, referring to the earlier controversial discussion about a putative green algal ancestry of the apicoplast-encoded elongation factor <it>tufA </it><abbrgrp><abbr bid="B75">75</abbr></abbrgrp> and the apicomplexan mitochondrial-targeted <it>cox2a </it>and <it>cox2b </it>subunits <abbrgrp><abbr bid="B76">76</abbr></abbrgrp>, raised the possibility that "green" proteins in heterokonts/chromalveolates may have originated by EGT from a green alga that was endosymbiotic in a heterokont/chromalveolate. From their data, Li et al. <abbrgrp><abbr bid="B71">71</abbr></abbrgrp> concluded that the red algal contribution, however, was at least an order of magnitude larger than that of green algae. In a related study, Nosenko et al. <abbrgrp><abbr bid="B73">73</abbr></abbrgrp> identified 30 different plastid-targeted proteins from two EST-libraries of the tertiary plastid-containing dinoflagellate <it>Karenia brevis</it>. Of 22 proteins whose evolutionary origins could be resolved, 13 were of rhodoplant, while 6 were of viridiplant origin. These authors suggested that a major influx of viridiplant genes occurred early in the evolution of heterokonts/chromalveolates, and since all "foreign" genes acquired by chromalveolates before their divergence into heterokonts/alveolates were derived from a single donor (a viridiplant), one possible explanation would be the presence of a green algal endosymbiont in the chromalveolate ancestor prior to the rhodoplant endosymbiosis <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. The phylogenetic trees derived to date from most of the "green" proteins in chromalveolates (e.g. PRK, periplasmic serin protease IV, soluble inorganic pyrophosphatase, &#947;-Tocopherol O-methyltransferase) suggest that EGT occurred before the divergence of Chlorophyta and Streptophyta, in accordance with the results presented in this study. In conclusion, heterokonts/chromalveolates seem to have obtained chlamydial proteins from two sources, both signaling ancient secondary endosymbiotic events involving symbionts from two of the three lineages of Plantae. An ancient "shopping for plastidial eukaryotes" <abbrgrp><abbr bid="B24">24</abbr></abbrgrp> in the heterotrophic ancestor of the heterokonts/chromalveolates could explain the origin of the metabolic versatility that may have subsequently contributed to the ecological success of this group of organisms irrespective of whether photosynthesis was retained or not <abbrgrp><abbr bid="B77">77</abbr><abbr bid="B78">78</abbr></abbrgrp>. As in the case of the primary acquisition of chlamydial proteins by the ancestor of the Plantae (see above), it is suggested that in the heterokont/chromalveolate ancestor, genes were also transferred from the old to the new "shopping bag", or to phrase it differently, "you are what you shop".</p>
      </sec>
      <sec>
         <st>
            <p>Conclusion</p>
         </st>
         <p>We identified 39 proteins of chlamydial origin in photosynthetic eukaryotes. Most likely Chlamydiae invaded the ancestor of the Plantae and intracellular chlamydiae persisted throughout the early history of the Plantae, donating genes either directly or via the cyanobacterial endosymbiont/plastid to their hosts before they eventually vanished. The transferred genes replaced their cyanobacterial/plastid homologs thus shaping early algal/plant evolution. Chlamydial proteins spread through secondary endosymbioses to other photoautotrophic eukaryotes. Heterokonts/chromalveolates seem to have obtained chlamydial proteins from two secondary endosymbiotic events involving symbionts from the rhodoplant and viridiplant  lineages.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Data set</p>
            </st>
            <p>We screened for algal proteins of possible chlamydial origin in the following ways: 1) The JGI databases for <it>Chlamydomonas reinhardtii, Ostreococcus lucimarinus</it>, <it>Ostreococcus tauri</it>, <it>Phaeodactylum tricornutum </it>and <it>Thalassiosira pseudonana </it>were searched for Blast hits with <it>Protochlamydia amoebophilia </it>using the advanced search function and an e-value cut off of exp -20. 2) The proteome of <it>Cyanidioschyzon merolae </it>was downloaded from <url>http://merolae.biol.s.u-tokyo.ac.jp/</url> and blasted (NCBI BLASTP) locally against the proteom of <it>Protochlamydia amoebophila </it>using an e-value cut-off of exp-20. Proteins and contigs showing similarity over the entire length were selected for further analysis. To exclude false positive we blasted (BLASTP) each algal protein against the NCB non redundant protein database and the databases for <it>Chlamydomonas reinhardtii</it>, <it>Ostreococcus lucimarinus</it>, <it>Ostreococcus tauri</it>, <it>Phaeodactylum tricornutum </it>and <it>Thalassiosira pseudonana </it>at JGI and the <it>Cyanidionschyzon merolae </it>database at <url>http://merolae.biol.s.u-tokyo.ac.jp/</url>. For proteins showing an association with proteins of Chlamydiae in the distance tree generated by the NCBI BLAST server, we assembled a dataset containing all algal proteins and proteins from <it>Arabidopsis</it>, <it>Oryza</it>, at least 5 Chlamydiae (incl. <it>Protochlamydia amoebophilia</it>), at least 5 cyanobacteria and members of the Proteobacteria and Firmicutes and 5 &#8211; 10 bacterial strains obtained as top bacterial BLASTP hits using the protein from <it>Protochlamydi</it>a as a query.</p>
         </sec>
         <sec>
            <st>
               <p>Phylogenetic analysis</p>
            </st>
            <p>Alignment of single-gene data sets were generated using CLUSTAL W and manually refined with SeaView <abbrgrp><abbr bid="B79">79</abbr></abbrgrp>. Non-alignable regions were excluded prior to phylogenetic analyses. The evolutionary model fitting best the protein data was determined with ProtTest 1.2.6 with deactivated "+F" option <abbrgrp><abbr bid="B80">80</abbr><abbr bid="B81">81</abbr></abbrgrp>. Maximum likelihood trees were done with phyml 2.4.4 set to the optimal evolutionary model and including 200 bootstrap replicates (in most cases WAG+I+&#915;; <abbrgrp><abbr bid="B82">82</abbr><abbr bid="B83">83</abbr></abbrgrp>). All trees that were chosen to be depicted, have been subjected in addition to Bayesian inference with MrBayes 3.1.2 <abbrgrp><abbr bid="B84">84</abbr></abbrgrp>. For each data set, two runs with four chains and 3 million generations have been computed. Likelihood parameters were set to 4 gamma categories and proportion of invarable sites. Computation was done across all available amino acid substitution matrices (command "prset aamodel = mixed"). Every 100<sup>th </sup>generation was sampled. Convergence of the runs was checked according to the output of the "sump" command. The output was also used to determine the burn-in phase.</p>
            <p>Two concatenated data sets with a reduced taxon sampling have been assembled from single-gene data. In the first of these data sets, single genes that showed a closer relationship of Bacillariophyta to red algae have been combined, whereas the second data set comprised single genes showing a relationship of Bacillariophyta to Viridiplantae. All concatenated data sets have been subjected to the same analysis procedure as the single genes concerning maximum likelihood analysis (see above). Since in the concatenated data sets some sequences for single taxa were missing, no partitions were defined for Bayesian analyses (see legend of Fig. <figr fid="F3">3</figr>, additional file <supplr sid="S5">5</supplr> for details).</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>BB conceived the study, contributed to its design, performed data analysis, and helped to draft the manuscript. KH&#8211;E was responsible for phylogenetic analyses and helped to draft the manuscript. MM conceived the study, contributed to its design, and wrote the manuscript. All authors read and approved the final manuscript.</p>
         <suppl id="S5">
            <title>
               <p>Additional file 5</p>
            </title>
            <text>
               <p>Figure legend. Extended legend of Fig. <figr fid="F3">3</figr>.</p>
            </text>
            <file name="1471-2148-8-203-S5.pdf">
               <p>Click here for file</p>
            </file>
         </suppl>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was supported by the University of Cologne. The authors thank the RRZK (Regional Computing Centre) for technical support.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>Lateral gene transfer and the nature of bacterial innovation</p>
            </title>
            <aug>
               <au>
                  <snm>Ochman</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Lawrence</snm>
                  <fnm>JG</fnm>
               </au>
               <au>
                  <snm>Groisman</snm>
                  <fnm>EA</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2000</pubdate>
            <volume>405</volume>
            <issue>6784</issue>
            <fpage>299</fpage>
            <lpage>304</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/35012500</pubid>
                  <pubid idtype="pmpid" link="fulltext">10830951</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Lateral gene transfer in eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Andersson</snm>
                  <fnm>JO</fnm>
               </au>
            </aug>
            <source>Cell Mol Life Sci</source>
            <pubdate>2005</pubdate>
            <volume>62</volume>
            <issue>11</issue>
            <fpage>1182</fpage>
            <lpage>1197</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1007/s00018-005-4539-z</pubid>
                  <pubid idtype="pmpid" link="fulltext">15761667</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Ancient horizontal gene transfer can benefit phylogenetic reconstruction</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2006</pubdate>
            <volume>22</volume>
            <issue>7</issue>
            <fpage>361</fpage>
            <lpage>366</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tig.2006.05.004</pubid>
                  <pubid idtype="pmpid" link="fulltext">16730850</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>The origin and stablishment of the lastid in algae and plants</p>
            </title>
            <aug>
               <au>
                  <snm>Reyes-Prieto</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Weber</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Annu Rev Genet</source>
            <pubdate>2007</pubdate>
            <volume>41</volume>
            <fpage>147</fpage>
            <lpage>168</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.genet.41.110306.130134</pubid>
                  <pubid idtype="pmpid" link="fulltext">17600460</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>Horizontal gene transfer, genome innovation and evolution</p>
            </title>
            <aug>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
               <au>
                  <snm>Townsend</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>Nat Rev Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>3</volume>
            <issue>9</issue>
            <fpage>679</fpage>
            <lpage>687</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nrmicro1204</pubid>
                  <pubid idtype="pmpid" link="fulltext">16138096</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>The source of laterally transferred genes in bacterial genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Daubin</snm>
                  <fnm>V</fnm>
               </au>
               <au>
                  <snm>Lerat</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Perriere</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>9</issue>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">193657</pubid>
                  <pubid idtype="pmpid" link="fulltext">12952536</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>The power of phylogenetic approaches to detect horizontally transferred genes</p>
            </title>
            <aug>
               <au>
                  <snm>Poptsova</snm>
                  <fnm>MS</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>JP</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1847511</pubid>
                  <pubid idtype="pmpid" link="fulltext">17376230</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>Phylogenetic reconstruction and lateral gene transfer</p>
            </title>
            <aug>
               <au>
                  <snm>Bapteste</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Boucher</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Leigh</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Doolittle</snm>
                  <fnm>WF</fnm>
               </au>
            </aug>
            <source>Trends in Microbiology</source>
            <pubdate>2004</pubdate>
            <volume>12</volume>
            <issue>9</issue>
            <fpage>406</fpage>
            <lpage>411</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tim.2004.07.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">15337161</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Chlamydiae-specific proteins and indels: novel tools for studies</p>
            </title>
            <aug>
               <au>
                  <snm>Gupta</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Griffiths</snm>
                  <fnm>E</fnm>
               </au>
            </aug>
            <source>Trends in Microbiology</source>
            <pubdate>2006</pubdate>
            <volume>14</volume>
            <issue>12</issue>
            <fpage>527</fpage>
            <lpage>535</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.tim.2006.10.002</pubid>
                  <pubid idtype="pmpid" link="fulltext">17049238</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>Increasing diversity within chlamydiae</p>
            </title>
            <aug>
               <au>
                  <snm>Corsaro</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Valassina</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Venditti</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>Crit Rev Microbiol</source>
            <pubdate>2003</pubdate>
            <volume>29</volume>
            <issue>1</issue>
            <fpage>37</fpage>
            <lpage>78</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1080/713610404</pubid>
                  <pubid idtype="pmpid">12638718</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>The <it>Planctomycetes</it>, <it>Verrucomicrobia</it>, <it>Chlamydiae</it> and sister phyla comprise a superphylum with biotechnological and medical relevance</p>
            </title>
            <aug>
               <au>
                  <snm>Wagner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horn</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Curr Opin Biotech</source>
            <pubdate>2006</pubdate>
            <volume>17</volume>
            <issue>3</issue>
            <fpage>241</fpage>
            <lpage>249</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.copbio.2006.05.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16704931</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Genome sequence of an obligate intracellular pathogen of humans: <it>Chlamydia trachomatis</it></p>
            </title>
            <aug>
               <au>
                  <snm>Stephens</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Kalman</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Lammel</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Fan</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marathe</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Aravind</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Mitchell</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Olinger</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Tatusov</snm>
                  <fnm>RL</fnm>
               </au>
               <au>
                  <snm>Zhao</snm>
                  <fnm>QX</fnm>
               </au>
               <au>
                  <snm>Koonin</snm>
                  <fnm>EV</fnm>
               </au>
               <au>
                  <snm>Davis</snm>
                  <fnm>RW</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>1998</pubdate>
            <volume>282</volume>
            <issue>5389</issue>
            <fpage>754</fpage>
            <lpage>759</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.282.5389.754</pubid>
                  <pubid idtype="pmpid" link="fulltext">9784136</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Evidence that plant-like genes in Chlamydia species reflect an ancestral relationship between chlamydiaceae, cyanobacteria, and the chloroplast</p>
            </title>
            <aug>
               <au>
                  <snm>Brinkman</snm>
                  <fnm>FSL</fnm>
               </au>
               <au>
                  <snm>Blanchard</snm>
                  <fnm>JL</fnm>
               </au>
               <au>
                  <snm>Cherkasov</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Av-Gay</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Brunham</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Fernandez</snm>
                  <fnm>RC</fnm>
               </au>
               <au>
                  <snm>Finlay</snm>
                  <fnm>BB</fnm>
               </au>
               <au>
                  <snm>Otto</snm>
                  <fnm>SP</fnm>
               </au>
               <au>
                  <snm>Ouellette</snm>
                  <fnm>BFF</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
               <au>
                  <snm>Rose</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Hancock</snm>
                  <fnm>REW</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>SJM</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2002</pubdate>
            <volume>12</volume>
            <fpage>1159</fpage>
            <lpage>67</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmpid" link="fulltext">12176923</pubid>
                  <pubid idtype="pmcid">186644</pubid>
                  <pubid idtype="doi">10.1101/gr.341802</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Illuminating the evolutionary history of chlamydiae</p>
            </title>
            <aug>
               <au>
                  <snm>Horn</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Collingro</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Schmitz-Esser</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Beier</snm>
                  <fnm>CL</fnm>
               </au>
               <au>
                  <snm>Purkhold</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Fartmann</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Brandt</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Nyakatura</snm>
                  <fnm>GJ</fnm>
               </au>
               <au>
                  <snm>Droege</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Frishman</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Rattei</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mewes</snm>
                  <fnm>HW</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Science</source>
            <pubdate>2004</pubdate>
            <volume>304</volume>
            <issue>5671</issue>
            <fpage>728</fpage>
            <lpage>730</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1096330</pubid>
                  <pubid idtype="pmpid" link="fulltext">15073324</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>CMP-KDO synthetase - a plant gene borrowed from Gram-negative eubacteria</p>
            </title>
            <aug>
               <au>
                  <snm>Royo</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gomez</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Hueros</snm>
                  <fnm>G</fnm>
               </au>
            </aug>
            <source>Trends Genet</source>
            <pubdate>2000</pubdate>
            <volume>16</volume>
            <issue>10</issue>
            <fpage>432</fpage>
            <lpage>433</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S0168-9525(00)02102-8</pubid>
                  <pubid idtype="pmpid" link="fulltext">11050325</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Did an ancient chlamydial endosymbiosis facilitate the establishment of primary plastids?</p>
            </title>
            <aug>
               <au>
                  <snm>Huang</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Gogarten</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Biology</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>R99</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">2394758</pubid>
                  <pubid idtype="pmpid" link="fulltext">17547748</pubid>
                  <pubid idtype="doi">10.1186/gb-2007-8-6-r99</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>'<it>Candidatus</it> Protochlamydia amoebophila', an endosymbiont of <it>Acanthamoeba</it> spp.</p>
            </title>
            <aug>
               <au>
                  <snm>Collingro</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Toenshoff</snm>
                  <fnm>ER</fnm>
               </au>
               <au>
                  <snm>Taylor</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Fritsche</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Wagner</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Horn</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Int J Syst Evol Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>55</volume>
            <fpage>1863</fpage>
            <lpage>1866</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1099/ijs.0.63572-0</pubid>
                  <pubid idtype="pmpid" link="fulltext">16166679</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>A genomic timescale of prokaryote evolution: insights into the origin of methanogenesis, phototrophy, and the colonization of land</p>
            </title>
            <aug>
               <au>
                  <snm>Battistuzzi</snm>
                  <fnm>FU</fnm>
               </au>
               <au>
                  <snm>Feijao</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Hedges</snm>
                  <fnm>SB</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">533871</pubid>
                  <pubid idtype="pmpid" link="fulltext">15535883</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>A complex and punctate distribution of three eukaryotic genes derived by lateral gene transfer</p>
            </title>
            <aug>
               <au>
                  <snm>Rogers</snm>
                  <fnm>MB</fnm>
               </au>
               <au>
                  <snm>Watkins</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Harper</snm>
                  <fnm>JT</fnm>
               </au>
               <au>
                  <snm>Durnford</snm>
                  <fnm>DG</fnm>
               </au>
               <au>
                  <snm>Gray</snm>
                  <fnm>MW</fnm>
               </au>
               <au>
                  <snm>Keeling</snm>
                  <fnm>PJ</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <fpage>89</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">1920508</pubid>
                  <pubid idtype="pmpid" link="fulltext">17562012</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-7-89</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>A molecular timescale of eukaryote evolution and the rise of complex multicellular life</p>
            </title>
            <aug>
               <au>
                  <snm>Hedges</snm>
                  <fnm>SB</fnm>
               </au>
               <au>
                  <snm>Blair</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Venturi</snm>
                  <fnm>ML</fnm>
               </au>
               <au>
                  <snm>Shoe</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>BMC Evolutionary Biology</source>
            <pubdate>2004</pubdate>
            <volume>4</volume>
            <fpage>2</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="pmcid">341452</pubid>
                  <pubid idtype="pmpid" link="fulltext">15005799</pubid>
                  <pubid idtype="doi">10.1186/1471-2148-4-2</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>A molecular timeline for the origin of photosynthetic eukaryotes</p>
            </title>
            <aug>
               <au>
                  <snm>Yoon</snm>
                  <fnm>HS</fnm>
               </au>
               <au>
                  <snm>Hackett</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Ciniglia</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Pinto</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Bhattacharya</snm>
                  <fnm>D</fnm>
               </au>
            </aug>
            <source>M