<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>gb-2004-5-11-r88</ui>
	<ji>GBJ</ji>
	<fm>
		<dochead>Research</dochead>
		<bibl>
			<title>
				<p>Phylogenomic evidence supports past endosymbiosis, intracellular and horizontal gene transfer in <it>Cryptosporidium parvum</it></p>
			</title>
			<aug>
				<au id="A1">
					<snm>Huang</snm>
					<fnm>Jinling</fnm>
					<insr iid="I1"/>
					<email>jihuang@uga.edu</email>
				</au>
				<au id="A2">
					<snm>Mullapudi</snm>
					<fnm>Nandita</fnm>
					<insr iid="I2"/>
					<email>nandita@uga.edu</email>
				</au>
				<au id="A3">
					<snm>Lancto</snm>
					<mi>A</mi>
					<fnm>Cheryl</fnm>
					<insr iid="I3"/>
					<email>lanct002@umn.edu</email>
				</au>
				<au id="A4">
					<snm>Scott</snm>
					<fnm>Marla</fnm>
					<insr iid="I1"/>
					<email>marlascott@charter.net</email>
				</au>
				<au id="A5">
					<snm>Abrahamsen</snm>
					<mi>S</mi>
					<fnm>Mitchell</fnm>
					<insr iid="I3"/>
					<email>abe@umn.edu</email>
				</au>
				<au id="A6" ca="yes">
					<snm>Kissinger</snm>
					<mi>C</mi>
					<fnm>Jessica</fnm>
					<insr iid="I1"/>
					<insr iid="I2"/>
					<email>jkissing@uga.edu</email>
				</au>
			</aug>
			<insg>
				<ins id="I1">
					<p>Center for Tropical and Emerging Global Diseases, University of Georgia, Athens, GA 30602, USA</p>
				</ins>
				<ins id="I2">
					<p>Department of Genetics, University of Georgia, Athens, GA 30602, USA</p>
				</ins>
				<ins id="I3">
					<p>Veterinary and Biomedical Sciences, University of Minnesota, St Paul, MN 55108, USA</p>
				</ins>
			</insg>
			<source>Genome Biology</source>
			<issn>1465-6906</issn>
			<pubdate>2004</pubdate>
			<volume>5</volume>
			<issue>11</issue>
			<fpage>R88</fpage>
			<url>http://genomebiology.com/2004/5/11/R88</url>
			<xrefbib>
				<pubidlist><pubid idtype="pmpid">15535864</pubid><pubid idtype="doi">10.1186/gb-2004-5-11-r88</pubid>
				</pubidlist></xrefbib>
		</bibl>
		<history>
			<rec>
				<date>
					<day>19</day>
					<month>4</month>
					<year>2004</year>
				</date>
			</rec>
			<revrec>
				<date>
					<day>16</day>
					<month>8</month>
					<year>2004</year>
				</date>
			</revrec>
			<acc>
				<date>
					<day>10</day>
					<month>9</month>
					<year>2004</year>
				</date>
			</acc>
			<pub>
				<date>
					<day>19</day>
					<month>10</month>
					<year>2004</year>
				</date>
			</pub>
		</history>
		<cpyrt>
			<year>2004</year>
			<collab>Huang et al.; licensee BioMed Central Ltd.</collab>
			<note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
		</cpyrt>
		<shorttitle>
			<p>Phylogenomic evidence supports past endosymbiosis, intracellular and horizontal gene transfer in <it>Cryptosporidium parvum</it></p>
		</shorttitle>
		<shortabs>
			<p>An analysis of <it>Cryptosporidium parvum</it> genes of likely endosymbiont or prokaryotic origin supports the hypothesis that <it>C. arvum</it> evolved from a plastid-containing lineage.</p>
		</shortabs>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st>
					<p>The apicomplexan parasite <it>Cryptosporidium parvum </it>is an emerging pathogen capable of causing illness in humans and other animals and death in immunocompromised individuals. No effective treatment is available and the genome sequence has recently been completed. This parasite differs from other apicomplexans in its lack of a plastid organelle, the apicoplast. Gene transfer, either intracellular from an endosymbiont/donor organelle or horizontal from another organism, can provide evidence of a previous endosymbiotic relationship and/or alter the genetic repertoire of the host organism. Given the importance of gene transfers in eukaryotic evolution and the potential implications for chemotherapy, it is important to identify the complement of transferred genes in <it>Cryptosporidium</it>.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st>
					<p>We have identified 31 genes of likely plastid/endosymbiont (n = 7) or prokaryotic (n = 24) origin using a phylogenomic approach. The findings support the hypothesis that <it>Cryptosporidium </it>evolved from a plastid-containing lineage and subsequently lost its apicoplast during evolution. Expression analyses of candidate genes of algal and eubacterial origin show that these genes are expressed and developmentally regulated during the life cycle of <it>C. parvum</it>.</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st>
					<p><it>Cryptosporidium </it>is the recipient of a large number of transferred genes, many of which are not shared by other apicomplexan parasites. Genes transferred from distant phylogenetic sources, such as eubacteria, may be potential targets for therapeutic drugs owing to their phylogenetic distance or the lack of homologs in the host. The successful integration and expression of the transferred genes in this genome has changed the genetic and metabolic repertoire of the parasite.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<meta>
		<classifications>
			<classification type="BMC" subtype="man_spc_id" id="30010014">Microbiology and parasitology</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010008">Evolution</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010010">Genome studies</classification>
			<classification type="BMC" subtype="man_spc_id" id="30010009">Genetics</classification>
		</classifications>
	</meta>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st>
			<p><it>Cryptosporidium </it>is a member of the Apicomplexa, a eukaryotic phylum that includes several important parasitic pathogens such as <it>Plasmodium</it>, <it>Toxoplasma</it>, <it>Eimeria </it>and <it>Theileria</it>. As an emerging pathogen in humans and other animals, <it>Cryptosporidium </it>often causes fever, diarrhea, anorexia and other complications. Although cryptosporidial infection is often self-limiting, it can be persistent and fatal for immunocompromised individuals. So far, no effective treatment is available <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Furthermore, because of its resistance to standard chlorine disinfection of water, <it>Cryptosporidium </it>continues to be a security concern as a potential water-borne bioterrorism agent <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
			<p><it>Cryptosporidium </it>is phylogenetically quite distant from the hemosporidian and coccidian apicomplexans <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and, depending on the molecule and method used, is either basal to all Apicomplexa examined thus far, or is the sister group to the gregarines <abbrgrp><abbr bid="B4">4</abbr><abbr bid="B5">5</abbr></abbrgrp>. It is unusual in several respects, notably for the lack of the apicoplast organelle which is characteristic of all other apicomplexans that have been examined <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The apicoplast is a relict plastid hypothesized to have been acquired by an ancient secondary endosymbiosis of a pre-alveolate eukaryotic cell with an algal cell <abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. All that remains of the endosymbiont in Coccidia and Haemosporidia is a plastid organelle surrounded by four membranes <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. The apicoplast retains its own genome, but this is much reduced (27-35 kilobases (kb)), and contains genes primarily involved in the replication of the plastid genome <abbrgrp><abbr bid="B10">10</abbr><abbr bid="B11">11</abbr></abbrgrp>. In apicomplexans that have a plastid, many of the original plastid genes appear to have been lost (for example, photosynthesis genes) and some genes have been transferred to the host nuclear genome; their proteins are reimported into the apicoplast where they function <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. Plastids acquired by secondary endosymbiosis are scattered among eukaryotic lineages, including cryptomonads, haptophytes, alveolates, euglenids and chlorarachnions <abbrgrp><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Among the alveolates, plastids are found in dinoflagellates and most examined apicomplexans but not in ciliates. Recent studies on the nuclear-encoded, plastid-targeted glyceraldehyde-3-phosphate dehydrogenase (GAPDH) gene suggest a common origin of the secondary plastids in apicomplexans, some dinoflagellates, heterokonts, haptophytes and cryptomonads <abbrgrp><abbr bid="B8">8</abbr><abbr bid="B18">18</abbr></abbrgrp>. If true, this would indicate that the lineage that gave rise to <it>Cryptosporidium </it>contained a plastid, even though many of its descendants (for example, the ciliates) appear to lack a plastid. Although indirect evidence has been noted for the past existence of an apicoplast in <it>C. parvum </it><abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>, no rigorous phylogenomic survey for nuclear-encoded genes of plastid or algal origin has been reported.</p>
			<p>Gene transfers, either intracellular (IGT) from an endosymbiont or organelle to the host nucleus or horizontal (HGT) between species, can dramatically alter the biochemical repertoire of host organisms and potentially create structural or functional novelties <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B23">23</abbr></abbrgrp>. In parasites, genes transferred from prokaryotes or other sources are potential targets for chemotherapy due to their phylogenetic distance or lack of a homolog in the host <abbrgrp><abbr bid="B24">24</abbr><abbr bid="B25">25</abbr></abbrgrp>. The detection of transferred genes in <it>Cryptosporidium </it>is thus of evolutionary and practical importance.</p>
			<p>In this study, we use a phylogenomic approach to mine the recently sequenced genome of <it>C. parvum </it>(IOWA isolate; 9.1 megabases (Mb)) <abbrgrp><abbr bid="B7">7</abbr></abbrgrp> for evidence of the past existence of an endosymbiont or apicoplast organelle and of other independent HGTs into this genome. We have detected genes of cyanobacterial/algal origin and genes acquired from other prokaryotic lineages in <it>C. parvum</it>. The fate of several of these transferred genes in <it>C. parvum </it>is explored by expression analyses. The significance of our findings and their impact on the genetic makeup of the parasite are discussed.</p>
		</sec>
		<sec>
			<st>
				<p>Results</p>
			</st>
			<sec>
				<st>
					<p>BLAST analyses</p>
				</st>
				<p>From BLAST analyses, the genome of <it>Cryptosporidium</it>, like that of <it>Plasmodium falciparum </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, is more similar overall to those of the plants <it>Arabidopsis </it>and <it>Oryza </it>than to any other non-apicomplexan organism currently represented in GenBank. The program Glimmer predicted 5,519 protein-coding sequences in the <it>C. parvum </it>genome, 4,320 of which had similarity to other sequences deposited in the GenBank nonredundant protein database. A significant number of these sequences, 936 (E-value &lt; 10<sup>-3</sup>) or 783 (E-value &lt; 10<sup>-7</sup>), had their most significant, non-apicomplexan, similarity to a sequence isolated from plants, algae, eubacteria (including cyanobacteria) or archaea (Table <tblr tid="T1">1</tblr>). To evaluate these observations further, phylogenetic analyses were performed, when possible, for each predicted protein in the entire genome.</p>
				<tbl id="T1" hint_layout="single">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>Distribution of best non-apicomplexan BLAST hits in searches of the GenBank non-redundant protein database</p>
					</caption>
					<tblbdy cols="3">
						<r>
							<c ca="left">
								<p>Category</p>
							</c>
							<c ca="center">
								<p>E &lt; 10<sup>-3</sup></p>
							</c>
							<c ca="center">
								<p>E &lt; 10<sup>-7</sup></p>
							</c>
						</r>
						<r>
							<c cspan="3">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Plants</p>
							</c>
							<c ca="center">
								<p>670</p>
							</c>
							<c ca="center">
								<p>588</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Algae</p>
							</c>
							<c ca="center">
								<p>30</p>
							</c>
							<c ca="center">
								<p>21</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Non-cyanobacterial eubacteria</p>
							</c>
							<c ca="center">
								<p>188</p>
							</c>
							<c ca="center">
								<p>117</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Cyanobacteria</p>
							</c>
							<c ca="center">
								<p>22</p>
							</c>
							<c ca="center">
								<p>16</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Archaea</p>
							</c>
							<c ca="center">
								<p>26</p>
							</c>
							<c ca="center">
								<p>11</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Total</p>
							</c>
							<c ca="center">
								<p>936</p>
							</c>
							<c ca="center">
								<p>783</p>
							</c>
						</r>
					</tblbdy>
				</tbl>
			</sec>
			<sec>
				<st>
					<p>Phylogenomic analyses</p>
				</st>
				<p>The Glimmer-predicted protein-coding regions of the <it>C. parvum </it>genome (5,519 sequences) were used as input for phylogenetic analyses using the PyPhy program <abbrgrp><abbr bid="B27">27</abbr></abbrgrp>. In this program, phylogenetic trees for each input sequence are analyzed to determine the taxonomic identity of the nearest neighbor relative to the input sequence at a variety of taxonomic levels, for example, genus, family, or phylum. Using stringent analysis criteria (see Materials and methods), 954 trees were constructed from the input set of 5,519 predicted protein sequences (Figure <figr fid="F1">1</figr>). Analysis of the nearest non-apicomplexan neighbor on the 954 trees revealed the following nearest neighbor relationships: eubacterial (115 trees), archaeal (30), green plant/algal (204), red algal (8), and glaucocystophyte (4); other alveolate (61) and other eukaryotes made up the remainder. As some input sequences may have more than one nearest neighbor of interest on a tree, a nonredundant total of 393 sequences were identified with nearest neighbors to the above lineages.</p>
				<fig id="F1">
					<title>
						<p>Figure 1</p>
					</title>
					<caption>
						<p>Phylogenomic analysis pipeline</p>
					</caption>
					<text>
						<p>Phylogenomic analysis pipeline. The procedures used to analyze, assess and manipulate the protein-sequence data at each stage of the analysis are diagrammed.</p>
					</text>
					<graphic file="gb-2004-5-11-r88-1"/>
				</fig>
				<p>Searches of the <it>C. parvum </it>predicted gene set with the 551 <it>P. falciparum </it>predicted nuclear-encoded apicoplast-targeted proteins (NEAPs) yielded 40 significant hits (E-value &lt; 10<sup>-5</sup>), 23 of which were also identified in the phylogenomic analyses. A combination of these two approaches identified 410 candidates requiring further detailed analyses. Of these candidates, the majority were eliminated after stringent criteria were applied because of ambiguous tree topologies, insufficient taxonomic sampling, lack of bootstrap support or the presence of clear vertical eukaryotic ancestry (see Materials and methods). Thirty-one genes survived the screen and were deemed to be either strong or likely candidates for gene transfer (Table <tblr tid="T2">2</tblr>).</p>
				<tbl id="T2">
					<title>
						<p>Table 2</p>
					</title>
					<caption>
						<p>Genes of algal or eubacterial origin in <it>C. parvum</it></p>
					</caption>
					<tblbdy cols="7">
						<r>
							<c ca="left">
								<p>Putative gene name</p>
							</c>
							<c ca="center">
								<p>Accession</p>
							</c>
							<c ca="center">
								<p>Location</p>
							</c>
							<c ca="center">
								<p>Expression</p>
							</c>
							<c ca="center">
								<p>Indel</p>
							</c>
							<c ca="left">
								<p>Putative origin</p>
							</c>
							<c ca="left">
								<p>Putative function</p>
							</c>
						</r>
						<r>
							<c cspan="7">
								<hr/>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Lactate dehydrogenase*</p>
							</c>
							<c ca="center">
								<p>AAG17668</p>
							</c>
							<c ca="center">
								<p>VII</p>
							</c>
							<c ca="center">
								<p>EST</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>&#945;-proteobacteria</p>
							</c>
							<c ca="left">
								<p>Oxidoreductase</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Malate dehydrogenase*</p>
							</c>
							<c ca="center">
								<p>AAP87358</p>
							</c>
							<c ca="center">
								<p>VII</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>&#945;-proteobacteria</p>
							</c>
							<c ca="left">
								<p>Oxidoreductase</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Thymidine kinase</p>
							</c>
							<c ca="center">
								<p>AAS47699</p>
							</c>
							<c ca="center">
								<p>V</p>
							</c>
							<c ca="center">
								<p>Assay</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>&#945;/&#947;-proteobacteria</p>
							</c>
							<c ca="left">
								<p>Kinase; nucleotide metabolism</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Hypothetical protein A<sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>EAK88787</p>
							</c>
							<c ca="center">
								<p>II</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>&#947;-proteobacteria</p>
							</c>
							<c ca="left">
								<p>Unknown</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Inosine 5' monophosphate dehydrogenase</p>
							</c>
							<c ca="center">
								<p>AAL83208</p>
							</c>
							<c ca="center">
								<p>VI</p>
							</c>
							<c ca="center">
								<p>Assay</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>&#949;-proteobacteria</p>
							</c>
							<c ca="left">
								<p>Purine nucleotide biosynthesis</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Tryptophan synthetase &#946; chain</p>
							</c>
							<c ca="center">
								<p>EAK87294</p>
							</c>
							<c ca="center">
								<p>V</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Proteobacteria</p>
							</c>
							<c ca="left">
								<p>Amino acid biosynthesis</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1,4-&#945;-glucan branching enzyme</p>
							</c>
							<c ca="center">
								<p>CAD98370</p>
							</c>
							<c ca="center">
								<p>VI</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Carbohydrate metabolism</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>1,4-&#945;-glucan branching enzyme</p>
							</c>
							<c ca="center">
								<p>CAD98416</p>
							</c>
							<c ca="center">
								<p>VI</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Carbohydrate metabolism</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Acetyltransferase</p>
							</c>
							<c ca="center">
								<p>EAK87438</p>
							</c>
							<c ca="center">
								<p>VIII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Unknown</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>&#945;-amylase</p>
							</c>
							<c ca="center">
								<p>EAK88222</p>
							</c>
							<c ca="center">
								<p>V</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Carbohydrate metabolism</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>DNA-3-methyladenine glycosylase</p>
							</c>
							<c ca="center">
								<p>EAK89739</p>
							</c>
							<c ca="center">
								<p>VIII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>DNA repair</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>RNA methyltransferase</p>
							</c>
							<c ca="center">
								<p>AY599068</p>
							</c>
							<c ca="center">
								<p>II</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>RNA processing and modification</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Peroxiredoxin</p>
							</c>
							<c ca="center">
								<p>AY599067</p>
							</c>
							<c ca="center">
								<p>IV</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Oxidoreductase; antioxidant</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Glycerophosphodiester phosphodiesterase</p>
							</c>
							<c ca="center">
								<p>AY599066</p>
							</c>
							<c ca="center">
								<p>IV</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Phosphoric ester hydrolase</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>ATPase of the AAA class</p>
							</c>
							<c ca="center">
								<p>EAK88388</p>
							</c>
							<c ca="center">
								<p>I</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Post-translational modification</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Alcohol dehydrogenase</p>
							</c>
							<c ca="center">
								<p>EAK89684</p>
							</c>
							<c ca="center">
								<p>VIII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Energy production and conversion</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Aminopeptidase N</p>
							</c>
							<c ca="center">
								<p>AAK53986</p>
							</c>
							<c ca="center">
								<p>VIII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Peptide hydrolase</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Glutamine synthetase</p>
							</c>
							<c ca="center">
								<p>CAD98273</p>
							</c>
							<c ca="center">
								<p>VI</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Amino acid biosynthesis</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Conserved hypothetical protein B</p>
							</c>
							<c ca="center">
								<p>CAD98502</p>
							</c>
							<c ca="center">
								<p>VI</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Unknown</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Aspartate-ammonia ligase<sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>EAK87293</p>
							</c>
							<c ca="center">
								<p>V</p>
							</c>
							<c ca="center">
								<p>EST</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Amino acid biosynthesis</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Asparaginyl tRNA synthetase<sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>EAK87485</p>
							</c>
							<c ca="center">
								<p>VIII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Translation</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Glutamine cyclotransferase<sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>EAK88499</p>
							</c>
							<c ca="center">
								<p>I</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria</p>
							</c>
							<c ca="left">
								<p>Amido transferase</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Leucine aminopeptidase</p>
							</c>
							<c ca="center">
								<p>EAK88215</p>
							</c>
							<c ca="center">
								<p>V</p>
							</c>
							<c ca="center">
								<p>RT-PCR</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>Cyanobacteria</p>
							</c>
							<c ca="left">
								<p>Hydrolase</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Biopteridine transporter (BT-1)</p>
							</c>
							<c ca="center">
								<p>CAD98492</p>
							</c>
							<c ca="center">
								<p>VI</p>
							</c>
							<c ca="center">
								<p>RT-PCR /EST</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>Cyanobacteria</p>
							</c>
							<c ca="left">
								<p>Biopterine transport</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Hypothetical protein C<sup>&#8224; </sup>(possible Zn-dependent metalloprotease)</p>
							</c>
							<c ca="center">
								<p>EAK89015</p>
							</c>
							<c ca="center">
								<p>III</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Archaea</p>
							</c>
							<c ca="left">
								<p>Putative protease</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Superoxide dismutase<sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>AY599065</p>
							</c>
							<c ca="center">
								<p>V</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Eubacteria /archaea</p>
							</c>
							<c ca="left">
								<p>Oxidoreductase; antioxidant</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Glucose-6-phosphate isomerase</p>
							</c>
							<c ca="center">
								<p>EAK88696</p>
							</c>
							<c ca="center">
								<p>II</p>
							</c>
							<c ca="center">
								<p>RT-PCR</p>
							</c>
							<c ca="center">
								<p>+</p>
							</c>
							<c ca="left">
								<p>Algae/plants</p>
							</c>
							<c ca="left">
								<p>Carbohydrate metabolism</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Uridine kinase/uracil phosphoribosyltransferase<sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>AAS47700</p>
							</c>
							<c ca="center">
								<p>VIII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Algae/plants</p>
							</c>
							<c ca="left">
								<p>Nucleotide salvage metabolism</p>
							</c>
						</r>
						<r>
							<c ca="left">
								<p>Calcium-dependent protein kinases* <sup>&#8224;</sup></p>
							</c>
							<c ca="center">
								<p>AAS47705</p>
							</c>
							<c ca="center">
								<p>II</p>
							</c>
							<c ca="center">
								<p>RT-PCR</p>
							</c>
							<c>
								<p/>
							</c>
							<c ca="left">
								<p>Algae/plants</p>
							</c>
							<c ca="left">
								<p>Kinase; cell signal transduction</p>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>AAS47706</p>
							</c>
							<c ca="center">
								<p>II</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
						<r>
							<c>
								<p/>
							</c>
							<c ca="center">
								<p>AAS47707</p>
							</c>
							<c ca="center">
								<p>VII</p>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
							<c>
								<p/>
							</c>
						</r>
					</tblbdy>
					<tblfn>
						<p>*Genes that have been derived from a duplication following transfer; <sup>&#8224;</sup>transferred genes that have less support. GenBank accession numbers are as indicated. Locations are given as chromosome number. The expression status for each gene is indicated by method: EST, RT-PCR or assay. Only 567 EST sequences exist for <it>C. parvum</it>. A + in the indel colum indicates the presence of a shared insertion/deletion between the <it>C. parvum </it>sequence and other sequences from organisms identified in the putative origin column.</p>
					</tblfn>
				</tbl>
				<p>Of the 31 recovered genes, several have been previously published or submitted to the GenBank <abbrgrp><abbr bid="B20">20</abbr></abbrgrp>, including those identified as having plant or eubacterial 'likeness' on the basis of similarity searches when the genome sequence was published <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. The remaining sequences were further tested to rule out the possibility that they were artifacts (<it>C. parvum </it>oocysts are purified from cow feces which contain plant and bacterial matter). Two experiments were performed. In the first, nearly complete genomic sequences (generated in a different laboratory) from the closely related species <it>C. hominis </it>were screened using BLASTN for the existence of the predicted genes. Twenty out of 21 <it>C. parvum </it>sequences were identified in <it>C. hominis</it>. The remaining sequence was represented by two independently isolated expressed sequence tag (EST) sequences in the GenBank and CryptoDB databases (data not shown). In the second experiment, genomic Southern analyses of the IOWA isolate were carried out (Figure <figr fid="F2">2</figr>) for several of the genes of bacterial or plant origin. In each case, a band of the predicted size was identified (see Additional data file 1). The genes are not contaminants.</p>
				<fig id="F2">
					<title>
						<p>Figure 2</p>
					</title>
					<caption>
						<p><it>Cryptosporidium parvum </it>genomic Southern blot</p>
					</caption>
					<text>
						<p><it>Cryptosporidium parvum </it>genomic Southern blot. <it>C. parvum </it>genomic DNA, 5 &#956;g per lane. Lanes were probed for the following genes: (1) aminopeptidase N; (2) glucose-6-phosphate isomerase; (3) leucine aminopeptidase; (4) pteridine transporter (BT-1); and (5) glutamine synthetase. Lanes (1-4) were restricted with <it>Bam</it>H1 and lane (5) with <it>Eco</it>R1. The ladder is shown in 1 kb increments. See Additional data file 1 for probes and methods.</p>
					</text>
					<graphic file="gb-2004-5-11-r88-2"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Genes of cyanobacterial/algal origin</p>
				</st>
				<p>Extant <it>Cryptosporidium </it>species do not contain an apicoplast genome or any physical structure thought to represent an algal endosymbiont or the plastid organelle it contained <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. The only possible remaining evidence of the past association of an endosymbiont or its cyanobacterially derived plastid organelle might be genes transferred from these genetic sources to the host genome prior to the physical loss of the endosymbiont or organelle itself. Several such genes were identified.</p>
				<p>A leucine aminopeptidase gene of cyanobacterial origin was found in the <it>C. parvum </it>nuclear genome. This gene is also present in the nuclear genome of other apicomplexan species (<it>Plasmodium</it>, <it>Toxoplasma </it>and <it>Eimeria</it>), as confirmed by similarity searches against ApiDB (see Materials and methods). In <it>P. falciparum</it>, leucine aminopeptidase is a predicted NEAP and possesses an amino-terminal extension with a putative transit peptide. Consistent with the lack of an apicoplast, this gene in <it>Cryptosporidium </it>contains no evidence of a signal peptide and the amino-terminal extension is reduced. Similarity searches of the GenBank nonredundant protein database revealed top hits to <it>Plasmodium</it>, followed by <it>Arabidopsis thaliana</it>, and several cyanobacteria including <it>Prochlorococcus</it>, <it>Nostoc </it>and <it>Trichodesmium</it>, and plant chloroplast precursors in <it>Lycopersicon esculentum </it>and <it>Solanum tuberosum </it>(data not shown). A multiple sequence alignment of the predicted protein sequences of leucine aminopeptidase reveals overall similarity and a shared indel among apicomplexan, plant and cyanobacterial sequences (Figure <figr fid="F3">3</figr>). Phylogenetic analyses strongly support a monophyletic grouping of <it>C. parvum </it>and other apicomplexan leucine aminopeptidase proteins with cyanobacteria and plant chloroplast precursors (Figure <figr fid="F4">4a</figr>). So far, this gene has not been detected in ciliates.</p>
				<fig id="F3">
					<title>
						<p>Figure 3</p>
					</title>
					<caption>
						<p>Region of leucine aminopeptidase multiple sequence alignment that illustrates several characters uniting apicomplexan sequences with plant and cyanobacterial sequences</p>
					</caption>
					<text>
						<p>Region of leucine aminopeptidase multiple sequence alignment that illustrates several characters uniting apicomplexan sequences with plant and cyanobacterial sequences. The red box denotes an indel shared between apicomplexans, plants and cyanobacteria. The number preceeding each sequence is the position in the individual sequence at which this stretch of similarity begins. GenBank GI numbers for each sequence are as indicated in Additional data file 1. Colored boxes preceeding the alignment indicate the taxonomic group for the organisms named to the left. Red, apicomplexan; green, plant and cyanobacterial; blue, eubacterial; lavender, other protists and eukaryotes.</p>
					</text>
					<graphic file="gb-2004-5-11-r88-3"/>
				</fig>
				<fig id="F4">
					<title>
						<p>Figure 4</p>
					</title>
					<caption>
						<p>Phylogenetic analyses</p>
					</caption>
					<text>
						<p>Phylogenetic analyses. <b>(a) </b>Leucine aminopeptidase; <b>(b) </b>glucose-6-phosphate isomerase; <b>(c) </b>tryptophan synthetase &#946; subunit. Numbers above the branches (where space permits) show the puzzle frequency (with TREE-PUZZLE) and bootstrap support for both maximum parsimony and neighbor-joining analyses respectively. Asterisks indicate that support for this branch is below 50%. The scale is as indicated. GI accession numbers and alignments are provided in Additional data file 1.</p>
					</text>
					<graphic file="gb-2004-5-11-r88-4"/>
				</fig>
				<p>Another <it>C. parvum </it>nuclear-encoded gene of putative cyanobacterial origin is a protein of unknown function belonging to the biopterine transporter family (BT-1) (Table <tblr tid="T2">2</tblr>). Similarity searches with this protein revealed significant hits to other apicomplexans (for example, <it>P. falciparum</it>, <it>Theileria annulata</it>, <it>T. gondii</it>), plants (<it>Arabidopsis</it>, <it>Oryza</it>), cyanobacteria (<it>Trichodesmium</it>, <it>Nostoc </it>and <it>Synechocystis</it>), a ciliate (<it>Tetrahymena</it>) and the kinetoplastids (<it>Leishmania </it>and <it>Trypanosoma</it>). <it>Arabidopsis thaliana </it>apparently contains at least two copies of this gene; the protein of one (accession number NP_565734) is predicted by ChloroP <abbrgrp><abbr bid="B28">28</abbr></abbrgrp> to be chloroplast-targeted, suggestive of its plastid derivation. The taxonomic distribution and sequence similarity of this protein with cyanobacterial and chloroplast homologs are also indicative of its affinity to plastids.</p>
				<p>Only one gene of algal nuclear origin, glucose-6-phosphate isomerase (G6PI), was identified by the screen described here. Several other algal-like genes are probable, but their support was weaker (Table <tblr tid="T2">2</tblr>). A 'plant-like' G6PI has been described in other apicomplexan species (<it>P. falciparum</it>, <it>T. gondii </it><abbrgrp><abbr bid="B29">29</abbr></abbrgrp>) and a 'cyanobacterial-like' G6PI has been described in the diplomonads <it>Giardia intestinalis </it>and <it>Spironucleus </it>and the parabasalid <it>Trichomonas vaginalis </it><abbrgrp><abbr bid="B30">30</abbr></abbrgrp>. Figure <figr fid="F4">4b</figr> illustrates these observations nicely. At the base of the tree, the eukaryotic organisms <it>Giardia</it>, <it>Spironucleus </it>and <it>Trichomonas </it>group with the cyanobacterium <it>Nostoc</it>, as previously published. In the midsection of the tree, the G6PI of apicomplexans and ciliates forms a well-supported monophyletic group with the plants and the heterokont <it>Phytophthora</it>. The multiple protein sequence alignment of G6PI identifies several conserved positions shared exclusively by apicomplexans, <it>Tetrahymena</it>, plants and <it>Phytophthora</it>. This gene does not contain a signal or transit peptide and is not predicted to be targeted to the apicoplast in <it>P. falciparum</it>. The remainder of the tree shows a weakly supported branch including eubacteria, fungi and several eukaryotes. The eukaryotes are interrupted by the inclusion of G6PI from the eubacterial organisms <it>Escherichia coli </it>and <it>Cytophaga</it>. This relationship of <it>E. coli </it>G6PI and eukaryotic G6PI has been observed before and may represent yet another gene transfer <abbrgrp><abbr bid="B31">31</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Genes of eubacterial (non-cyanobacterial) origin</p>
				</st>
				<p>Our study identified HGTs from several distinct sources, involving a variety of biochemical activities and metabolic pathways (Table <tblr tid="T2">2</tblr>). Notably, the nucleotide biosynthesis pathway contains at least two previously published, independently transferred genes from eubacteria. Inosine 5' monophosphate dehydrogenase (IMPDH), an enzyme for purine salvage, was transferred from &#949;-proteobacteria <abbrgrp><abbr bid="B32">32</abbr></abbrgrp>. Another enzyme involved in pyrimidine salvage, thymidine kinase (TK), is of &#945; or &#947;-proteobacterial ancestry <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>.</p>
				<p>Another gene of eubacterial origin identified in <it>C. parvum </it>is tryptophan synthetase &#946; subunit (<it>trpB</it>). This gene has been identified in both <it>C. parvum </it>and <it>C. hominis</it>, but not in other apicomplexans. The relationship of <it>C. parvum trpB </it>to proteobacterial sequences is well-supported as a monophyletic group by two of the three methods used in our analyses (Figure <figr fid="F4">4c</figr>).</p>
				<p>Other HGTs of eubacterial origin include the genes encoding &#945;-amylase and glutamine synthetase and two copies of 1,4-&#945;-glucan branching enzyme, all of which are overwhelmingly similar to eubacterial sequences. &#945;-amylase shows no significant hit to any other apicomplexan or eukaryotic sequence, suggesting a unique HGT from eubacteria to <it>C. parvum</it>. Glutamine synthetase is a eubacterial gene found in <it>C. parvum </it>and all apicomplexans examined. The eubacterial affinity of the apicomplexan glutamine synthetase is also demonstrated by a well supported (80% with maximum parsimony) monophyletic grouping with eubacterial homologs (data not shown). The eubacterial origin of 1,4-&#945;-glucan branching enzyme is shown in Figure <figr fid="F5">5</figr>. Each copy of the gene is found in a strongly supported monophyletic group of sequences derived only from prokaryotes (including cyanobacteria) and one other apicomplexan organism, <it>T. gondii</it>. It is possible that these genes are of plastidic origin and were transferred to the nuclear genome before the divergence of <it>C. parvum </it>and <it>T. gondii</it>; the phylogenetic analysis provides little direct support for this interpretation, however.</p>
				<fig id="F5">
					<title>
						<p>Figure 5</p>
					</title>
					<caption>
						<p>Phylogenetic analyses of 1,4-&#945;-glucan branching enzyme</p>
					</caption>
					<text>
						<p>Phylogenetic analyses of 1,4-&#945;-glucan branching enzyme. Numbers above the branches (where space permits) show the puzzle frequency (TREE-PUZZLE) and bootstrap support for both maximum parsimony and neighbor-joining analyses respectively; Asterisks indicate that support for this branch is below 50%. The scale is as indicated. GI accession numbers and alignment are provided in Additional data file 1.</p>
					</text>
					<graphic file="gb-2004-5-11-r88-5"/>
				</fig>
			</sec>
			<sec>
				<st>
					<p>Mode of acquisition</p>
				</st>
				<p>We examined the transferred genes for evidence of non-independent acquisition, for example, blocks of transferred genes or evidence that genes were acquired together from the same source. Examination of the chromosomal location of the genes listed in Table <tblr tid="T2">2</tblr> demonstrates that the genes are currently located on different chromosomes and in most cases do not appear to have been transferred or retained in large blocks. There are two exceptions. The <it>trpB </it>gene and the gene for aspartate ammonia ligase are located 4,881 base-pairs (bp) apart on the same strand of a contig for chromosome V; there is no annotated gene between these two genes. Both genes are of eubacterial origin and are not found in other apicomplexan organisms. While it is possible that they have been acquired independently with this positioning, or later came to have this positioning via genome rearrangements, it is interesting to speculate that these genes were acquired together. The origin of <it>trpB </it>is proteobacterial. The origin of aspartate ammonia ligase is eubacterial, but not definitively of any particular lineage. In the absence of genome sequences for all organisms, throughout all of time, exact donors are extremely difficult to assess and inferences must be drawn from sequences that appear to be closely related to the actual donor.</p>
				<p>In the second case, <it>C. parvum </it>encodes two genes for 1,4-&#945;-glucan branching enzymes. Both are eubacterial in origin and both are located on chromosome VI, although not close together. They are approximately 110 kb apart and many intervening genes are present. The evidence that these genes were acquired together comes from the phylogenetic analysis presented in Figure <figr fid="F5">5</figr>. The duplication that gave rise to the two 1,4-&#945;-glucan branching enzymes is old, and is well supported by the tree shown in Figure <figr fid="F5">5</figr>. A number of eubacteria (11), including cyanobacteria, contain this duplication. The 1,4-&#945;-glucan branching enzymes of <it>C. parvum </it>and <it>T. gondii </it>represent one copy each of this ancient duplication. This suggests that the ancestor of <it>C. parvum </it>and <it>T. gondii </it>acquired the genes after they had duplicated and diverged in eubacteria.</p>
			</sec>
			<sec>
				<st>
					<p>Expression of transferred genes</p>
				</st>
				<p>Each of the genes identified in the above analyses (Table <tblr tid="T2">2</tblr>) appears to be an intact non-pseudogene, suggesting that these genes are functional. To verify the functional status of several of the transferred genes, semi-quantitative reverse transcription PCR (RT-PCR) was carried out to characterize their developmental expression profile. Each of the RNA samples from <it>C. parvum</it>-infected HCT-8 cells was shown to be free of contaminating <it>C. parvum </it>genomic DNA by the lack of amplification product from a reverse transcriptase reaction sham control. RT-PCR detected no signals in cDNA samples from mock-infected HCT-8 cells. On the other hand, RT-PCR product signals were detected in the <it>C. parvum</it>-infected cells of six independent time-course experiments for each of the genes examined (those for G6PI, leucine aminopeptidase, BT-1, a calcium-dependent protein kinase, tyrosyl-tRNA synthetase, dihydrofolate reductase- thymidine synthetase (DHFR-TS)). The expression profiles of the acquired genes show that they are regulated and differentially expressed throughout the life cycle of <it>C. parvum </it>in patterns characteristic of other non-transferred genes (Figure <figr fid="F6">6</figr>).</p>
				<fig id="F6">
					<title>
						<p>Figure 6</p>
					</title>
					<caption>
						<p>Expression profiles of select genes in <it>C. parvum</it>-infected HCT-8 cells</p>
					</caption>
					<text>
						<p>Expression profiles of select genes in <it>C. parvum</it>-infected HCT-8 cells. The expression level of each gene is calculated as the ratio of its RT-PCR product to that of <it>C. parvum </it>18s rRNA. <b>(a) </b>glucose-6-phospate isomerase; <b>(b) </b>leucine aminopeptidase; <b>(c) </b>pteridine transporter (BT-1); <b>(d) </b>tyrosyl-tRNA synthetase; <b>(e) </b>calcium-dependent protein kinase; <b>(f) </b>dihydrofolate reductase-thymidine synthetase (DHFR-TS). The genes examined in (a-c, e) represent transferred genes of different origins; (d, f) represent non-transferred references. Error bars show the standard deviation of the mean of six independent time-course experiments.</p>
					</text>
					<graphic file="gb-2004-5-11-r88-6"/>
				</fig>
				<p>A small published collection of 567 EST sequences for <it>C. parvum </it>is also available. These ESTs were searched with each of the 31 candidate genes surviving the phylogenomic screen. Three genes - aspartate ammonia ligase, BT-1 and lactate dehydrogenase - are expressed, as confirmed by the presence of an EST (Table <tblr tid="T2">2</tblr>).</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Discussion</p>
			</st>
			<p>A genome-wide search for intracellular and horizontal gene transfers in <it>C. parvum </it>was carried out. We systematically determined the evolutionary origins of genes in the genome using phylogenetic approaches, and further confirmed the existence and expression of putatively transferred genes with laboratory experiments. The methodology adopted in this study provides a broad picture of the extent and the importance of gene transfer in apicomplexan evolution.</p>
			<p>The identification of gene transfers is often subject to errors introduced by methodology, data quality and taxonomic sampling. The phylogenetic approach adopted in this study is preferable to similarity searches <abbrgrp><abbr bid="B33">33</abbr><abbr bid="B34">34</abbr></abbrgrp> but several factors, including long-branch attraction, mutational saturation, lineage-specific gene loss and acquisition, and incorrect identification of orthologs, can distort the topology of a gene tree <abbrgrp><abbr bid="B35">35</abbr><abbr bid="B36">36</abbr></abbrgrp>. Incompleteness in the taxonomic record may also lead to false positives for IGT and HGT identification. In our study, we have attempted to alleviate these factors, as best as is possible, by sampling the GenBank nonredundant protein database, dbEST and organism-specific databases and by using several phylogenetic methods. Still, these issues remain a concern for this study as the taxonomic diversity of unicellular eukaryotes is vastly undersampled and studies are almost entirely skewed towards parasitic organisms.</p>
			<p>The published analysis of the <it>C. parvum </it>genome sequence identified 14 bacteria-like and 15 plant-like genes based on similarity searches <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>. Six of these bacterial-like and three plant-like genes were also identified as probable transferred genes in the phylogenomic analyses presented here. We have examined the fate of genes identified by one analysis and not the other to uncover the origin of the discrepancy. First, methodology is the single largest contributing factor. Genes with bacterial-like or plant-like BLAST similarities which, from the phylogenetic analyses, do not appear to be transfers were caused by the fact that PyPhy was unable to generate trees due to an insufficient number of significant hits in the database, or because of the stringent coverage length and similarity requirements adopted in this analysis. Only seven of the previously identified 15 plant-like and 11 of 14 eubacterial-like genes survived the predefined criteria for tree construction. Second, subsequent phylogenetic analyses including additional sequences from non-GenBank databases failed to provide sufficient evidence or significant support for either plant or eubacterial ancestry. Third, searches of dbEST and other organism-specific databases yielded other non-plant or non-eubacterial organisms as nearest neighbors, thus removing the possibility of a transfer.</p>
			<p>The limitations of similarity searches and incomplete taxonomic sampling are well evidenced in our phylogenomic analyses. From similarity searches, <it>C. parvum</it>, like <it>P. falciparum </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp>, is more similar to the plants <it>Arabidopsis </it>and <it>Oryza </it>than to any other single organism. Almost 800 predicted genes have best non-apicomplexan BLAST hits of at least 10<sup>-7 </sup>to plants and eubacteria (Table <tblr tid="T1">1</tblr>). Yet only 31 can be inferred to be transferred genes at this time with the datasets and methodology available (Table <tblr tid="T2">2</tblr>). In many cases (for example, phosphoglucomutase) the <it>C. parvum </it>gene groups phylogenetically with plant and bacterial homologs, but with only modest support. In other cases, such as pyruvate kinase and the bi-functional dehydrogenase enzyme (AdhE), gene trees obtained from automated PyPhy analyses indicate a strong monophyletic grouping of the <it>C. parvum </it>gene with plant or eubacterial homologs, but this topology disappears when sequences from other unicellular eukaryotes, such as <it>Dictyostelium</it>, <it>Entamoeba </it>and <it>Trichomonas </it>are included in the analysis (data not shown).</p>
			<p>The list of genes in Table <tblr tid="T2">2</tblr> should be considered a current best estimate of the IGTs and HGTs in <it>C. parvum </it>instead of a definitive list. As genomic data are obtained from a greater diversity of unicellular eukaryotes and eubacteria, phylogenetic analyses of nearest neighbors are likely to change.</p>
			<sec>
				<st>
					<p>Did <it>Cryptosporidium</it> contain an endosymbiont or plastid organelle?</p>
				</st>
				<p>The <it>C. parvum </it>sequences of cyanobacterial and algal origin reported here had to enter the genome at some point during its evolution. Formal possibilities include vertical inheritance from a plastid-containing chromalveolate ancestor, HGT from the cyanobacterial and algal sources (or from a secondary source such as a plastid-containing apicomplexan), or IGT from an endosymbiont/plastid organelle during evolution, followed by loss of the source. <it>Cryptosporidium </it>does not harbor an apicoplast organelle or any trace of a plastid genome <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>; thus an IGT scenario would necessitate loss of the organelle in <it>Cryptosporidium </it>or the lineage giving rise to it. The exact position of <it>C. parvum </it>on the tree of life has been debated, with developmental and morphological considerations placing it within the Apicomplexa, and molecular analyses locating it in various positions, both within and outside the Apicomplexa <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, but primarily within. If we assume that <it>C. parvum </it>is an apicomplexan, and if the secondary endosymbiosis which is believed to have given rise to the apicoplast occurred before the formation of the Apicomplexa, as has been suggested <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>, <it>C. parvum </it>would have evolved from a plastid-containing lineage and would be expected to harbor traces of this relationship in its nuclear genome. Genes of likely cyanobacterial and algal/plant origin are detected in the nuclear genome of <it>C. parvum </it>(Table <tblr tid="T2">2</tblr>) and thus IGT followed by organelle loss cannot be ruled out.</p>
				<p>What about other interpretations? While it is formally possible that these genes were acquired independently via HGT in <it>C. parvum</it>, their shared presence in other alveolates (including the non-plastidic ciliate <it>Tetrahymena</it>) provides the best evidence against this scenario as multiple independent transfers would be required and so far there is no evidence for intra-alveolate gene transfer. Vertical inheritance is more difficult to address as it involves distinguishing between genes acquired via IGT from a primary endosymbiotic event versus a secondary endosymbioic event. Our data, especially the analysis of G6PI and BT-1 are consistent with both primary and secondary endosymbioses, provided that the secondary endosymbiosis is pre-alveolate in origin. As more genome data become available and flanking genes can be examined for each gene in a larger context, positional information will be informative in distinguishing among the alternatives.</p>
				<p>The plastidic nature of some genes is particularly apparent. There is a shared indel among leucine aminopeptidase protein sequences in apicomplexans, cyanobacteria and plant chloroplast precursors (Figure <figr fid="F3">3</figr>). The <it>C. parvum </it>leucine aminopeptidase does contain an amino-terminal extension of approximately 85-65 amino acids (depending on the alignment) relative to bacterial homologs, but this extension does not contain a signal sequence. The extension in <it>P. falciparum </it>is 85 amino acids and the protein is believed to be targeted to the apicoplast <abbrgrp><abbr bid="B26">26</abbr><abbr bid="B37">37</abbr></abbrgrp>. No similarity is detected between the <it>C. parvum </it>and <it>P. falciparum </it>amino-terminal extensions (data not shown).</p>
				<p>Other genes were less informative in this analysis. Among these, aldolase was reported in both <it>P. falciparum </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp> and the kinetoplastid parasite <it>Trypanosoma </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp> as a plant-like gene. The protein sequences of aldolase are similar in <it>C. parvum </it>and <it>P. falciparum</it>, with an identity of 60%. In our phylogenetic analyses, <it>C. parvum </it>clearly forms a monophyletic group with <it>Plasmodium</it>, <it>Toxoplasma </it>and <it>Eimeria</it>. This branch groups with <it>Dictyostelium</it>, Kinetoplastida and cyanobacterial lineages, but bootstrap support is not significant. The sister group to the above organisms are the plants and additional cyanobacteria, but again with no bootstrap support (see Additional data file 1 for phylogenetic tree). Another gene, enolase, contains two indels shared between land plants and apicomplexans (including <it>C. parvum</it>) and was suggested to be a plant-like gene <abbrgrp><abbr bid="B29">29</abbr></abbrgrp>, but alternative explanations exist <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>.</p>
				<p>The biochemical activity of the polyamine biosynthetic enzyme arginine decarboxylase (ADC), which is typically found in plants and bacteria, was previously reported in <it>C. parvum </it><abbrgrp><abbr bid="B19">19</abbr></abbrgrp>. However, we were unable to confirm its presence by similarity searches of the two <it>Cryptosporidium </it>genome sequences deposited in CryptoDB using plant (<it>Cucumis sativa</it>, GenBank accession number AAP36992), cyanobacterial (<it>Nostoc </it>sp., NP-487441; <it>Synechocystis </it>sp., NP-439907) and other bacterial (<it>Yersinia pestis</it>, NP-404547) homologs.</p>
			</sec>
			<sec>
				<st>
					<p>A plethora of prokaryotic genes</p>
				</st>
				<p>Several HGTs from bacteria have been reported previously in <it>C. parvum </it><abbrgrp><abbr bid="B25">25</abbr><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp>. We detected many more in our screen of the completed <it>C. parvum </it>genome sequence (Table <tblr tid="T2">2</tblr>). In most cases, the exact donors of these transferred genes were difficult to determine. However, for those genes whose donors could be more reliably inferred (Table <tblr tid="T2">2</tblr>), several appear to be from different sources and hence represent independent transfer events. In one compelling case, both the <it>trpB </it>and aspartate ammonia ligase genes are located 4,881 bp apart on the same strand of a contig for chromosome V and there is no gene separating them. Both genes are of eubacterial origin and neither gene is detected in other apicomplexans. In addition, the aspartate ammonia ligase gene is expressed, as evidenced by an EST. In another case, copies of a 1,4-&#945;-glucan branching enzyme gene duplication pair that is present in many eubacteria, were detected on the same chromosome in <it>C. parvum</it>. <it>C. parvum </it>also contains many transferred genes from distinct eubacterial sources that are not present in other apicomplexans (for example, IMPDH, TK (thymidine kinase), <it>trpB </it>and the gene for aspartate ammonia ligase).</p>
				<p>The endosymbiotic event that gave rise to the mitochondrion occurred very early in eukaryotic evolution and is associated with significant IGT. However, most of these transfer events happened long before the evolutionary time window we explored in this study <abbrgrp><abbr bid="B41">41</abbr></abbrgrp>. Many IGTs from the mitochondrial genome that have been retained are almost universally present in eukaryotes (including <it>C. parvum </it>which does not contain a typical mitochondrion <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B42">42</abbr><abbr bid="B43">43</abbr><abbr bid="B44">44</abbr></abbrgrp>) and thus would not be detected in a PyPhy screen since the 'nearest phylogenetic neighbor' on the tree would be taxonomically correct and not appear as a relationship indicative of a gene transfer.</p>
			</sec>
			<sec>
				<st>
					<p>The impact of gene transfers on host evolution</p>
				</st>
				<p>Gene transfer is an important evolutionary force <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B45">45</abbr><abbr bid="B46">46</abbr></abbrgrp>. Several of the transferred genes identified in <it>C. parvum </it>are known to be expressed. IMPDH has been shown to be essential in <it>C. parvum </it>purine metabolism <abbrgrp><abbr bid="B32">32</abbr></abbrgrp> and TK has been shown to be functional in pyrimidine salvage <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. It is not yet clear whether these genes were acquired independently in this lineage, or have been lost from the rest of the apicomplexan lineage, or whether both these have happened. However, it is clear that their presence has facilitated the remodeling of nucleotide biosynthesis. <it>C. parvum </it>no longer possesses the ability to synthesize nucleotides; instead it relies entirely on salvage.</p>
				<p>Many apicoplast and algal nuclear genes have been transferred to the host nuclear genome, where they were subsequently translated in the cytosol and their proteins targeted to the apicoplast organelle. However, as there is no apicoplast in <it>C. parvum</it>, acquired plastidic proteins are theoretically destined to go elsewhere. In the absence of an apicoplast, it is tempting to suspect that plastid-targeted proteins would have been lost, or would be detected as pseudogenes. No identifiable pseudogenes were detected and at least one gene is still viable. The <it>C. parvum </it>leucine aminopeptidase, which still contains an amino-terminal extension (without a signal peptide), is intact and is expressed, as shown in Figure <figr fid="F6">6</figr>. None of the cyanobacterial/algal genes identified in our study contains a canonical presequence for apicoplast targeting. One exception to this is phosphoglucomutase, a gene not present in Table <tblr tid="T2">2</tblr> because of its poorly supported relationships in phylogenetic analyses. This gene exists in two copies as a tandem duplication in the <it>C. parvum </it>genome. One copy has a long amino-terminal extension (97 amino acids) beginning with a signal peptide. The extension does not contain characteristics of a transit peptide. Expression of a fluorescent reporter construct containing this extension in a related parasite, <it>T. gondii</it>, did not reveal apicoplast targeting but instead secretion via dense granules (see Additional data file 1). Exactly how and where intracellularly transferred genes (especially those that normally target the apicoplast) have become incorporated into other metabolic processes remains a fertile area for exploration.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st>
			<p><it>Cryptosporidium </it>is the recipient of a large number (31) of transferred genes, many of which are not shared by other apicomplexan parasites. The genes have been acquired from several different sources including &#945;-, &#946;-, and &#949;-proteobacteria, cyanobacteria, algae/plants and possibly the Archaea. We have described two cases of two genes that appear to have been acquired together from a eubacterial source: <it>trpB </it>and the aspartate ammonia ligase gene are located within 5 kb of each other, while the two copies of 1,4-&#945;-glucan branching enzyme represent copies of an ancient gene duplication also observed in cyanobacteria.</p>
			<p>Once thought to be a relatively rare event, reports of gene transfers in eukaryotes are increasingly common. The abundance of available eukaryotic genome sequence is providing the material for analyses that were not possible only a few years ago. Analysis of the <it>Arabidopsis </it>genome <abbrgrp><abbr bid="B47">47</abbr></abbrgrp> has revealed potentially thousands of genes that were transferred intracellularly. HGTs are still a relatively rare class of genes among multicellular eukaryotes, most probably because of the segregation of the germ line. By definition, unicellular eukaryotes do not have a separate germ line and are obligated to tolerate the acquisition of foreign genes if they are to survive. Among unicellular eukaryotes, there are now many reports of HGTs: <it>Giardia </it><abbrgrp><abbr bid="B48">48</abbr><abbr bid="B49">49</abbr></abbrgrp>, <it>Trypanosoma </it><abbrgrp><abbr bid="B38">38</abbr></abbrgrp>, <it>Entamoeba </it><abbrgrp><abbr bid="B21">21</abbr><abbr bid="B49">49</abbr></abbrgrp>, <it>Euglena </it><abbrgrp><abbr bid="B50">50</abbr></abbrgrp>, <it>Cryptosporidium </it><abbrgrp><abbr bid="B25">25</abbr><abbr bid="B32">32</abbr><abbr bid="B40">40</abbr></abbrgrp> and other apicomplexans <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>.</p>
			<p>As discussed earlier, genes transferred from distant phylogenetic sources such as eubacteria could be potential therapeutic targets. In apicomplexans, transferred genes are already some of the most promising targets of anti-parasitic drugs and vaccines <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B25">25</abbr><abbr bid="B52">52</abbr></abbrgrp>. We have shown that several transferred genes are differentially expressed in the <it>C. parvum </it>genome, and in two cases (IMPDH and TK), the transferred genes have been shown to be functional <abbrgrp><abbr bid="B25">25</abbr><abbr bid="B32">32</abbr></abbrgrp>. The successful integration, expression and survival of transferred genes in the <it>Cryptosporidium </it>genome has changed the genetic and metabolic repertoire of the parasite.</p>
		</sec>
		<sec>
			<st>
				<p>Materials and methods</p>
			</st>
			<sec>
				<st>
					<p>Cryptosporidium sequence sources</p>
				</st>
				<p>Genomic sequences for <it>C. parvum </it>and <it>C. hominis </it>were downloaded from CryptoDB <abbrgrp><abbr bid="B53">53</abbr></abbrgrp>. Genes were predicted for the completed <it>C. parvum </it>(IOWA) sequence as previously described using the Glimmer program <abbrgrp><abbr bid="B54">54</abbr></abbrgrp> trained on <it>Cryptosporidium </it>coding sequences <abbrgrp><abbr bid="B52">52</abbr></abbrgrp>. A few predicted genes that demonstrated apparent sequence incompleteness were reconstructed from genomic sequence by comparison with apicomplexan orthologs. The predicted protein encoding data set contained 5,519 sequences. A comparison of this gene set to the published annotation revealed that the Glimmer-predicted gene set contained all but 40 of the 3,396 annotated protein-encoding sequences deposited in GenBank. These 40 were added to our dataset and analyzed. Glimmer does not predict introns and some introns are present in the genome <abbrgrp><abbr bid="B7">7</abbr><abbr bid="B20">20</abbr></abbrgrp>; thus our gene count is artificially inflated. Likewise, the official <it>C. parvum </it>annotation did not consider ORFs of less than 100 amino acids that did not have significant BLAST hits and thus may be a slight underestimate <abbrgrp><abbr bid="B7">7</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Database creation</p>
				</st>
				<p>An internal database (ApiDB) containing all available apicomplexan sequence data was created <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. A second BLAST-searchable database, PyPhynr, was constructed that included SwissProt, TrEMBL and TrEMBL_new, as released in August 2003, predicted genes from <it>C. parvum</it>, ORFs of more than 120 amino acids from <it>Theileria annulata</it>, and more than 75 amino acids from consensus ESTs for several apicomplexan organisms. Genomic sequences for <it>T. gondii </it>(8x coverage) and clustered ESTs were downloaded from ToxoDB <abbrgrp><abbr bid="B55">55</abbr><abbr bid="B56">56</abbr></abbrgrp>. Genomic data were provided by The Institute for Genomic Research (TIGR), and by the Sanger Institute. EST sequences were generated by Washington University. In addition, this study used sequence data from several general and species-specific databases. Specifically, the NCBI GenBank nr and dbEST were downloaded <abbrgrp><abbr bid="B57">57</abbr></abbrgrp> and extensively searched. To provide taxonomic completeness, additional genes were obtained via searches of additional databases including: <it>Entamoeba histolytica </it><abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, <it>D. discoideum </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, the kinetoplastids <it>Leishmania major </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, <it>T. brucei </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, <it>T. cruzi </it><abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, and a ciliate <it>Tetrahymena thermophila </it><abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Sequence data for <it>T. annulata</it>, <it>E. histolytica</it>, <it>D. discoideum</it>, <it>L. major </it>and <it>T. brucei </it>were produced by the Pathogen Sequencing Unit of the Sanger Institute and can be obtained from <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Preliminary sequence data for <it>T. thermophila </it>was obtained from TIGR and can be accessed at <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Phylogenomic analyses and similarity searches</p>
				</st>
				<p>The source code of the phylogenomic software PyPhy <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> was kindly provided by Thomas Sicheritz-Ponten and modified to include analyses of eukaryotic groups, and changes to improve functionality <abbrgrp><abbr bid="B51">51</abbr></abbrgrp>. For initial phylogenomic analyses, a BLAST cutoff of 60% sequence length coverage and 50% sequence similarity was adopted and the neighbor-joining program of PAUP 4.0b10 for Unix <abbrgrp><abbr bid="B64">64</abbr></abbrgrp> was used. A detailed description of our phylogenomic pipeline and PyPhy implementation are described <abbrgrp><abbr bid="B51">51</abbr></abbrgrp> and outlined in Figure <figr fid="F1">1</figr>.</p>
				<p>Output gene trees with phylogenetic connections (that is, the nearest non-self neighbors at a distinct taxonomic rank) <abbrgrp><abbr bid="B27">27</abbr></abbrgrp> to prokaryotes and algae-related groups were manually inspected. As the trees are unrooted, several factors were considered in the screen for candidate transferred genes. If the <it>C. parvum </it>gene does not form a monophyletic group with prokaryotic or plant-related taxa regardless of rooting, the subject gene was eliminated from further consideration. If the topology of the gene tree is consistent with a phylogenetic anomaly caused by gene transfer, but may also be interpreted differently if the tree is rooted otherwise, it was removed from consideration at this time. If the top hits of both nr and dbEST database searches are predominantly non-plant eukaryotes, and the topology of the tree was poor, the subject gene was considered an unlikely candidate. Finally, all 551 protein sequences predicted to be NEAPs in the malarial parasite <it>P. falciparum </it><abbrgrp><abbr bid="B26">26</abbr></abbrgrp> were used to search the <it>C. parvum </it>genome and the results were screened using a BLAST cutoff E-value of 10<sup>-5 </sup>and a length coverage of 50%. Sequences identified by these searches were added to the candidate list (if not already present) for manual phylogenetic analyses to verify their likely origins. It should be noted that all trees were screened for the existence of a particular phylogenetic relationship. In some cases the proteins utilized to generate a particular tree are capable of resolving relationships among many branches of the tree of life, and in others they are not. Despite these differences in resolving power, the proteins which survive our phylogenetic screen and subsequent detailed analyses described below exhibit significant support for the branches of the tree in which we are interested. Similar procedures were used to characterize the complement of nuclear-encoded genes of plastid origin in the <it>Arabidopsis </it>genome <abbrgrp><abbr bid="B65">65</abbr></abbrgrp>. BLAST searches were performed on GenBank releases 138-140 <abbrgrp><abbr bid="B57">57</abbr></abbrgrp>.</p>
				<p>Detailed phylogenetic analyses of candidate genes identified by phylogenomic screening: candidate genes surviving the PyPhy phylogenomic screen were reanalyzed with careful attention to taxonomic completeness, including representative species from major prokaryotic and eukaryotic lineages when necessary and possible. New multiple sequence alignments were created with ClustalX <abbrgrp><abbr bid="B66">66</abbr></abbrgrp>, followed by manual refinement. Only unambiguously aligned sequence segments were used for subsequent analyses (see Additional data file 1). Phylogenetic analyses were performed with a maximum likelihood method using TREE-PUZZLE version 5.1 for Unix <abbrgrp><abbr bid="B67">67</abbr></abbrgrp>, a distance method using the program neighbor of PHYLIP version 3.6a package <abbrgrp><abbr bid="B68">68</abbr></abbrgrp>, and a maximum parsimony method with random stepwise addition using PAUP* 4.0b10 <abbrgrp><abbr bid="B64">64</abbr></abbrgrp>. Bootstrap support was estimated using 1,000 replicates for both parsimony and distance analyses and quartet puzzling values were obtained using 10,000 puzzling steps for maximum likelihood analyses. Distance calculation used the Jones-Taylor-Thornton (JTT) substitution matrix <abbrgrp><abbr bid="B69">69</abbr></abbrgrp>, and site-substitution variation was modeled with a gamma-distribution whose shape parameter was estimated from the data. For maximum likelihood analyses, a mixed model of eight gamma-distributed rates and one invariable rate was used to calculate the pairwise maximum likelihood distances. The unrooted trees presented in Figures <figr fid="F4">4</figr> and <figr fid="F5">5</figr> were drawn by supplying TREE-PUZZLE with the maximum parsimony tree and using TREE-PUZZLE distances as described above to calculate the branch lengths. The trees were visualized and prepared for publication with TreeView X Version 0.4.1 <abbrgrp><abbr bid="B70">70</abbr></abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Genomic Southern analysis</p>
				</st>
				<p><it>C. parvum </it>(IOWA) oocysts (10<sup>8</sup>) were obtained from the Sterling Parasitology Laboratory at the University of Arizona and were lysed using a freeze/thaw method. Genomic DNA was purified using the DNeasy Tissue Kit (Qiagen). Genomic DNA (5 &#956;g) was restricted with <it>Bam</it>H1 and <it>Eco</it>R1 respectively and electrophoresed on a 0.8% gel in 1x TAE buffer, transferred to a positively charged nylon membrane (Bio-Rad), and fixed using a UVP crosslinker set at 125 mJ as described in <abbrgrp><abbr bid="B71">71</abbr></abbrgrp>. <it>C. parvum </it>genomic DNA for the probes (700-1,500 bp) was amplified by PCR (see Additional data file 1).</p>
			</sec>
			<sec>
				<st>
					<p>Semi-quantitative reverse transcription-PCR</p>
				</st>
				<p>Sterilized <it>C. parvum </it>(IOWA isolate) oocysts were used to infect confluent human adenocarcinoma cell monolayers at a concentration of one oocyst per cell as previously described <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. Total RNA was prepared from mock-infected and <it>C. parvum</it>-infected HCT-8 cultures at 2, 6, 12, 24, 36, 48 and 72 h post-inoculation by directly lysing the cells with 4 ml TRIzol reagent (GIBCO-BRL/Life Technologies). Purified RNA was resuspended in RNAse-free water and the integrity of the samples was confirmed by gel electrophoresis.</p>
				<p>Primers specific for several transferred genes identified in the study were designed (see Additional data file 1) and a semi-quantitative RT-PCR analysis was carried out as previously described <abbrgrp><abbr bid="B72">72</abbr></abbrgrp>. Primers specific for <it>C. parvum </it>18S rRNA were used to normalize the amount of cDNA product of the candidate gene to that of <it>C. parvum </it>rRNA in the same sample. PCR products were separated on a 4% non-denaturing polyacrylamide gel and signals from specific products were captured and quantified using a phosphorimaging system (Molecular Dynamics). The expression level of each gene at each time point was calculated as the ratio of its RT-PCR product signal to that of the <it>C. parvum </it>18S rRNA. Six independent time-course experiments were used in the analysis.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Additional data files</p>
			</st>
			<p>Additional data is provided with the online version of this paper, consisting of a PDF file (Additional data file <supplr sid="s1">1</supplr>) containing: materials and methods for genomic Southern analysis; the amino-acid sequences of genes listed in Table <tblr tid="T2">2</tblr>; accession numbers for sequences used in Figure <figr fid="F4">4</figr>; accession numbers for sequences used in Figure <figr fid="F5">5</figr>; expression of <it>C. parvum </it>phosphoglucomutase in <it>T. gondii</it>; table of primers used for RT-PCR experiments; phylogenetic tree of aldolase; alignment files for phylogenetic analyses in Figure <figr fid="F4">4</figr>; and the alignment of 1,4-&#945;-glucan branching enzyme sequences used in Figure <figr fid="F5">5</figr>.</p>
			<suppl id="s1">
				<title>
					<p>Additional data file 1</p>
				</title>
				<caption>
					<p>Additional materials and methods, sequence accession numbers and alignments</p>
				</caption>
				<text>
					<p>This file contains the materials and methods for genomic Southern analysis; the amino-acid sequences of genes listed in Table 2; accession numbers for sequences used in Figure 4; accession numbers for sequences used in Figure 5; expression of <it>C. parvum </it>phosphoglucomutase in <it>T. gondii</it>; table of primers used for RT-PCR experiments; phylogenetic tree of aldolase; alignment files for phylogenetic analyses in Figure 4; and the alignment of 1,4-&#945;-glucan branching enzyme sequences used in Figure 5</p>
				</text>
				<file name="gb-2004-5-11-r88-s1.pdf">
					<p>Click here for additional data file</p>
				</file>
			</suppl>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgements</p>
				</st>
				<p>We thank G. Buck (Virginia Commonwealth University), G. Widmer and S. Tzipori (Tufts University) for access to <it>C. hominis </it>genotype I genomic sequence data. Genomic sequence for <it>Toxoplasma gondii </it>(8X coverage) and clustered ESTs were downloaded from ToxoDB <abbrgrp><abbr bid="B56">56</abbr></abbrgrp>. Genomic data were provided by The Institute for Genomic Research (supported by the NIH grant #AI05093), and by the Sanger Institute (Wellcome Trust). Apicomplexan EST sequences were generated by Washington University (NIH grant #1R01AI045806-01A1). Occasionally, genes were obtained via searches of several databases containing <it>Entamoeba histolytica </it><abbrgrp><abbr bid="B58">58</abbr></abbrgrp>, <it>Dictyostelium discoideum </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, kinetoplastids <it>Leishmania major </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, <it>Trypanosoma brucei </it><abbrgrp><abbr bid="B59">59</abbr></abbrgrp>, <it>T. cruzi </it><abbrgrp><abbr bid="B60">60</abbr></abbrgrp>, and ciliate <it>Tetrahymena thermophila </it><abbrgrp><abbr bid="B61">61</abbr></abbrgrp>. Sequence data for <it>T. annulata</it>, <it>E. histolytica</it>, <it>D. discoideum</it>, <it>Leishmania major </it>and <it>T. brucei </it>were produced by the Pathogen Sequencing Unit of the Sanger Institute and can be obtained from <abbrgrp><abbr bid="B62">62</abbr></abbrgrp>. Preliminary sequence data for <it>Tetrahymena thermophila </it>was obtained from The Institute for Genomic Research and can be accessed at <abbrgrp><abbr bid="B63">63</abbr></abbrgrp>. We thank Fallon Hampton for her work on the phosphoglucomutase expression constructs. She was supported by a summer undergraduate fellowship in genetics (SUNFIG) award. This study was funded by a research grant from the University of Georgia Research Foundation to J.C.K. and NIH grant U01 AI 46397 to M.S.A. J.H. is supported by a postdoctoral fellowship from the American Heart Association. We thank Boris Striepen, Marc-Jan Gubbels and three anonymous reviewers for comments that greatly increased the clarity and precision of the analyses in the manuscript.</p>
			</sec>
		</ack>
		<refgrp>
			<bibl id="B1">
				<title>
					<p>Cryptosporidiosis and the challenges of chemotherapy.</p>
				</title>
				<aug>
					<au>
						<snm>Mead</snm>
						<fnm>JR</fnm>
					</au>
				</aug>
				<source>Drug Resistance Updates</source>
				<pubdate>2002</pubdate>
				<volume>5</volume>
				<fpage>47</fpage>
				<lpage>57</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1368-7646(02)00011-0</pubid>
						<pubid idtype="pmpid" link="fulltext">12127863</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B2">
				<title>
					<p>CDC: bioterrorism agents/diseases</p>
				</title>
				<url>http://www.bt.cdc.gov/agent/agentlist.asp</url>
			</bibl>
			<bibl id="B3">
				<title>
					<p>What is the phylogenetic position of <it>Cryptosporidium</it>?</p>
				</title>
				<aug>
					<au>
						<snm>Zhu</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Keithly</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Philippe</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Int J Syst Evol Microbiol</source>
				<pubdate>2000</pubdate>
				<volume>50</volume>
				<fpage>1673</fpage>
				<lpage>1681</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10939675</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B4">
				<title>
					<p>Phylogeny of gregarines (Apicomplexa) as inferred from small-subunit rDNA and beta-tubulin.</p>
				</title>
				<aug>
					<au>
						<snm>Leander</snm>
						<fnm>BS</fnm>
					</au>
					<au>
						<snm>Clopton</snm>
						<fnm>RE</fnm>
					</au>
					<au>
						<snm>Keeling</snm>
						<fnm>PJ</fnm>
					</au>
				</aug>
				<source>Int J Syst Evol Microbiol</source>
				<pubdate>2003</pubdate>
				<volume>53</volume>
				<fpage>345</fpage>
				<lpage>354</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/ijs.0.02284-0</pubid>
						<pubid idtype="pmpid" link="fulltext">12656194</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B5">
				<title>
					<p><it>Cryptosporidium </it>is more closely related to the gregarines than to coccidia as shown by phylogenetic analysis of apicomplexan parasites inferred using small-subunit ribosomal RNA gene sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Carreno</snm>
						<fnm>RA</fnm>
					</au>
					<au>
						<snm>Martin</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Barta</snm>
						<fnm>JR</fnm>
					</au>
				</aug>
				<source>Parasitol Res</source>
				<pubdate>1999</pubdate>
				<volume>85</volume>
				<fpage>899</fpage>
				<lpage>904</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s004360050655</pubid>
						<pubid idtype="pmpid" link="fulltext">10540950</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B6">
				<title>
					<p><it>Cryptosporidium parvum </it>appears to lack a plastid genome.</p>
				</title>
				<aug>
					<au>
						<snm>Zhu</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Marchewka</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Keithly</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>Microbiology</source>
				<pubdate>2000</pubdate>
				<volume>146</volume>
				<fpage>315</fpage>
				<lpage>321</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10708370</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B7">
				<title>
					<p>Complete genome sequence of the apicomplexan, <it>Cryptosporidium parvum</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Abrahamsen</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Templeton</snm>
						<fnm>TJ</fnm>
					</au>
					<au>
						<snm>Enomoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Abrahante</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Lancto</snm>
						<fnm>CA</fnm>
					</au>
					<au>
						<snm>Deng</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Widmer</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Tzipori</snm>
						<fnm>S</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>2004</pubdate>
				<volume>304</volume>
				<fpage>441</fpage>
				<lpage>445</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1094786</pubid>
						<pubid idtype="pmpid" link="fulltext">15044751</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B8">
				<title>
					<p>Nuclear-encoded, plastid-targeted genes suggest a single common origin for apicomplexan and dinoflagellate plastids.</p>
				</title>
				<aug>
					<au>
						<snm>Fast</snm>
						<fnm>NM</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Roos</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Keeling</snm>
						<fnm>PJ</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2001</pubdate>
				<volume>18</volume>
				<fpage>418</fpage>
				<lpage>426</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">11230543</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B9">
				<title>
					<p>A plastid of probable green algal origin in apicomplexan parasites.</p>
				</title>
				<aug>
					<au>
						<snm>Kohler</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Delwiche</snm>
						<fnm>CF</fnm>
					</au>
					<au>
						<snm>Denny</snm>
						<fnm>PW</fnm>
					</au>
					<au>
						<snm>Tilney</snm>
						<fnm>LG</fnm>
					</au>
					<au>
						<snm>Webster</snm>
						<fnm>P</fnm>
					</au>
					<au>
						<snm>Wilson</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Palmer</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Roos</snm>
						<fnm>DS</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>1997</pubdate>
				<volume>275</volume>
				<fpage>1485</fpage>
				<lpage>1489</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.275.5305.1485</pubid>
						<pubid idtype="pmpid" link="fulltext">9045615</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B10">
				<title>
					<p>Complete gene map of the plastid-like DNA of the malaria parasite <it>Plasmodium falciparum</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Wilson</snm>
						<fnm>RJ</fnm>
					</au>
					<au>
						<snm>Denny</snm>
						<fnm>PW</fnm>
					</au>
					<au>
						<snm>Preiser</snm>
						<fnm>PR</fnm>
					</au>
					<au>
						<snm>Rangachari</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Roberts</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Roy</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Whyte</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Strath</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Moore</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Moore</snm>
						<fnm>PW</fnm>
					</au>
					<etal/>
				</aug>
				<source>J Mol Biol</source>
				<pubdate>1996</pubdate>
				<volume>261</volume>
				<fpage>155</fpage>
				<lpage>172</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/jmbi.1996.0449</pubid>
						<pubid idtype="pmpid" link="fulltext">8757284</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B11">
				<title>
					<p>Evidence for a single origin of the 35 kb plastid DNA in apicomplexans.</p>
				</title>
				<aug>
					<au>
						<snm>Denny</snm>
						<fnm>PW</fnm>
					</au>
					<au>
						<snm>Preiser</snm>
						<fnm>PR</fnm>
					</au>
					<au>
						<snm>Williams</snm>
						<fnm>DC</fnm>
					</au>
					<au>
						<snm>Wilson</snm>
						<fnm>I</fnm>
					</au>
				</aug>
				<source>Protist </source>
				<pubdate>1998</pubdate>
				<volume>149</volume>
				<fpage>51</fpage>
				<lpage>59</lpage>
			</bibl>
			<bibl id="B12">
				<title>
					<p>Mining the <it>Plasmodium </it>genome database to define organellar function: what does the apicoplast do?</p>
				</title>
				<aug>
					<au>
						<snm>Roos</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Crawford</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Donald</snm>
						<fnm>RG</fnm>
					</au>
					<au>
						<snm>Fraunholz</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Harb</snm>
						<fnm>OS</fnm>
					</au>
					<au>
						<snm>He</snm>
						<fnm>CY</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Shaw</snm>
						<fnm>MK</fnm>
					</au>
					<au>
						<snm>Striepen</snm>
						<fnm>B</fnm>
					</au>
				</aug>
				<source>Philos Trans R Soc Lond B Biol Sci</source>
				<pubdate>2002</pubdate>
				<volume>357</volume>
				<fpage>35</fpage>
				<lpage>46</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmpid" link="fulltext">11839180</pubid>
						<pubid idtype="doi">10.1098/rstb.2001.1047</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B13">
				<title>
					<p>Membrane heredity and early chloroplast evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Cavalier-Smith</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Trends Plant Sci</source>
				<pubdate>2000</pubdate>
				<volume>5</volume>
				<fpage>174</fpage>
				<lpage>182</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S1360-1385(00)01598-3</pubid>
						<pubid idtype="pmpid" link="fulltext">10740299</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B14">
				<title>
					<p>Tracing the thread of plastid diversity through the tapestry of life.</p>
				</title>
				<aug>
					<au>
						<snm>Delwiche</snm>
						<fnm>CF</fnm>
					</au>
				</aug>
				<source>Am Nat</source>
				<pubdate>1999</pubdate>
				<volume>154</volume>
				<fpage>S164</fpage>
				<lpage>S177</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1086/303291</pubid>
						<pubid idtype="pmpid">10527925</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B15">
				<title>
					<p>The nucleomorph genomes of cryptophytes and chlorarachniophytes.</p>
				</title>
				<aug>
					<au>
						<snm>Maier</snm>
						<fnm>UG</fnm>
					</au>
					<au>
						<snm>Douglas</snm>
						<fnm>SE</fnm>
					</au>
					<au>
						<snm>Cavalier-Smith</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Protist</source>
				<pubdate>2000</pubdate>
				<volume>151</volume>
				<fpage>103</fpage>
				<lpage>109</lpage>
				<xrefbib>
					<pubid idtype="pmpid" link="fulltext">10965950</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B16">
				<title>
					<p>The single, ancient origin of chromist plastids.</p>
				</title>
				<aug>
					<au>
						<snm>Yoon</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Hackett</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Pinto</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Bhattacharya</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>15507</fpage>
				<lpage>15512</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">137747</pubid>
						<pubid idtype="pmpid" link="fulltext">12438651</pubid>
						<pubid idtype="doi">10.1073/pnas.242379899</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B17">
				<title>
					<p>Photosynthetic eukaryotes unite: endosymbiosis connects the dots.</p>
				</title>
				<aug>
					<au>
						<snm>Bhattacharya</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Yoon</snm>
						<fnm>HS</fnm>
					</au>
					<au>
						<snm>Hackett</snm>
						<fnm>JD</fnm>
					</au>
				</aug>
				<source>BioEssays</source>
				<pubdate>2004</pubdate>
				<volume>26</volume>
				<fpage>50</fpage>
				<lpage>60</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1002/bies.10376</pubid>
						<pubid idtype="pmpid" link="fulltext">14696040</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B18">
				<title>
					<p>Nucleus-encoded, plastid-targeted glyceraldehyde-3-phosphate dehydrogenase (GAPDH) indicates a single origin for chromalveolate plastids.</p>
				</title>
				<aug>
					<au>
						<snm>Harper</snm>
						<fnm>JT</fnm>
					</au>
					<au>
						<snm>Keeling</snm>
						<fnm>PJ</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2003</pubdate>
				<volume>20</volume>
				<fpage>1730</fpage>
				<lpage>1735</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/molbev/msg195</pubid>
						<pubid idtype="pmpid" link="fulltext">12885964</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B19">
				<title>
					<p>Polyamine biosynthesis in <it>Cryptosporidium parvum </it>and its implications for chemotherapy.</p>
				</title>
				<aug>
					<au>
						<snm>Keithly</snm>
						<fnm>JS</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>G</fnm>
					</au>
					<au>
						<snm>Upton</snm>
						<fnm>SJ</fnm>
					</au>
					<au>
						<snm>Woods</snm>
						<fnm>KM</fnm>
					</au>
					<au>
						<snm>Martinez</snm>
						<fnm>MP</fnm>
					</au>
					<au>
						<snm>Yarlett</snm>
						<fnm>N</fnm>
					</au>
				</aug>
				<source>Mol Biochem Parasitol</source>
				<pubdate>1997</pubdate>
				<volume>88</volume>
				<fpage>35</fpage>
				<lpage>42</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0166-6851(97)00063-7</pubid>
						<pubid idtype="pmpid">9274865</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B20">
				<title>
					<p>Integrated mapping, chromosomal sequencing and sequence analysis of <it>Cryptosporidium parvum</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Bankier</snm>
						<fnm>AT</fnm>
					</au>
					<au>
						<snm>Spriggs</snm>
						<fnm>HF</fnm>
					</au>
					<au>
						<snm>Fartmann</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Konfortov</snm>
						<fnm>BA</fnm>
					</au>
					<au>
						<snm>Madera</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Vogel</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Teichmann</snm>
						<fnm>SA</fnm>
					</au>
					<au>
						<snm>Ivens</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Dear</snm>
						<fnm>PH</fnm>
					</au>
				</aug>
				<source>Genome Res</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>1787</fpage>
				<lpage>1799</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">403770</pubid>
						<pubid idtype="pmpid" link="fulltext">12869580</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B21">
				<title>
					<p>The iron-sulfur cluster assembly genes <it>iscS</it> and <it>iscU</it> of <it>Entamoeba histolytica </it>were acquired by horizontal gene transfer.</p>
				</title>
				<aug>
					<au>
						<snm>Van Der Giezen</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Cox</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Tovar</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>BMC Evol Biol</source>
				<pubdate>2004</pubdate>
				<volume>4</volume>
				<fpage>7</fpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">373444</pubid>
						<pubid idtype="pmpid" link="fulltext">15040816</pubid>
						<pubid idtype="doi">10.1186/1471-2148-4-7</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B22">
				<title>
					<p>Horizontal gene transfer promoted evolution of the ability to propagate under anaerobic conditions in yeasts.</p>
				</title>
				<aug>
					<au>
						<snm>Gojkovic</snm>
						<fnm>Z</fnm>
					</au>
					<au>
						<snm>Knecht</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Zameitat</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Warneboldt</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Coutelis</snm>
						<fnm>JB</fnm>
					</au>
					<au>
						<snm>Pynyaha</snm>
						<fnm>Y</fnm>
					</au>
					<au>
						<snm>Neuveglise</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Moller</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Loffler</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Piskur</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Mol Genet Genomics</source>
				<pubdate>2004</pubdate>
				<volume>271</volume>
				<fpage>387</fpage>
				<lpage>393</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00438-004-0995-7</pubid>
						<pubid idtype="pmpid" link="fulltext">15014982</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B23">
				<title>
					<p>Genomics. Are there bugs in our genome?</p>
				</title>
				<aug>
					<au>
						<snm>Andersson</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Doolittle</snm>
						<fnm>WF</fnm>
					</au>
					<au>
						<snm>Nesbo</snm>
						<fnm>CL</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2001</pubdate>
				<volume>292</volume>
				<fpage>1848</fpage>
				<lpage>1850</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1062241</pubid>
						<pubid idtype="pmpid" link="fulltext">11358998</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B24">
				<title>
					<p><it>Toxoplasma gondii</it>: A model for evolutionary genomics and chemotherapy.</p>
				</title>
				<aug>
					<au>
						<snm>Kissinger</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Crawford</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Roos</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Ajioka</snm>
						<fnm>JW</fnm>
					</au>
				</aug>
				<source>In Pathogen Genomics: Impact on Human Health</source>
				<publisher>Totowa NJ: Humana Press</publisher>
				<editor>Shaw KJ</editor>
				<pubdate>2001</pubdate>
				<fpage>255</fpage>
				<lpage>279</lpage>
			</bibl>
			<bibl id="B25">
				<title>
					<p>Gene transfer in the evolution of parasite nucleotide biosynthesis.</p>
				</title>
				<aug>
					<au>
						<snm>Striepen</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Pruijssers</snm>
						<fnm>AJP</fnm>
					</au>
					<au>
						<snm>Huang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Gubbels</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Umejiego</snm>
						<fnm>NN</fnm>
					</au>
					<au>
						<snm>Hedstrom</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2004</pubdate>
				<volume>101</volume>
				<fpage>3154</fpage>
				<lpage>3159</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">365759</pubid>
						<pubid idtype="pmpid" link="fulltext">14973196</pubid>
						<pubid idtype="doi">10.1073/pnas.0304686101</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B26">
				<title>
					<p>Genome sequence of the human malaria parasite <it>Plasmodium falciparum</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Gardner</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Hall</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Fung</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Berriman</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Hyman</snm>
						<fnm>RW</fnm>
					</au>
					<au>
						<snm>Carlton</snm>
						<fnm>JM</fnm>
					</au>
					<au>
						<snm>Pain</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Nelson</snm>
						<fnm>KE</fnm>
					</au>
					<au>
						<snm>Bowman</snm>
						<fnm>S</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nature</source>
				<pubdate>2002</pubdate>
				<volume>419</volume>
				<fpage>498</fpage>
				<lpage>511</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/nature01097</pubid>
						<pubid idtype="pmpid" link="fulltext">12368864</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B27">
				<title>
					<p>A phylogenomic approach to microbial evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Sicheritz-Ponten</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Andersson</snm>
						<fnm>SG</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2001</pubdate>
				<volume>29</volume>
				<fpage>545</fpage>
				<lpage>552</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">29656</pubid>
						<pubid idtype="pmpid" link="fulltext">11139625</pubid>
						<pubid idtype="doi">10.1093/nar/29.2.545</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B28">
				<title>
					<p>ChloroP, a neural network-based method for predicting chloroplast transit peptides and their cleavage sites.</p>
				</title>
				<aug>
					<au>
						<snm>Emanuelsson</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Nielsen</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>von Heijne</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Protein Sci</source>
				<pubdate>1999</pubdate>
				<volume>8</volume>
				<fpage>978</fpage>
				<lpage>984</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10338008</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B29">
				<title>
					<p>The protozoan parasite <it>Toxoplasma gondii </it>expresses two functional plant-like glycolytic enzymes. Implications for evolutionary origin of apicomplexans.</p>
				</title>
				<aug>
					<au>
						<snm>Dzierszinski</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Popescu</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Toursel</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Slomianny</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Yahiaoui</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Tomavo</snm>
						<fnm>S</fnm>
					</au>
				</aug>
				<source>J Biol Chem</source>
				<pubdate>1999</pubdate>
				<volume>274</volume>
				<fpage>24888</fpage>
				<lpage>24895</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1074/jbc.274.35.24888</pubid>
						<pubid idtype="pmpid" link="fulltext">10455162</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B30">
				<title>
					<p>Unique phylogenetic relationships of glucokinase and glucosephosphate isomerase of the amitochondriate eukaryotes <it>Giardia intestinalis</it>, <it>Spironucleus barkhanus </it>and <it>Trichomonas vaginalis</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Henze</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Horner</snm>
						<fnm>DS</fnm>
					</au>
					<au>
						<snm>Suguri</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Moore</snm>
						<fnm>DV</fnm>
					</au>
					<au>
						<snm>Sanchez</snm>
						<fnm>LB</fnm>
					</au>
					<au>
						<snm>Muller</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Embley</snm>
						<fnm>TM</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>2001</pubdate>
				<volume>281</volume>
				<fpage>123</fpage>
				<lpage>131</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(01)00773-9</pubid>
						<pubid idtype="pmpid" link="fulltext">11750134</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B31">
				<title>
					<p>Glucose-6-phosphate isomerase from the hyperthermophilic archaeon <it>Methanococcus jannaschii</it>: characterization of the first archaeal member of the phosphoglucose isomerase superfamily.</p>
				</title>
				<aug>
					<au>
						<snm>Rudolph</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Hansen</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Schonheit</snm>
						<fnm>P</fnm>
					</au>
				</aug>
				<source>Arch Microbiol</source>
				<pubdate>2004</pubdate>
				<volume>181</volume>
				<fpage>82</fpage>
				<lpage>87</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00203-003-0626-4</pubid>
						<pubid idtype="pmpid" link="fulltext">14655001</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B32">
				<title>
					<p>Genetic complementation in apicomplexan parasites.</p>
				</title>
				<aug>
					<au>
						<snm>Striepen</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>White</snm>
						<fnm>MW</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Guerini</snm>
						<fnm>MN</fnm>
					</au>
					<au>
						<snm>Malik</snm>
						<fnm>SB</fnm>
					</au>
					<au>
						<snm>Logsdon</snm>
						<fnm>JM</fnm>
						<suf>Jr</suf>
					</au>
					<au>
						<snm>Liu</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Abrahamsen</snm>
						<fnm>MS</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>6304</fpage>
				<lpage>6309</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">122944</pubid>
						<pubid idtype="pmpid" link="fulltext">11959921</pubid>
						<pubid idtype="doi">10.1073/pnas.092525699</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B33">
				<title>
					<p>Much ado about bacteria-to-vertebrate lateral gene transfer.</p>
				</title>
				<aug>
					<au>
						<snm>Genereux</snm>
						<fnm>DP</fnm>
					</au>
					<au>
						<snm>Logsdon</snm>
						<fnm>JM</fnm>
						<suf>Jr</suf>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2003</pubdate>
				<volume>19</volume>
				<fpage>191</fpage>
				<lpage>195</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(03)00055-6</pubid>
						<pubid idtype="pmpid" link="fulltext">12683971</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B34">
				<title>
					<p>Phylogenetic analyses do not support horizontal gene transfers from bacteria to vertebrates.</p>
				</title>
				<aug>
					<au>
						<snm>Stanhope</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Lupas</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Italia</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Koretke</snm>
						<fnm>KK</fnm>
					</au>
					<au>
						<snm>Volker</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>JR</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2001</pubdate>
				<volume>411</volume>
				<fpage>940</fpage>
				<lpage>944</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35082058</pubid>
						<pubid idtype="pmpid" link="fulltext">11418856</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B35">
				<title>
					<p>How good are deep phylogenetic trees?</p>
				</title>
				<aug>
					<au>
						<snm>Philippe</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Laurent</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Curr Opin Genet Dev</source>
				<pubdate>1998</pubdate>
				<volume>8</volume>
				<fpage>616</fpage>
				<lpage>623</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0959-437X(98)80028-2</pubid>
						<pubid idtype="pmpid">9914208</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B36">
				<title>
					<p>Horizontal gene transfer: a critical view.</p>
				</title>
				<aug>
					<au>
						<snm>Kurland</snm>
						<fnm>CG</fnm>
					</au>
					<au>
						<snm>Canback</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Berg</snm>
						<fnm>OG</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>9658</fpage>
				<lpage>9662</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">187805</pubid>
						<pubid idtype="pmpid" link="fulltext">12902542</pubid>
						<pubid idtype="doi">10.1073/pnas.1632870100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B37">
				<title>
					<p>PlasmoDB: the <it>Plasmodium </it>genome resource. A database integrating experimental and computational data.</p>
				</title>
				<aug>
					<au>
						<snm>Bahl</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Brunk</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Crabtree</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Fraunholz</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Gajria</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Grant</snm>
						<fnm>GR</fnm>
					</au>
					<au>
						<snm>Ginsburg</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Gupta</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Labo</snm>
						<fnm>P</fnm>
					</au>
					<etal/>
				</aug>
				<source>Nucleic Acids Res </source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>212</fpage>
				<lpage>215</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">165528</pubid>
						<pubid idtype="pmpid" link="fulltext">12519984</pubid>
						<pubid idtype="doi">10.1093/nar/gkg081</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B38">
				<title>
					<p>Plant-like traits associated with metabolism of <it>Trypanosoma </it>parasites.</p>
				</title>
				<aug>
					<au>
						<snm>Hannaert</snm>
						<fnm>V</fnm>
					</au>
					<au>
						<snm>Saavedra</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Duffieux</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Szikora</snm>
						<fnm>JP</fnm>
					</au>
					<au>
						<snm>Rigden</snm>
						<fnm>DJ</fnm>
					</au>
					<au>
						<snm>Michels</snm>
						<fnm>PA</fnm>
					</au>
					<au>
						<snm>Opperdoes</snm>
						<fnm>FR</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>1067</fpage>
				<lpage>1071</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">298727</pubid>
						<pubid idtype="pmpid" link="fulltext">12552132</pubid>
						<pubid idtype="doi">10.1073/pnas.0335769100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B39">
				<title>
					<p>Lateral transfer at the gene and subgenic levels in the evolution of eukaryotic enolase.</p>
				</title>
				<aug>
					<au>
						<snm>Keeling</snm>
						<fnm>PJ</fnm>
					</au>
					<au>
						<snm>Palmer</snm>
						<fnm>JD</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2001</pubdate>
				<volume>98</volume>
				<fpage>10745</fpage>
				<lpage>10750</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">58546</pubid>
						<pubid idtype="pmpid" link="fulltext">11526220</pubid>
						<pubid idtype="doi">10.1073/pnas.191337098</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B40">
				<title>
					<p>Evolution of <it>Cryptosporidium parvum </it>lactate dehydrogenase from malate dehydrogenase by a very recent event of gene duplication.</p>
				</title>
				<aug>
					<au>
						<snm>Madern</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Cai</snm>
						<fnm>X</fnm>
					</au>
					<au>
						<snm>Abrahamsen</snm>
						<fnm>MS</fnm>
					</au>
					<au>
						<snm>Zhu</snm>
						<fnm>G</fnm>
					</au>
				</aug>
				<source>Mol Biol Evol</source>
				<pubdate>2004</pubdate>
				<volume>21</volume>
				<fpage>489</fpage>
				<lpage>497</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/molbev/msh042</pubid>
						<pubid idtype="pmpid" link="fulltext">14694073</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B41">
				<title>
					<p>The deep roots of eukaryotes.</p>
				</title>
				<aug>
					<au>
						<snm>Baldauf</snm>
						<fnm>SL</fnm>
					</au>
				</aug>
				<source>Science</source>
				<pubdate>2003</pubdate>
				<volume>300</volume>
				<fpage>1703</fpage>
				<lpage>1706</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.1085544</pubid>
						<pubid idtype="pmpid" link="fulltext">12805537</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B42">
				<title>
					<p>Mitochondrial-type iron-sulfur cluster biosynthesis genes (IscS and IscU) in the apicomplexan <it>Cryptosporidium parvum</it>.</p>
				</title>
				<aug>
					<au>
						<snm>LaGier</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Tachezy</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Stejskal</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Kutisova</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Keithly</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>Microbiology</source>
				<pubdate>2003</pubdate>
				<volume>149</volume>
				<fpage>3519</fpage>
				<lpage>3530</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1099/mic.0.26365-0</pubid>
						<pubid idtype="pmpid" link="fulltext">14663084</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B43">
				<title>
					<p>Preliminary evidence for a mitochondrion in <it>Cryptosporidium parvum </it>: phylogenetic and therapeutic implications.</p>
				</title>
				<aug>
					<au>
						<snm>Riordan</snm>
						<fnm>CE</fnm>
					</au>
					<au>
						<snm>Langreth</snm>
						<fnm>SG</fnm>
					</au>
					<au>
						<snm>Sanchez</snm>
						<fnm>LB</fnm>
					</au>
					<au>
						<snm>Kayser</snm>
						<fnm>O</fnm>
					</au>
					<au>
						<snm>Keithly</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>J Eukaryot Microbiol</source>
				<pubdate>1999</pubdate>
				<volume>46</volume>
				<fpage>52S</fpage>
				<lpage>55S</lpage>
				<xrefbib>
					<pubid idtype="pmpid">10519246</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B44">
				<title>
					<p><it>Cryptosporidium parvum </it>Cpn60 targets a relict organelle.</p>
				</title>
				<aug>
					<au>
						<snm>Riordan</snm>
						<fnm>CE</fnm>
					</au>
					<au>
						<snm>Ault</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Langreth</snm>
						<fnm>SG</fnm>
					</au>
					<au>
						<snm>Keithly</snm>
						<fnm>JS</fnm>
					</au>
				</aug>
				<source>Curr Genet</source>
				<pubdate>2003</pubdate>
				<volume>44</volume>
				<fpage>138</fpage>
				<lpage>147</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1007/s00294-003-0432-1</pubid>
						<pubid idtype="pmpid" link="fulltext">12928750</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B45">
				<title>
					<p>Gene transfer from organelles to the nucleus: frequent and in big chunks.</p>
				</title>
				<aug>
					<au>
						<snm>Martin</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2003</pubdate>
				<volume>100</volume>
				<fpage>8612</fpage>
				<lpage>8614</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">166356</pubid>
						<pubid idtype="pmpid" link="fulltext">12861078</pubid>
						<pubid idtype="doi">10.1073/pnas.1633606100</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B46">
				<title>
					<p>Lateral gene transfer and the nature of bacterial innovation.</p>
				</title>
				<aug>
					<au>
						<snm>Ochman</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Lawrence</snm>
						<fnm>JG</fnm>
					</au>
					<au>
						<snm>Groisman</snm>
						<fnm>EA</fnm>
					</au>
				</aug>
				<source>Nature</source>
				<pubdate>2000</pubdate>
				<volume>405</volume>
				<fpage>299</fpage>
				<lpage>304</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1038/35012500</pubid>
						<pubid idtype="pmpid" link="fulltext">10830951</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B47">
				<title>
					<p>Evolutionary analysis of <it>Arabidopsis</it>, cyanobacterial, and chloroplast genomes reveals plastid phylogeny and thousands of cyanobacterial genes in the nucleus.</p>
				</title>
				<aug>
					<au>
						<snm>Martin</snm>
						<fnm>W</fnm>
					</au>
					<au>
						<snm>Rujan</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Richly</snm>
						<fnm>E</fnm>
					</au>
					<au>
						<snm>Hansen</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Cornelsen</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Lins</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Leister</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Stoebe</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Hasegawa</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Penny</snm>
						<fnm>D</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>2002</pubdate>
				<volume>99</volume>
				<fpage>12246</fpage>
				<lpage>12251</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">129430</pubid>
						<pubid idtype="pmpid" link="fulltext">12218172</pubid>
						<pubid idtype="doi">10.1073/pnas.182432999</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B48">
				<title>
					<p>Phylogenetic analyses of diplomonad genes reveal frequent lateral gene transfers affecting eukaryotes.</p>
				</title>
				<aug>
					<au>
						<snm>Andersson</snm>
						<fnm>JO</fnm>
					</au>
					<au>
						<snm>Sjogren</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Davis</snm>
						<fnm>LA</fnm>
					</au>
					<au>
						<snm>Embley</snm>
						<fnm>TM</fnm>
					</au>
					<au>
						<snm>Roger</snm>
						<fnm>AJ</fnm>
					</au>
				</aug>
				<source>Curr Biol</source>
				<pubdate>2003</pubdate>
				<volume>13</volume>
				<fpage>94</fpage>
				<lpage>104</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0960-9822(03)00003-4</pubid>
						<pubid idtype="pmpid" link="fulltext">12546782</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B49">
				<title>
					<p>Evidence for lateral transfer of genes encoding ferredoxins, nitroreductases, NADH oxidase, and alcohol dehydrogenase 3 from anaerobic prokaryotes to <it>Giardia lamblia </it>and <it>Entamoeba histolytica</it>.</p>
				</title>
				<aug>
					<au>
						<snm>Nixon</snm>
						<fnm>JE</fnm>
					</au>
					<au>
						<snm>Wang</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Field</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Morrison</snm>
						<fnm>HG</fnm>
					</au>
					<au>
						<snm>McArthur</snm>
						<fnm>AG</fnm>
					</au>
					<au>
						<snm>Sogin</snm>
						<fnm>ML</fnm>
					</au>
					<au>
						<snm>Loftus</snm>
						<fnm>BJ</fnm>
					</au>
					<au>
						<snm>Samuelson</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>Eukaryot Cell </source>
				<pubdate>2002</pubdate>
				<volume>1</volume>
				<fpage>181</fpage>
				<lpage>190</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">118039</pubid>
						<pubid idtype="pmpid" link="fulltext">12455953</pubid>
						<pubid idtype="doi">10.1128/EC.1.2.181-190.2002</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B50">
				<title>
					<p>A nuclear gene of eubacterial origin in <it>Euglena gracilis </it>reflects cryptic endosymbioses during protist evolution.</p>
				</title>
				<aug>
					<au>
						<snm>Henze</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Badr</snm>
						<fnm>A</fnm>
					</au>
					<au>
						<snm>Wettern</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Cerff</snm>
						<fnm>R</fnm>
					</au>
					<au>
						<snm>Martin</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Proc Natl Acad Sci USA</source>
				<pubdate>1995</pubdate>
				<volume>92</volume>
				<fpage>9122</fpage>
				<lpage>9126</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">40936</pubid>
						<pubid idtype="pmpid" link="fulltext">7568085</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B51">
				<title>
					<p>A first glimpse into the pattern and scale of gene transfer in Apicomplexa.</p>
				</title>
				<aug>
					<au>
						<snm>Huang</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Mullapudi</snm>
						<fnm>N</fnm>
					</au>
					<au>
						<snm>Sicheritz-Ponten</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
				</aug>
				<source>Int J Parasitol</source>
				<pubdate>2004</pubdate>
				<volume>34</volume>
				<fpage>265</fpage>
				<lpage>274</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/j.ijpara.2003.11.025</pubid>
						<pubid idtype="pmpid" link="fulltext">15003488</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B52">
				<title>
					<p>Inhibitors of the nonmevalonate pathway of isoprenoid biosynthesis as antimalarial drugs.</p>
				</title>
				<aug>
					<au>
						<snm>Jomaa</snm>
						<fnm>H</fnm>
					</au>
					<au>
						<snm>Wiesner</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Sanderbrand</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Altincicek</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Weidemeyer</snm>
						<fnm>C</fnm>
					</au>
					<au>
						<snm>Hintz</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Turbachova</snm>
						<fnm>I</fnm>
					</au>
					<au>
						<snm>Eberl</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Zeidler</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Lichtenthaler</snm>
						<fnm>HK</fnm>
					</au>
					<etal/>
				</aug>
				<source>Science</source>
				<pubdate>1999</pubdate>
				<volume>285</volume>
				<fpage>1573</fpage>
				<lpage>1576</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1126/science.285.5433.1573</pubid>
						<pubid idtype="pmpid" link="fulltext">10477522</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B53">
				<title>
					<p>CryptoDB: The <it>Cryptosporidium </it>genome resource.</p>
				</title>
				<aug>
					<au>
						<snm>Puiu</snm>
						<fnm>D</fnm>
					</au>
					<au>
						<snm>Enomoto</snm>
						<fnm>S</fnm>
					</au>
					<au>
						<snm>Buck</snm>
						<fnm>GA</fnm>
					</au>
					<au>
						<snm>Abrahamsen</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2004</pubdate>
				<volume>32</volume>
				<fpage>D329</fpage>
				<lpage>D331</lpage>
				<xrefbib>
					<pubid idtype="doi">10.1093/nar/gkh050</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B54">
				<title>
					<p>Interpolated Markov models for eukaryotic gene finding.</p>
				</title>
				<aug>
					<au>
						<snm>Salzberg</snm>
						<fnm>SL</fnm>
					</au>
					<au>
						<snm>Pertea</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Delcher</snm>
						<fnm>AL</fnm>
					</au>
					<au>
						<snm>Gardner</snm>
						<fnm>MJ</fnm>
					</au>
					<au>
						<snm>Tettelin</snm>
						<fnm>H</fnm>
					</au>
				</aug>
				<source>Genomics</source>
				<pubdate>1999</pubdate>
				<volume>59</volume>
				<fpage>24</fpage>
				<lpage>31</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1006/geno.1999.5854</pubid>
						<pubid idtype="pmpid" link="fulltext">10395796</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B55">
				<title>
					<p>ToxoDB: accessing the <it>Toxoplasma gondii </it>genome.</p>
				</title>
				<aug>
					<au>
						<snm>Kissinger</snm>
						<fnm>JC</fnm>
					</au>
					<au>
						<snm>Gajria</snm>
						<fnm>B</fnm>
					</au>
					<au>
						<snm>Li</snm>
						<fnm>L</fnm>
					</au>
					<au>
						<snm>Paulsen</snm>
						<fnm>IT</fnm>
					</au>
					<au>
						<snm>Roos</snm>
						<fnm>DS</fnm>
					</au>
				</aug>
				<source>Nucleic Acids Res</source>
				<pubdate>2003</pubdate>
				<volume>31</volume>
				<fpage>234</fpage>
				<lpage>236</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="pmcid">165519</pubid>
						<pubid idtype="pmpid" link="fulltext">12519989</pubid>
						<pubid idtype="doi">10.1093/nar/gkg072</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B56">
				<title>
					<p>ToxoDB: the <it>Toxoplasma </it>genome resource</p>
				</title>
				<url>http://toxodb.org</url>
			</bibl>
			<bibl id="B57">
				<title>
					<p>GenBank FTP site</p>
				</title>
				<url>ftp://ftp.ncbi.nih.gov</url>
			</bibl>
			<bibl id="B58">
				<title>
					<p>The Sanger Institute: <it>Entamoeba histolytica</it></p>
				</title>
				<url>http://www.sanger.ac.uk/Projects/E_histolytica</url>
			</bibl>
			<bibl id="B59">
				<title>
					<p>GeneDB</p>
				</title>
				<url>http://www.geneDB.org</url>
			</bibl>
			<bibl id="B60">
				<title>
					<p>TcruziDB</p>
				</title>
				<url>http://TcruziDB.org</url>
			</bibl>
			<bibl id="B61">
				<title>
					<p>TIGR <it>Tetrahymena thermophila </it>genome project</p>
				</title>
				<url>http://www.tigr.org/tdb/e2k1/ttg</url>
			</bibl>
			<bibl id="B62">
				<title>
					<p>Sanger Institute Pathogen Sequencing Unit</p>
				</title>
				<url>http://www.sanger.ac.uk/Projects/Pathogens</url>
			</bibl>
			<bibl id="B63">
				<title>
					<p><it>Tetrahymena thermophila </it>sequence source</p>
				</title>
				<url>ftp://ftp.tigr.org/pub/data/Eukaryotic_Projects/t_thermophila</url>
			</bibl>
			<bibl id="B64">
				<aug>
					<au>
						<snm>Swofford</snm>
						<fnm>DL</fnm>
					</au>
				</aug>
				<source>PAUP* Phylogenetic Analysis Using Parsimony (*and other methods)</source>
				<publisher>Sunderland: Sinauer</publisher>
				<edition>4.0b10</edition>
				<pubdate>1998</pubdate>
			</bibl>
			<bibl id="B65">
				<title>
					<p>How many genes in <it>Arabidopsis </it>come from cyanobacteria? An estimate from 386 protein phylogenies.</p>
				</title>
				<aug>
					<au>
						<snm>Rujan</snm>
						<fnm>T</fnm>
					</au>
					<au>
						<snm>Martin</snm>
						<fnm>W</fnm>
					</au>
				</aug>
				<source>Trends Genet</source>
				<pubdate>2001</pubdate>
				<volume>17</volume>
				<fpage>113</fpage>
				<lpage>120</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0168-9525(00)02209-5</pubid>
						<pubid idtype="pmpid" link="fulltext">11226586</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B66">
				<title>
					<p>Multiple sequence alignment with ClustalX.</p>
				</title>
				<aug>
					<au>
						<snm>Jeanmougin</snm>
						<fnm>F</fnm>
					</au>
					<au>
						<snm>Thompson</snm>
						<fnm>JD</fnm>
					</au>
					<au>
						<snm>Gouy</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>Higgins</snm>
						<fnm>DG</fnm>
					</au>
					<au>
						<snm>Gibson</snm>
						<fnm>TJ</fnm>
					</au>
				</aug>
				<source>Trends Biochem Sci</source>
				<pubdate>1998</pubdate>
				<volume>23</volume>
				<fpage>403</fpage>
				<lpage>405</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0968-0004(98)01285-7</pubid>
						<pubid idtype="pmpid" link="fulltext">9810230</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B67">
				<title>
					<p>TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing.</p>
				</title>
				<aug>
					<au>
						<snm>Schmidt</snm>
						<fnm>HA</fnm>
					</au>
					<au>
						<snm>Strimmer</snm>
						<fnm>K</fnm>
					</au>
					<au>
						<snm>Vingron</snm>
						<fnm>M</fnm>
					</au>
					<au>
						<snm>von Haeseler</snm>
						<fnm>A</fnm>
					</au>
				</aug>
				<source>Bioinformatics</source>
				<pubdate>2002</pubdate>
				<volume>18</volume>
				<fpage>502</fpage>
				<lpage>504</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1093/bioinformatics/18.3.502</pubid>
						<pubid idtype="pmpid" link="fulltext">11934758</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
			<bibl id="B68">
				<aug>
					<au>
						<snm>Felsenstein</snm>
						<fnm>J</fnm>
					</au>
				</aug>
				<source>PHYLIP: Phylogenetic Inference Package</source>
				<publisher>Seattle: Department of Genetics, University of Washington</publisher>
				<edition>3.6a</edition>
				<pubdate>2002</pubdate>
			</bibl>
			<bibl id="B69">
				<title>
					<p>The rapid generation of mutation data matrices from protein sequences.</p>
				</title>
				<aug>
					<au>
						<snm>Jones</snm>
						<fnm>DT</fnm>
					</au>
					<au>
						<snm>Taylor</snm>
						<fnm>WR</fnm>
					</au>
					<au>
						<snm>Thornton</snm>
						<fnm>JM</fnm>
					</au>
				</aug>
				<source>Comput Appl Biosci </source>
				<pubdate>1992</pubdate>
				<volume>8</volume>
				<fpage>275</fpage>
				<lpage>282</lpage>
				<xrefbib>
					<pubid idtype="pmpid">1633570</pubid>
				</xrefbib>
			</bibl>
			<bibl id="B70">
				<title>
					<p>Treeview X</p>
				</title>
				<url>http://darwin.zoology.gla.ac.uk/~rpage/treeviewx/</url>
			</bibl>
			<bibl id="B71">
				<aug>
					<au>
						<snm>Sambrook</snm>
						<fnm>J</fnm>
					</au>
					<au>
						<snm>Fritsch</snm>
						<fnm>EF</fnm>
					</au>
					<au>
						<snm>Maniatis</snm>
						<fnm>T</fnm>
					</au>
				</aug>
				<source>Molecular Cloning: A Laboratory Manual</source>
				<publisher>New York: Cold Spring Harbor Laboratory Press</publisher>
				<pubdate>1989</pubdate>
			</bibl>
			<bibl id="B72">
				<title>
					<p>Identification and cloning of a developmentally regulated <it>Cryptosporidium parvum </it>gene by differential mRNA display PCR.</p>
				</title>
				<aug>
					<au>
						<snm>Schroeder</snm>
						<fnm>AA</fnm>
					</au>
					<au>
						<snm>Brown</snm>
						<fnm>AM</fnm>
					</au>
					<au>
						<snm>Abrahamsen</snm>
						<fnm>MS</fnm>
					</au>
				</aug>
				<source>Gene</source>
				<pubdate>1998</pubdate>
				<volume>216</volume>
				<fpage>327</fpage>
				<lpage>334</lpage>
				<xrefbib>
					<pubidlist>
						<pubid idtype="doi">10.1016/S0378-1119(98)00360-6</pubid>
						<pubid idtype="pmpid" link="fulltext">9729442</pubid>
					</pubidlist>
				</xrefbib>
			</bibl>
		</refgrp>
	</bm>
</art>
