<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
	<ui>1471-2164-13-617</ui>
	<ji>1471-2164</ji>
	<fm>
		<dochead>Research article</dochead>
		<bibl>
			<title>
				<p>The <it>Schistosoma mansoni</it> phylome: using evolutionary genomics to gain insight into a parasite&#8217;s biology</p>
			</title>
			<aug>
				<au id="A1"><snm>Silva</snm><mnm>Lopes</mnm><fnm>Larissa</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I3"/><email>larissa.silva@cebio.org</email></au>
				<au id="A2"><snm>Marcet-Houben</snm><fnm>Marina</fnm><insr iid="I4"/><insr iid="I5"/><email>marina.marcet-houben@crg.cat</email></au>
				<au id="A3"><snm>Nahum</snm><mnm>Alves</mnm><fnm>Laila</fnm><insr iid="I1"/><insr iid="I2"/><insr iid="I6"/><email>laila@nahum.com.br</email></au>
				<au id="A4"><snm>Zerlotini</snm><fnm>Adhemar</fnm><insr iid="I2"/><insr iid="I7"/><email>azneto@gmail.com</email></au>
				<au id="A5"><snm>Gabald&#243;n</snm><fnm>Toni</fnm><insr iid="I4"/><insr iid="I5"/><email>toni.gabaldon@crg.es</email></au>
				<au id="A6" ca="yes"><snm>Oliveira</snm><fnm>Guilherme</fnm><insr iid="I1"/><insr iid="I2"/><email>oliveira@cpqrr.fiocruz.br</email></au>
			</aug>
			<insg>
				<ins id="I1"><p>Grupo de Gen&#244;mica e Biologia Computacional, Centro de Pesquisas Ren&#233; Rachou. Instituto Nacional de Ci&#234;ncia e Tecnologia em Doen&#231;as Tropicais. Funda&#231;&#227;o Oswaldo Cruz - FIOCRUZ, Belo Horizonte, MG, 30190-002, Brazil</p></ins>
				<ins id="I2"><p>Centro de Excel&#234;ncia em Bioinform&#225;tica, Funda&#231;&#227;o Oswaldo Cruz &#8211; FIOCRUZ, Belo Horizonte, MG, Brazil</p></ins>
				<ins id="I3"><p>Instituto de Ci&#234;ncias Biol&#243;gicas, Universidade Federal de Minas Gerais &#8211; UFMG, Belo Horizonte, MG, Brazil</p></ins>
				<ins id="I4"><p>Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Dr. Aiguader, 88, 08003, Barcelona, Spain</p></ins>
				<ins id="I5"><p>Universitat Pompeu Fabra (UPF), 08003, Barcelona, Spain</p></ins>
				<ins id="I6"><p>Faculdade Inf&#243;rium de Tecnologia, Belo Horizonte, MG, 30130-180, Brazil</p></ins>
				<ins id="I7"><p>Laborat&#243;rio Multiusu&#225;rio de Bioinform&#225;tica, Embrapa Inform&#225;tica Agropecu&#225;ria, Campinas, S&#227;o Paulo, Brazil</p></ins>
			</insg>
			<source>BMC Genomics</source>
			<section><title><p>Comparative and evolutionary genomics</p></title></section><issn>1471-2164</issn>
			<pubdate>2012</pubdate>
			<volume>13</volume>
			<issue>1</issue>
			<fpage>617</fpage>
			<url>http://www.biomedcentral.com/1471-2164/13/617</url>
			<xrefbib><pubidlist><pubid idtype="doi">10.1186/1471-2164-13-617</pubid><pubid idtype="pmpid">23148687</pubid></pubidlist></xrefbib>
		</bibl>
		<history><rec><date><day>30</day><month>9</month><year>2012</year></date></rec><acc><date><day>22</day><month>10</month><year>2012</year></date></acc><pub><date><day>13</day><month>11</month><year>2012</year></date></pub></history>
		<cpyrt><year>2012</year><collab>Silva et al.; licensee BioMed Central Ltd.</collab><note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note></cpyrt>
		<kwdg>
			<kwd>Phylogenomics</kwd>
			<kwd>Maximum likelihood analysis</kwd>
			<kwd>Homology prediction</kwd>
			<kwd>Functional annotation</kwd>
			<kwd>Paralogous families</kwd>
			<kwd>Parasite genomics</kwd>
			<kwd>Schistosomiasis</kwd>
		</kwdg>
		<abs>
			<sec>
				<st>
					<p>Abstract</p>
				</st>
				<sec>
					<st>
						<p>Background</p>
					</st><p>
						<it>Schistosoma mansoni</it> is one of the causative agents of schistosomiasis, a neglected tropical disease that affects about 237 million people worldwide. Despite recent efforts, we still lack a general understanding of the relevant host-parasite interactions, and the possible treatments are limited by the emergence of resistant strains and the absence of a vaccine. The <it>S. mansoni</it> genome was completely sequenced and still under continuous annotation. Nevertheless, more than 45% of the encoded proteins remain without experimental characterization or even functional prediction. To improve our knowledge regarding the biology of this parasite, we conducted a proteome-wide evolutionary analysis to provide a broad view of the <it>S. mansoni</it>&#8217;s proteome evolution and to improve its functional annotation.</p>
				</sec>
				<sec>
					<st>
						<p>Results</p>
					</st><p>Using a phylogenomic approach, we reconstructed the <it>S. mansoni</it> phylome, which comprises the evolutionary histories of all parasite proteins and their homologs across 12 other organisms. The analysis of a total of 7,964 phylogenies allowed a deeper understanding of genomic complexity and evolutionary adaptations to a parasitic lifestyle. In particular, the identification of lineage-specific gene duplications pointed to the diversification of several protein families that are relevant for host-parasite interaction, including proteases, tetraspanins, fucosyltransferases, venom allergen-like proteins, and tegumental-allergen-like proteins. In addition to the evolutionary knowledge, the phylome data enabled us to automatically re-annotate 3,451 proteins through a phylogenetic-based approach rather than solely sequence similarity searches. To allow further exploitation of this valuable data, all information has been made available at PhylomeDB (<url>http://www.phylomedb.org</url>).</p>
				</sec>
				<sec>
					<st>
						<p>Conclusions</p>
					</st><p>In this study, we used an evolutionary approach to assess <it>S. mansoni</it> parasite biology, improve genome/proteome functional annotation, and provide insights into host-parasite interactions. Taking advantage of a proteome-wide perspective rather than focusing on individual proteins, we identified that this parasite has experienced specific gene duplication events, particularly affecting genes that are potentially related to the parasitic lifestyle. These innovations may be related to the mechanisms that protect <it>S. mansoni</it> against host immune responses being important adaptations for the parasite survival in a potentially hostile environment. Continuing this work, a comparative analysis involving genomic, transcriptomic, and proteomic data from other helminth parasites, other parasites, and vectors will supply more information regarding parasite&#8217;s biology as well as host-parasite interactions.</p>
				</sec>
			</sec>
		</abs>
	</fm>
	<bdy>
		<sec>
			<st>
				<p>Background</p>
			</st><p>
				<it>Schistosoma mansoni, S. haematobium, and S. japonicum</it> (Platyhelminthes: Trematoda) are the main causative agents of human schistosomiasis, a neglected tropical disease that is endemic in 77 countries where more than 237 million people require preventive chemotherapy and other 779 million live in areas of risk of infection <abbrgrp>
					<abbr bid="B1">1</abbr>
					<abbr bid="B2">2</abbr>
					<abbr bid="B3">3</abbr>
					<abbr bid="B4">4</abbr>
				</abbrgrp>. The genomes of these parasites have been recently published providing insights into parasite&#8217;s development, infection, and host-parasite interactions <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B6">6</abbr>
					<abbr bid="B7">7</abbr>
				</abbrgrp>. However, even with the progress made over the last years, schistosomiasis control depends primarily on the treatment of infected patients with Praziquantel<sup>&#174;</sup>, the only drug available for mass treatment (e.g. <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B8">8</abbr>
					<abbr bid="B9">9</abbr>
				</abbrgrp>). Drawbacks of this drug are that it does not prevent against reinfection and its effectiveness varies depending on several factors such as the parasite&#8217;s gender, developmental stage, and the time of infection. Furthermore, Praziquantel<sup>&#174;</sup>-resistant parasites have been found both in the laboratory and in the field, thus increasing the urgent need for new effective drugs and vaccines <abbrgrp>
					<abbr bid="B10">10</abbr>
					<abbr bid="B11">11</abbr>
					<abbr bid="B12">12</abbr>
					<abbr bid="B13">13</abbr>
				</abbrgrp>.</p><p>
				<it>Schistosoma mansoni</it> infects 7.1 million people in America, 95% of which in Brazil, and 54 million people in Sub-Saharan Africa causing intestinal and hepatosplenic schistosomiasis <abbrgrp>
					<abbr bid="B14">14</abbr>
					<abbr bid="B15">15</abbr>
				</abbrgrp>. The <it>S. mansoni</it> genome sequencing data was published in 2009 and a new version was recently released <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B16">16</abbr>
				</abbrgrp>. The improved genome has 364.5 megabases (Mb) assembled in 885 scaffolds, half of which are represented in scaffolds greater than 2 kilobases <abbrgrp>
					<abbr bid="B16">16</abbr>
				</abbrgrp>. A total of 10,852 genes were identified, encoding over 11,000 proteins, 45% of which remain without known or predicted function <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B16">16</abbr>
					<abbr bid="B17">17</abbr>
				</abbrgrp>. 81% of the genome was assembled onto the parasite&#8217;s chromosomes, providing a partial genetic map <abbrgrp>
					<abbr bid="B16">16</abbr>
					<abbr bid="B18">18</abbr>
				</abbrgrp>. The availability of genomic data offers new opportunities for innovation in the control of schistosomiasis, by providing information that allows for the identification of novel drug targets and vaccine candidates through a system-wide perspective <abbrgrp>
					<abbr bid="B5">5</abbr>
					<abbr bid="B19">19</abbr>
					<abbr bid="B20">20</abbr>
				</abbrgrp>.</p><p>Making accurate functional predictions for genes or proteins is a key step in every genome sequencing project. However, on average, 30 to 50% of the predicted proteome remains uncharacterized while for the remaining set only general predictions are made. To deal with the gap between the rapid progress in genome sequencing and experimental characterization of genes and gene products, computational methods have been developed <abbrgrp>
					<abbr bid="B21">21</abbr>
					<abbr bid="B22">22</abbr>
					<abbr bid="B23">23</abbr>
				</abbrgrp>. Two main approaches are generally used for functional prediction of genes and their products: one based on sequence similarity searches and another on phylogenetic analysis.</p><p>Owing to the computational cost and complexity of large scale phylogenetic analysis, the accurate identification of orthology relationships remains a challenge in comparative genomics and most of the orthology prediction methods rely on similarity-based search (e.g. BLAST <abbrgrp>
					<abbr bid="B24">24</abbr>
				</abbrgrp>, OrthoMCL <abbrgrp>
					<abbr bid="B25">25</abbr>
				</abbrgrp>, InParanoid <abbrgrp>
					<abbr bid="B26">26</abbr>
				</abbrgrp>). In these cases, functional prediction is obtained based on the transfer of information from the most similar sequences in the database to the gene or protein of interest (e.g. <abbrgrp>
					<abbr bid="B24">24</abbr>
				</abbrgrp>). However, several limitations are associated with this method, mainly the lack of a straightforward relationship between sequence similarity and protein function <abbrgrp>
					<abbr bid="B21">21</abbr>
					<abbr bid="B27">27</abbr>
					<abbr bid="B28">28</abbr>
					<abbr bid="B29">29</abbr>
				</abbrgrp>. Since this approach is fast, simple, and can be automated to analyze thousands of genes, it has been used frequently to predict functional products encoded by newly sequenced genomes. Over the last years this practice has generated systematic errors, the extent of which is not completely known <abbrgrp>
					<abbr bid="B22">22</abbr>
					<abbr bid="B27">27</abbr>
					<abbr bid="B28">28</abbr>
					<abbr bid="B29">29</abbr>
					<abbr bid="B30">30</abbr>
					<abbr bid="B31">31</abbr>
					<abbr bid="B32">32</abbr>
				</abbrgrp>.</p><p>In an attempt to improve the accuracy of functional prediction at a large scale, phylogenetic methods may be applied <abbrgrp>
					<abbr bid="B33">33</abbr>
					<abbr bid="B34">34</abbr>
				</abbrgrp>. The advantage of such methods is that they focus on the evolutionary history of genes rather than merely on their sequence similarity <abbrgrp>
					<abbr bid="B30">30</abbr>
					<abbr bid="B35">35</abbr>
					<abbr bid="B36">36</abbr>
				</abbrgrp>. Ideally, functional transfer in the genomic context or for specific genes/proteins should be performed only when there is any experimental evidence for those used as source of information. However, in databases as UniProt, only 3% of proteins have experimental support for their annotations <abbrgrp>
					<abbr bid="B28">28</abbr>
				</abbrgrp>. To deal with the absence of experimental support for most part of the available proteomes, transfer of functional annotation aiming to provide hints regarding the gene/protein function needs to follow strict requirements to avoid, as much as possible, misclassifications. In the last decade, the publication of a large number of genomic and proteomic data and the development of faster and powerful computers, new software, and automated pipelines have allowed for the reconstruction of phylogenetic trees of the complete set of proteins encoded in a genome &#8211; the so called phylome <abbrgrp>
					<abbr bid="B37">37</abbr>
				</abbrgrp>.</p><p>The phylome data may give a broad view of the evolution of an organism, since it comprises the phylogenies of all proteins encoded in its genome <abbrgrp>
					<abbr bid="B37">37</abbr>
				</abbrgrp>. Most notably, a phylome can be used to detect specific evolutionary scenarios, to quantify the fraction of individual phylogenies whose topologies are consistent with a given hypothesis, and to improve functional annotation of proteins and biological systems <abbrgrp>
					<abbr bid="B38">38</abbr>
					<abbr bid="B39">39</abbr>
				</abbrgrp>. Furthermore, comparing genomes or proteomes through an evolutionary perspective may provide insights to the understanding of the metabolism, physiology, pathogenicity, and the adaptation to a particular life style of organisms. In this context, the availability of <it>S. mansoni</it> genomic data provides the opportunity to study this parasite from a genome-wide perspective rather than from individual gene or protein analyses.</p><p>Taking advantage of the benefits provided by a genome-wide approach combined with an evolutionary perspective, we reconstructed the <it>S. mansoni</it> phylome with the goals of i) gaining insight into lineage-specific evolutionary events potentially related to the parasitic lifestyle, and ii) improving the functional annotation of the genome/proteome.</p><p>Phylogenetic techniques used in the present work included multiple sequence alignment <abbrgrp>
					<abbr bid="B40">40</abbr>
					<abbr bid="B41">41</abbr>
					<abbr bid="B42">42</abbr>
					<abbr bid="B43">43</abbr>
				</abbrgrp> alignment trimming <abbrgrp>
					<abbr bid="B44">44</abbr>
				</abbrgrp>, neighbor-joining tree building <abbrgrp>
					<abbr bid="B45">45</abbr>
				</abbrgrp>, evolutionary model testing, and maximum likelihood analysis <abbrgrp>
					<abbr bid="B46">46</abbr>
				</abbrgrp>. The resulting phylome data contains 7,964 protein phylogenetic trees, covering the analysis of 11,763 <it>S. mansoni</it> proteins and their homologs in 12 other organisms, out of which we identified evolutionary events and homology relationships. The results provided useful information about the parasite&#8217;s genome evolution such as the identification of gene duplication events and expanded protein families such as proteases, tetraspanins, fucosyltransferases, venom allergen-like proteins (also called as SmVAL or SCP-like), tegumental-allergen-like proteins (SmTAL), among others. Altogether, the results obtained are likely to pave the way for a better understanding of the parasite&#8217;s biology including host-parasite interactions. This, in turn will accelerate the search for new drugs and vaccine directed toward the control and eradication of schistosomiasis.</p>
		</sec>
		<sec>
			<st>
				<p>Results and discussion</p>
			</st>
			<sec>
				<st>
					<p>Reconstruction of the <it>S. mansoni</it> phylome</p>
				</st><p>The <it>S. mansoni</it> phylome reconstructed in this work was derived from the comparative analysis of all proteins encoded in the parasite genome (predicted proteome) and their homologs in 12 other eukaryotic proteomes whose genomes were completely sequenced (Table <tblr tid="T1">1</tblr>). The set of selected species is particularly rich in metazoans (11 species), including ten invertebrates, one tunicate, and one vertebrate. One choanoflagellate<it>, Monosiga brevicollis</it>, was included as outgroup of the phylogenetic reconstruction. The metazoan species selected represent important evolutionary innovations, e.g. the origin of the third germ layer, the development of organs, systems, complex patterns of communication, and the emergence of the adaptive immune system, making this dataset set especially suitable for addressing the evolutionary innovations in <it>S. mansoni</it> in the context of metazoan evolution.</p>
				<table id="T1">
					<title>
						<p>Table 1</p>
					</title>
					<caption>
						<p>
							<b>Proteomes selected for the </b><b><it>S. mansoni </it></b><b>phylome reconstruction</b>
						</p>
					</caption>
					<tgroup align="left" cols="6">
						<colspec align="left" colname="c1" colnum="1" colwidth="1*"/>
						<colspec align="center" colname="c2" colnum="2" colwidth="1*"/>
						<colspec align="center" colname="c3" colnum="3" colwidth="1*"/>
						<colspec align="center" colname="c4" colnum="4" colwidth="1*"/>
						<colspec align="center" colname="c5" colnum="5" colwidth="1*"/>
						<colspec align="left" colname="c6" colnum="6" colwidth="1*"/>
						<thead valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<b>Scientific Name</b>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>
										<b>UniProt Species Code</b>
										<sup>
											<b>1</b>
										</sup>
									</p>
								</entry>
								<entry align="center" colname="c3">
									<p>
										<b>TaxID</b>
										<sup>
											<b>2</b>
										</sup>
									</p>
								</entry>
								<entry align="center" colname="c4">
									<p>
										<b>Proteins</b>
										<sup>
											<b>3</b>
										</sup>
									</p>
								</entry>
								<entry align="center" colname="c5">
									<p>
										<b>Source</b>
										<sup>
											<b>4</b>
										</sup>
									</p>
								</entry>
								<entry colname="c6">
									<p>
										<b>Download</b>
									</p>
								</entry>
							</row>
						</thead>
						<tfoot>
							<p>1 - Code assigned to each species in the <it>S. mansoni</it> phylome. 2 - Taxonomic identifier at NCBI (TaxID). 3 - Number of proteins analyzed per species. 4 - Database from which the protein data were retrieved.</p>
						</tfoot>
						<tbody valign="top">
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Monosiga brevicollis</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>MONBE</p>
								</entry>
								<entry align="center" colname="c3">
									<p>81824</p>
								</entry>
								<entry align="center" colname="c4">
									<p>9,170</p>
								</entry>
								<entry align="center" colname="c5">
									<p>JGI</p>
								</entry>
								<entry colname="c6">
									<p>2011-06-01</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Ciona Intestinalis</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>CIOIN</p>
								</entry>
								<entry align="center" colname="c3">
									<p>7719</p>
								</entry>
								<entry align="center" colname="c4">
									<p>14,048</p>
								</entry>
								<entry align="center" colname="c5">
									<p>UniProt Reference Proteomes</p>
								</entry>
								<entry colname="c6">
									<p>2011-07-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Nematostella vectensis</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>NEMVE</p>
								</entry>
								<entry align="center" colname="c3">
									<p>45351</p>
								</entry>
								<entry align="center" colname="c4">
									<p>24,424</p>
								</entry>
								<entry align="center" colname="c5">
									<p>UniProt Reference Proteomes</p>
								</entry>
								<entry colname="c6">
									<p>2011-07-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Schistosoma haematobium</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>SCHHA</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6185</p>
								</entry>
								<entry align="center" colname="c4">
									<p>12,767</p>
								</entry>
								<entry align="center" colname="c5">
									<p>SchistoDB</p>
								</entry>
								<entry colname="c6">
									<p>2012-03-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Schistosoma mansoni</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>SCHMA</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6183</p>
								</entry>
								<entry align="center" colname="c4">
									<p>11,103</p>
								</entry>
								<entry align="center" colname="c5">
									<p>SchistoDB</p>
								</entry>
								<entry colname="c6">
									<p>2012-03-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Schistosoma japonicum</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>SCHJA</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6182</p>
								</entry>
								<entry align="center" colname="c4">
									<p>12,636</p>
								</entry>
								<entry align="center" colname="c5">
									<p>SchistoDB</p>
								</entry>
								<entry colname="c6">
									<p>2012-03-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Caenorhabditis elegans</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>CAEEL</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6239</p>
								</entry>
								<entry align="center" colname="c4">
									<p>19,758</p>
								</entry>
								<entry align="center" colname="c5">
									<p>UniProt Reference Proteomes</p>
								</entry>
								<entry colname="c6">
									<p>2011-07-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Ascaris suum</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>ASCSU</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6253</p>
								</entry>
								<entry align="center" colname="c4">
									<p>18,430</p>
								</entry>
								<entry align="center" colname="c5">
									<p>WormBase</p>
								</entry>
								<entry colname="c6">
									<p>2012-03-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Brugia malayi</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>BRUMA</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6279</p>
								</entry>
								<entry align="center" colname="c4">
									<p>19,916</p>
								</entry>
								<entry align="center" colname="c5">
									<p>WormBase</p>
								</entry>
								<entry colname="c6">
									<p>2012-03-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Trichinella spiralis</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>TRISP</p>
								</entry>
								<entry align="center" colname="c3">
									<p>6334</p>
								</entry>
								<entry align="center" colname="c4">
									<p>15,878</p>
								</entry>
								<entry align="center" colname="c5">
									<p>WormBase</p>
								</entry>
								<entry colname="c6">
									<p>2012-03-09</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Drosphila melanogaster</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>DROME</p>
								</entry>
								<entry align="center" colname="c3">
									<p>7227</p>
								</entry>
								<entry align="center" colname="c4">
									<p>11,794</p>
								</entry>
								<entry align="center" colname="c5">
									<p>FlyBase</p>
								</entry>
								<entry colname="c6">
									<p>2011-09-13</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Tribolium castaneum</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>TRICA</p>
								</entry>
								<entry align="center" colname="c3">
									<p>7070</p>
								</entry>
								<entry align="center" colname="c4">
									<p>16,533</p>
								</entry>
								<entry align="center" colname="c5">
									<p>BeetleBASE - HGSC</p>
								</entry>
								<entry colname="c6">
									<p>2011-12-16</p>
								</entry>
							</row>
							<row rowsep="1">
								<entry colname="c1">
									<p>
										<it>Homo sapiens</it>
									</p>
								</entry>
								<entry align="center" colname="c2">
									<p>HUMAN</p>
								</entry>
								<entry align="center" colname="c3">
									<p>9606</p>
								</entry>
								<entry align="center" colname="c4">
									<p>20,965</p>
								</entry>
								<entry align="center" colname="c5">
									<p>UniProt Reference Proteomes</p>
								</entry>
								<entry colname="c6">
									<p>2011-07-09</p>
								</entry>
							</row>
						</tbody>
					</tgroup>
				</table><p>To perform the phylogenetic analyses, we applied an automated pipeline similar to the one used for the human phylome project <abbrgrp>
						<abbr bid="B39">39</abbr>
					</abbrgrp>. This pipeline is illustrated here (Figure <figr fid="F1">1</figr>). The resulting alignments, phylogenies, and orthology predictions can be accessed at PhylomeDB <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp> (<url>http://phylomedb.org</url>).</p>
				<fig id="F1"><title><p>Figure 1</p></title><caption><p>Pipeline used to reconstruct and analyze the <it>S. mansoni</it> phylome</p></caption><text>
   <p><b>Pipeline used to reconstruct and analyze the </b><b><it>S. mansoni </it></b><b>phylome.</b> Each protein sequence encoded in the parasite genome was compared against a database of proteins from other 12 fully sequenced eukaryotic proteomes (Table <tblr tid="T1">1</tblr>) to select putative homologous proteins. Groups of potential homologs were aligned and subsequently trimmed to remove gap-rich regions. The refined alignment was used to build a NJ tree, which was then used as a &#8220;seed&#8221; tree to perform a ML likelihood analysis as implemented in PhyML. In the ML analysis, up to five different evolutionary models were tested and the model best fitting to the data was determined by the Akaike Information Criterion (AIC). Different algorithms were used to identify homology relationships and lineage-specific duplications. To extract and interpret the large data set obtained a Structured Query Language (SQL) relational database was built. This database was the main resource for data mining in this work. Adapted from <abbrgrp><abbr bid="B39">39</abbr></abbrgrp>.</p>
</text><graphic file="1471-2164-13-617-1"/></fig><p>Using this phylogenomic approach, we analyzed 11,763 <it>S. mansoni</it> proteins and obtained 7,964 phylogenetic trees covering 70% of the parasite&#8217;s proteome. This coverage is remarkably similar to that of other phylome data of newly sequenced genomes such as that of the pea aphid <it>Acyrthosiphon pisum</it> (67%) <abbrgrp>
						<abbr bid="B38">38</abbr>
					</abbrgrp>.</p><p>The absence of trees for the remaining 3,490 proteins is either due to a possible high degree of divergence between the <it>S. mansoni</it> proteins and their homologs in the other selected species, an indication of the uniqueness of the parasite&#8217;s proteome, or it reflects the presence of errors in gene models. Out of the 7,964 phylogenetic trees, 3.038 (38%) correspond to trees with &#8220;seed&#8221; proteins with a completely unknown function and without any GO <abbrgrp>
						<abbr bid="B48">48</abbr>
					</abbrgrp> assignment in SchistoDB <abbrgrp>
						<abbr bid="B17">17</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Phylogeny-based orthology prediction</p>
				</st><p>In order to create a complete list of orthology and paralogy relationships among <it>S. mansoni</it> proteins and those encoded in the other eukaryotic proteomes included in this work, we analyzed the parasite&#8217;s phylome using a <it>species-overlap</it> algorithm as previously described <abbrgrp>
						<abbr bid="B39">39</abbr>
					</abbrgrp>. The comprehensive catalogue of phylogeny-based orthology and paralogy relationships among <it>S. mansoni</it> and other species was made publicly available at PhylomeDB <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp>.</p><p>Owing to the increasing rate at which new fully sequenced genomes are released, the accumulation of genomic and proteomic data has been much higher than the rates at which genes or proteins are experimentally characterized. Aiming at producing a high confidence set of functional predictions for <it>S. mansoni</it> proteins, we used the evolutionary relationships as inferred from phylogenetic trees to obtain subsets of one-to-one (single homolog in <it>S. mansoni</it> and in other species) homology relationships among <it>S. mansoni</it> proteins and the homologs from other species included in the present study (Figure <figr fid="F2">2</figr>).</p>
				<fig id="F2"><title><p>Figure 2</p></title><caption><p>Homology relationships and evolutionary events inferred from the analysis of a <it>S. mansoni</it> protein</p></caption><text>
   <p><b>Homology relationships and evolutionary events inferred from the analysis of a </b><b><it>S. mansoni </it></b><b>protein.</b><b>A</b>) Phylogenetic tree reconstructed for the parasite &#8220;seed&#8221; protein Phy000V0I5_SCHMA (Smp_175750). <b>B</b>) Homology relationships identified between the &#8220;seed&#8221; protein and its homologs in the other species.</p>
</text><graphic file="1471-2164-13-617-2"/></fig><p>By using such phylogeny-based approach, we transferred 10,175 functional annotations (GO terms <abbrgrp>
						<abbr bid="B48">48</abbr>
					</abbrgrp>) to 3,451 <it>S. mansoni</it> proteins, from which 790 (7% of the parasite&#8217;s proteome) were previously annotated as &#8220;hypothetical protein&#8221;, corresponding to proteins whose function had not been predicted or experimentally tested before (Additional file <supplr sid="S1">1</supplr> Table S1). The transfer was performed from each ortholog with known function in the selected taxa to the <it>S. mansoni</it> &#8220;seed&#8221; protein. For the other proteins that already had any functional prediction, the annotation was confirmed or improved. Consequently, a &#8220;seed&#8221; protein could receive more than one functional description. In these cases, all functional annotations were maintained allowing the user to choose the closest related transferred functional annotation, those that came from model organisms, or even to create a consensus based on all of them.</p>
				<suppl id="S1">
					<title>
						<p>Additional file 1</p>
					</title>
					<text>
						<p>
							<b>Table S1</b>: Functional prediction for the<it>S. mansoni</it> "seed" proteins. <b>Table S2</b>: Comparison between reviewed proteins with functional annotation at UniProt and GO terms transferred by phylogenomic approach. <b>Table S3</b>: Homologous proteins to the <it>S. mansoni</it> "seed" proteins.</p>
					</text>
					<file name="1471-2164-13-617-S1.xls">
   <p>Click here for file</p>
</file>
				</suppl><p>To validate the applied methodology, we retrieved reviewed <it>S. mansoni</it> proteins from UniProt <abbrgrp>
						<abbr bid="B49">49</abbr>
					</abbrgrp>, including experimentally confirmed ones, to evaluate the annotation transferred by the phylogenomic approach. The functional annotations performed by PhylomeDB correspond to known functions in the aforementioned database (Additional file <supplr sid="S1">1</supplr> Table S2). Even though the BLAST search may detect distant homologs with additional domains, our subsequent phylogenetic reconstruction and our selection of orthologs will select those orthologs that are likely to have similar domain architecture. This is an additional reason why an orthology-based annotation is preferred over sequence similarity searches, since orthologs as compared to paralogs have a higher tendency to share a similar domain architecture <abbrgrp>
						<abbr bid="B50">50</abbr>
					</abbrgrp>.</p><p>Although less reliable than those based on one-to-one orthology relationships, annotation transfer based on more complex subsets (one-to-many, many-to-one, or many-to-many) may provide important hints to predict the biological function of <it>S. mansoni</it> proteins. However, in these cases, one or more genes are co-orthologous to a set of genes in another genome due to lineage-specific duplication(s) that can be associated with functional shifts, affecting the reliability of the functional transfer <abbrgrp>
						<abbr bid="B38">38</abbr>
						<abbr bid="B51">51</abbr>
					</abbrgrp>. An example of a one-to-one transfer from a <it>Drosophila melanogaster</it> protein to a <it>S. mansoni</it> protein comes from the phylogenetic reconstruction of the Phy000V14T_SCHMA (Smp_170950) protein, potentially related to the glycine cleavage system, and its homologs in the selected species (Figure <figr fid="F3">3</figr>). The analysis of this tree resulted in six transfers of functional annotation from homologous proteins to the <it>S. mansoni</it> &#8220;seed&#8221; protein. The GO terms in all six functional annotations are related to aminomethyltransferase activity and glycine catabolic process providing further support for the annotation transfer. In this example, to illustrate a case of a one-to-one transfer, we chose the functional annotation transferred from <it>Drosophila melanogaster</it> once, according to the information available in UniProt <abbrgrp>
						<abbr bid="B49">49</abbr>
					</abbrgrp>, it is one of the orthologs with known function and experimental validation. Tags for homologous sequences with experimental validation are not available in PhylomeDB <abbrgrp>
						<abbr bid="B47">47</abbr>
					</abbrgrp>. However, links to UniProt <abbrgrp>
						<abbr bid="B49">49</abbr>
					</abbrgrp> and other databases are provided.</p>
				<fig id="F3"><title><p>Figure 3</p></title><caption><p>Example of functional prediction based on phylogenetic analysis</p></caption><text>
   <p><b>Example of functional prediction based on phylogenetic analysis.</b> The protein sequences are represented by the internal identifier in PhylomeDB. Relationships among the parasite Phy000V14T_SCHMA &#8220;seed&#8221; protein (Smp_170950) and its homologs in other species (Table <tblr tid="T1">1</tblr>) as inferred by maximum likelihood method implemented in PhyML. Support values were computed by approximate likelihood ratio test (aLTR). Curly brackets hold Gene Ontology (GO) terms for proteins in this dataset.</p>
</text><graphic file="1471-2164-13-617-3"/></fig><p>To explore the benefits offered by comparative genomics in order to improve functional annotation of genes and gene products, it is also necessary to consider the limitations involved in this approach. Although it is generally accepted that functional annotation through orthology, rather than just homology relationship, constitutes one of the most promising annotation approaches, these surveys are designed to provide predictions regarding the likely protein function, but it does not substitute experimental confirmation <abbrgrp>
						<abbr bid="B36">36</abbr>
						<abbr bid="B52">52</abbr>
					</abbrgrp>. Functional diversity is often associated with significant divergence at the sequence level, but high levels of identity do not ensure that two or more proteins perform the same function, since subtle changes in active sites are able to completely change the protein function <abbrgrp>
						<abbr bid="B53">53</abbr>
					</abbrgrp>.</p><p>As we previously mentioned, evolutionary analysis involving fully sequenced genomes/proteomes remains a challenge. Although the tools here applied were not originally designed for large scale phylogenetic analysis, we adapted them to work on a large scale, since we strongly believe that a system-wide perspective on evolutionary processes can greatly improve the understanding on how genomes came to be and what evolutionary process took them there. Functional prediction as described in the present work could be used as a starting point for future projects, prioritizing the selection of certain genes or proteins for new experimental studies.</p>
			</sec>
			<sec>
				<st>
					<p>Detection of gene duplications in <it>S. mansoni</it>
					</p>
				</st><p>An additional advantage of the phylogeny-based approach is that it readily provides a collection of gene evolutionary histories that can be mined for particular events. Since gene duplication is considered one of the main mechanisms for functional innovation and diversification <abbrgrp>
						<abbr bid="B54">54</abbr>
					</abbrgrp>, we explored the <it>S. mansoni</it> phylome to identify protein families that have been specifically expanded in this lineage, since its diversification from the other sequenced metazoans. We used the above-mentioned <it>species-overlap</it> algorithm that identifies duplication nodes and also provides clues of the relative dating of the duplication event <abbrgrp>
						<abbr bid="B39">39</abbr>
						<abbr bid="B55">55</abbr>
					</abbrgrp>.</p><p>Such analysis revealed that in 3,051 reconstructed phylogenetic trees there is at least one paralog connected to the &#8220;seed&#8221; protein through a duplication node (Additional file <supplr sid="S1">1</supplr> Table S3). Among these, 211 phylogenies show lineage-specific duplications in the three <it>Schistosoma</it> species in comparison with the other taxa. These expansions are small-to-moderate in size, resulting in a total of two to ten paralogs, and include some of the most significant expansions as discussed below.</p><p>The inclusion of <it>S. haematobium</it> and <it>S. japonicum</it> proteomes gave us a high resolution within <it>Schistosoma</it> genus and allowed us to make comparisons across this taxon. In general, the expansions observed in <it>S. mansoni</it> can also be observed in the other two <it>Schistosoma</it> species, although with variable number of paralogs in each species. As previously observed by evolutionary relationships, cytogenetic data, and syntenic analyses, the present study shows that <it>S. mansoni</it> is more closely related to <it>S. haematobium</it> than to the <it>S. japonicum</it>
					<abbrgrp>
						<abbr bid="B56">56</abbr>
						<abbr bid="B57">57</abbr>
						<abbr bid="B58">58</abbr>
						<abbr bid="B59">59</abbr>
					</abbrgrp>. Moreover, 170 evolutionary trees have only <it>S. mansoni</it> and <it>S. haematobium</it> proteins, while only six phylogenies have solely <it>S. mansoni</it> and <it>S. japonicum</it> proteins. Meanwhile, most of the homologous proteins shared by <it>S. mansoni</it> and <it>S. haematobium</it> are annotated as &#8220;hypothetical protein&#8221; and do not have any predicted function or significant hits with known proteins in public databases as UniProt <abbrgrp>
						<abbr bid="B49">49</abbr>
					</abbrgrp>, Pfam <abbrgrp>
						<abbr bid="B60">60</abbr>
					</abbrgrp>, or non-redundant (nr) NCBI database (<url>ftp://ftp.ncbi.nih.gov/blast/db</url>).</p><p>A small number of phylogenetic trees (1,45%) had only sequences of <it>S. mansoni</it>. These could be the result of very recent duplication events of proteins that are specific to this species. However, many of these genes were not found in the genetic map of <it>S. mansoni</it>
					<abbrgrp>
						<abbr bid="B16">16</abbr>
						<abbr bid="B18">18</abbr>
					</abbrgrp> and they do not contain protein domains traceable at Pfam <abbrgrp>
						<abbr bid="B60">60</abbr>
					</abbrgrp>. BLAST searches against the non-redundant (nr) NCBI database detected a few non-<it>Schistosoma</it> proteins as significant hits that were annotated as hypothetical in all cases. For these reasons we rather believe that these sequences correspond to spurious predictions. Further analyses will be conducted in the future in order to confirm or refute this hypothesis.</p><p>Among the most significant protein expansions in <it>S. mansoni</it> we identified tetraspanins, fucosyltransferases, venom allergen-like proteins (SmVAL), tegumental-allergen-like proteins (SmTAL), leishmanolysins, and elastases, which were previously proposed as drug targets, once they can be related to morphological or physiological specificities of this parasite <abbrgrp>
						<abbr bid="B5">5</abbr>
						<abbr bid="B20">20</abbr>
						<abbr bid="B61">61</abbr>
						<abbr bid="B62">62</abbr>
						<abbr bid="B63">63</abbr>
						<abbr bid="B64">64</abbr>
						<abbr bid="B65">65</abbr>
					</abbrgrp>. In these cases, the protein family membership ranged from 6 to 23 paralogs encoded in the parasite&#8217;s genome.</p><p>Tetraspanins are small proteins with four transmembrane domains involved in the coordination of intra and intercellular processes, such as signal transduction, cell proliferation, adhesion, and migration, cell fusion and host-parasite interactions <abbrgrp>
						<abbr bid="B66">66</abbr>
						<abbr bid="B67">67</abbr>
					</abbrgrp>. The function of schistosome tetraspanins are not completely understood, but cell-cell interactions and maintenance of cell membrane integrity might be performed by these proteins as well as they can be receptors for host ligands, acting on immune evasion <abbrgrp>
						<abbr bid="B61">61</abbr>
					</abbrgrp>. The suppression of two tetraspanin genes (<it>Sm-tsp-1</it> and <it>Sm-tsp-2</it>) by RNA interference in mice also suggests that these proteins play important structural roles in the parasite&#8217;s tegument, being a good target for anti-schistosomal vaccine <abbrgrp>
						<abbr bid="B68">68</abbr>
					</abbrgrp>. Figure <figr fid="F4">4</figr> illustrates an example of tetraspanin lineage-specific duplications. In this case, the number of homologs in the three Schistosoma species varies from six to eight. Tree topology shows distinct well-supported clades suggesting that structural and/or functional variants might be present. Three proteins in this dataset have experimental evidence: Phy0048JNS_SCHHA (Q26499), Phy0048WJL_SCHMA (P19331), and Phy0005UU9_DROME (O46101) <abbrgrp>
						<abbr bid="B49">49</abbr>
						<abbr bid="B69">69</abbr>
						<abbr bid="B70">70</abbr>
					</abbrgrp>.</p>
				<fig id="F4"><title><p>Figure 4</p></title><caption><p>Phylogenetic relationships of schistosome lineage-specific duplicated tetraspanins</p></caption><text>
   <p><b>Phylogenetic relationships of schistosome lineage-specific duplicated tetraspanins.</b> Analysis was performed with trimmed sequence alignment by using the maximum likelihood method as implemented in PhyML. Best fit model (WAG) and support values for each node were estimated by the Akaike Likelihood Ratio Test (aLRT). Sequence labels follow the PhylomeDB internal identifier. For details, see supplementary data (Additional file <supplr sid="S1">1</supplr> Table S3).</p>
</text><graphic file="1471-2164-13-617-4"/></fig><p>Venom allergen-like proteins (SmVAL), also called sperm-coating protein-like (SCP-like), are structurally related proteins members of the SCP/TAPS family. In Platyhelminthes, these proteins have been linked as potential modulators of immune function and components of sexual development <abbrgrp>
						<abbr bid="B71">71</abbr>
					</abbrgrp>. Although the specific function of each SmVAL family member is unknown, there is evidence suggesting potential roles in larval penetration, host immune response modulation, and adult worm development <abbrgrp>
						<abbr bid="B63">63</abbr>
						<abbr bid="B71">71</abbr>
					</abbrgrp>. Furthermore, analyses of SmVAL transcripts demonstrated that the corresponding genes are upregulated in infective stages of the parasite, highlighting SmVAL proteins as candidates for novel vaccine strategies <abbrgrp>
						<abbr bid="B71">71</abbr>
						<abbr bid="B72">72</abbr>
					</abbrgrp>.</p><p>Fucosyltransferases are enzymes that catalyses the fucose transfer from the donor guanosine-diphosphate fucose to different acceptor molecules such as oligosaccharides, glycoproteins, and glycolipids <abbrgrp>
						<abbr bid="B73">73</abbr>
					</abbrgrp>. In schistosomes, fucosyltransferases are involved in producing immunomodulatory epitopes during infection, granuloma formation, egg/endothelium interactions, and were previously highlighted as anti-schistosomal candidates <abbrgrp>
						<abbr bid="B63">63</abbr>
						<abbr bid="B74">74</abbr>
					</abbrgrp>.</p><p>Tegumental-Allergen-Like proteins (SmTALs) are members of a protein family present in parasitic Platyhelminthes <abbrgrp>
						<abbr bid="B64">64</abbr>
						<abbr bid="B75">75</abbr>
					</abbrgrp>. These proteins are located inside the tegument and have different life-cycle expression patterns <abbrgrp>
						<abbr bid="B64">64</abbr>
					</abbrgrp>. The tegumental protein Sm22.6 is considered the main target for human IgE in <it>S. mansoni</it> and human IgE response against this protein is associated with the development of age-dependent partial immunity to <it>S. mansoni</it> infections in endemic areas <abbrgrp>
						<abbr bid="B64">64</abbr>
						<abbr bid="B76">76</abbr>
					</abbrgrp>.</p><p>Leishmanolysin, (also called invadolysin and SmPepM8), is a major surface protease member of the metallopeptidase M8 family. This protein can perform activities in schistosomes similar to those performed in <it>Leishmania</it> where these proteins are involved in different types of processes like degradation of the extracellular matrix and inhibition or perturbations of host cell interactions <abbrgrp>
						<abbr bid="B63">63</abbr>
						<abbr bid="B72">72</abbr>
					</abbrgrp>. In turn, elastases are serine proteases that in schistosomes play a pivotal role in the penetration by cercariae of host skin to initiate infection. Recent studies have also revealed that these proteases can be employed by schistosomes to overcome or evade the host immune response <abbrgrp>
						<abbr bid="B77">77</abbr>
						<abbr bid="B78">78</abbr>
					</abbrgrp>. Members of <it>S. mansoni</it> peptidase families such as leishmanolysins, cercarial elastases, and cathepsin D proteins were subjected to a detailed study in respect to their domain architectures, functional properties, and evolutionary relationships as described elsewhere <abbrgrp>
						<abbr bid="B65">65</abbr>
					</abbrgrp>.</p><p>Another specific feature of schistosomes is related to their tegument. Distinct from nematodes, which have a cuticle covering and protecting the organism body, schistosomes are covered by a living syncytium bounded by a complex multilaminate surface, which undergoes several adaptations soon after infection is initiated <abbrgrp>
						<abbr bid="B79">79</abbr>
						<abbr bid="B80">80</abbr>
						<abbr bid="B81">81</abbr>
					</abbrgrp>. The external double membrane plays a crucial role in host-parasite interactions, being responsible for diverse mechanisms of survival <abbrgrp>
						<abbr bid="B19">19</abbr>
						<abbr bid="B82">82</abbr>
						<abbr bid="B83">83</abbr>
					</abbrgrp>. The development of a tegument, highly specialized and resistant to immune damage, was accompanied by evolutionary adaptations, for example, the expansions of other protein families encoding annexins, cadherins, and innexins.</p><p>Annexins are widely distributed in eukaryotes performing a broad range of important biological processes related to tegument membrane <abbrgrp>
						<abbr bid="B84">84</abbr>
						<abbr bid="B85">85</abbr>
						<abbr bid="B86">86</abbr>
					</abbrgrp>. In schistosomes, annexins appear to be involved in parasite&#8217;s stability protecting against immune attack by the host as well as against structural breakdown <abbrgrp>
						<abbr bid="B85">85</abbr>
						<abbr bid="B86">86</abbr>
					</abbrgrp>. Cadherins are adhesion molecules that mediate Ca<sup>2+</sup>-dependent cell-cell adhesion and whose duplication events happened probably in parallel to the advent of a third germ layer in flatworms <abbrgrp>
						<abbr bid="B5">5</abbr>
						<abbr bid="B87">87</abbr>
					</abbrgrp>. Innexins are components of gap-junction proteins, the intercellular channels that allow for the exchange of ions and other small signal molecules <abbrgrp>
						<abbr bid="B88">88</abbr>
						<abbr bid="B89">89</abbr>
					</abbrgrp>. In <it>C. elegans</it>, innexins have been implicated in different processes like electrical coupling between pharyngeal muscles, calcium propagation in the gut, gap junction-mediated oocyte, and sensory neuron identity <abbrgrp>
						<abbr bid="B89">89</abbr>
					</abbrgrp>.</p><p>In summary, we identified that approximately 45% of the <it>S. mansoni</it> predicted proteins that were covered by this phylogenomic analysis have, at least, one paralog encoded in the parasite genome that might have arisen by gene duplication events that occurred after its divergence from other selected taxa (Additional file <supplr sid="S1">1</supplr> Table S3). In other eukaryotic genomes this value ranges from 30 and 65% <abbrgrp>
						<abbr bid="B90">90</abbr>
					</abbrgrp>, whereas in <it>C. elegans</it> this value is equal to 49% <abbrgrp>
						<abbr bid="B91">91</abbr>
					</abbrgrp>.</p><p>Altogether, the present results indicate that besides the exploitation of host endocrine and immune signals, the parasite genome exhibit multiple events of gene duplication which may be, at least partially, an adaptive response related to the parasitic lifestyle. These expansions probably reflect the intriguing complexity of evolutionary events that happened over time, resulting in important characteristics in schistosome&#8217;s biology with consequences to the disease it causes. Taking into account the host environment and the selective forces that it imposes to a parasite, the phylogeny of host(s) and parasite(s) are probably closely related, once this coevolution will be responsible for the continuity or elimination of such an interaction. Nonetheless, previous empirical experiments involving schistosomes and the intermediate host provide further support to suggest the potential for host-schistosome coevolution <abbrgrp>
						<abbr bid="B92">92</abbr>
					</abbrgrp>.</p><p>In this context, it is important to analyze the evolutionary history of protein families during screening for potential targets for drug and vaccine development. Incorporating the evolutionary perspective in drug development studies can improve our understanding regarding drug resistance and effectiveness, as well as to guide new strategies of drug discovery. Gene duplication events as well as adaptive evolution should be considered during this process, since an anti-parasitic drug could bind a single protein or in all proteins encoded by a multi-gene family <abbrgrp>
						<abbr bid="B93">93</abbr>
					</abbrgrp>. As a consequence, therapies which target a subset of genes that arose by duplication may not be effective at low doses. To solve this problem, the drug's effectiveness can be increased when a single-copy gene is targeted and its function is inactivated causing complete perturbation of a vital pathway <abbrgrp>
						<abbr bid="B93">93</abbr>
						<abbr bid="B94">94</abbr>
					</abbrgrp>.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Conclusions</p>
			</st><p>Through a systemic approach, we may accelerate the advance towards the understanding of schistosomiasis, its etiologic agents, and host-parasite interactions, optimizing the discovery of therapeutic targets to the development of new drugs and vaccines. Besides promoting a significant improvement in the functional annotation of the <it>S. mansoni</it> predicted proteome, our approach provided relevant information about the parasite&#8217;s genome evolution such as the identification of gene duplication events and expanded protein families, supplying important information regarding the mechanisms involved in <it>Schistosoma</it>&#8217;s genome evolution. Among the parasite paralog groups, we identified proteases, tetraspanins, fucosyltransferases, venom allergen-like proteins (also called as SmVAL or SCP-like), and tegumental-allergen-like proteins (SmTAL) that may be related to morphological or physiological specificities of this parasite. In addition, we strongly believe that the <it>S. mansoni</it> phylome data will pave the way for other, more detailed analysis, such as those that have been already performed on expanded peptidases families <abbrgrp>
					<abbr bid="B65">65</abbr>
				</abbrgrp>.</p><p>One of the remaining challenges is to understand which evasion strategies enable this parasite to survive for years in a potentially hostile environment, protected from the host immune system action and/or actively making the host response ineffective. Different mechanisms may be involved in these processes, including the generation of variant proteins by expression of micro-exon genes (MEG), which have been pointed as a potential strategy <abbrgrp>
					<abbr bid="B94">94</abbr>
				</abbrgrp>, and small non-coding RNAs which perform many essential regulatory functions <abbrgrp>
					<abbr bid="B95">95</abbr>
				</abbrgrp>.</p><p>Insights obtained through this phylogenomic approach will help us to guide forward genetic approaches to better understand the host-pathogen relationships toward to the elucidation of novel drug targets and vaccine candidates urgently needed to reduce the morbidity and mortality caused by schistosomiasis worldwide. Continuing this work, a comparative analysis involving genomic, transcriptomic, and proteomic data from other helminth species as <it>Taenia solium</it>, <it>Echinococcus multiloculares</it>, <it>Echinococcus granulosus</it>, <it>Fasciola hepatica</it>, other parasites, and vectors will provide valuable information from a system-wide perspective of a broad range of organisms, improving our understanding regarding the parasitic lifestyle.</p>
		</sec>
		<sec>
			<st>
				<p>Methods</p>
			</st>
			<sec>
				<st>
					<p>Organisms and sequence data</p>
				</st><p>Predicted proteomes from 13 fully sequenced eukaryotic genomes were downloaded from JGI Genome Projects, SchistoDB, Quest For Orthologs, WormBase, BeetleBASE, and FlyBase (Table <tblr tid="T1">1</tblr>). The taxon sampling was selected according to the availability of the predicted proteomes and based on the phylogenetic position of each species. The comprehensive taxa selected cover important evolutionary innovations making this dataset set especially suitable for addressing the evolutionary innovations in schistosomes in the context of metazoan evolution. Model organisms were also included to provide functional annotations that could be potentially transferred to <it>S. mansoni</it> homologous proteins.</p>
			</sec>
			<sec>
				<st>
					<p>Phylome reconstruction</p>
				</st><p>To reconstruct the complete collection of phylogenetic trees for all <it>S. mansoni</it> proteins and their homologs in other 12 fully sequenced organisms (Table <tblr tid="T1">1</tblr>), we used a similar automated pipeline to that described earlier for the human proteome <abbrgrp>
						<abbr bid="B39">39</abbr>
					</abbrgrp> (Figure <figr fid="F1">1</figr>). A local database was created containing data from the <it>S. mansoni</it> proteome and those of 12 other completely sequenced genomes/proteomes. Alternative splicing products and identical sequences from the <it>S. mansoni</it> proteome were filtered out. For each protein encoded in the <it>S. mansoni</it> genome (&#8220;seed&#8221;), a Smith-Waterman search <abbrgrp>
						<abbr bid="B96">96</abbr>
					</abbrgrp> (E-value &#8804; 10<sup>-5</sup>) was performed against the above mentioned database to retrieve proteins with significant sequence similarity. Sequences that aligned with a continuous region longer than 50% of the query sequence were selected and aligned using MUSCLE 3.6 <abbrgrp>
						<abbr bid="B40">40</abbr>
					</abbrgrp>, MAFFT <abbrgrp>
						<abbr bid="B41">41</abbr>
					</abbrgrp>, DIALIGN-TX <abbrgrp>
						<abbr bid="B42">42</abbr>
					</abbrgrp>, and M-Coffee <abbrgrp>
						<abbr bid="B43">43</abbr>
					</abbrgrp> with default parameters. Positions in the alignment containing a high number of gaps were eliminated using trimAl <abbrgrp>
						<abbr bid="B44">44</abbr>
					</abbrgrp>, with a consistency cutoff of 0.1667 and a gap score cutoff of 0.1. Neighbor-joining trees were derived from the trimmed alignments using <it>scoredist</it> distances as implemented in BioNJ <abbrgrp>
						<abbr bid="B45">45</abbr>
					</abbrgrp> and maximum likelihood trees were obtained as implemented in PhyML using the NJ tree as a starting point <abbrgrp>
						<abbr bid="B46">46</abbr>
					</abbrgrp>. For each &#8220;seed&#8221; protein phylogenetic reconstruction, we tested four different evolutionary models (JTT, WAG, BLOSUM62, VT, LG, CpREV, and DCMut). In all cases a discrete gamma-distribution model with four rate categories plus invariant positions was assumed, the gamma parameter and the fraction of invariant positions were estimated from the data. Tree support values were computed by approximate likelihood ratio test (aLTR) as implemented in PhyML <abbrgrp>
						<abbr bid="B46">46</abbr>
						<abbr bid="B97">97</abbr>
					</abbrgrp>. The evolutionary model best fitting the data was determined by comparing the likelihood of the used models according to the Akaike Information Criterion (AIC) <abbrgrp>
						<abbr bid="B98">98</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Prediction of homology relationships</p>
				</st><p>To derive orthology and paralogy relationships among <it>S. mansoni</it> proteins and those encoded in the other genomes included in this study we used a <it>species-overlap</it> algorithm as described in <abbrgrp>
						<abbr bid="B39">39</abbr>
					</abbrgrp> and as implemented in ETE (Environment for Tree Exploration) <abbrgrp>
						<abbr bid="B99">99</abbr>
					</abbrgrp>. This algorithm uses the level of species overlap between the two daughter partitions of a given node to define it as duplication or a speciation event. The analysis starts at the protein used to generate the tree (&#8220;seed&#8221; protein) and runs through the internal nodes of the tree until it reaches the root. All the trees were rooted at the midpoint. If the two partitions share any species (if there is species overlap), the node is defined as a duplication node and the proteins are considered paralogous ones. Otherwise (if there is no overlap) the node is defined as a speciation node leading to orthologous proteins. Once all the nodes have been classified, the algorithm establishes the orthology and paralogy relationships between the &#8220;seed&#8221; protein and other proteins included in the tree according to the original definition of these terms <abbrgrp>
						<abbr bid="B39">39</abbr>
						<abbr bid="B100">100</abbr>
					</abbrgrp>. A previous study has shown that the <it>species-overlap</it> algorithm produces reliable orthology predictions with higher sensibility than a strict reconciliation method <abbrgrp>
						<abbr bid="B101">101</abbr>
					</abbrgrp>.</p>
			</sec>
			<sec>
				<st>
					<p>Orthology-based functional annotation</p>
				</st><p>Based on the list of orthology and paralogy relationship we performed the transfer of functional annotation from each ortholog with known function to the <it>S. mansoni</it> &#8220;seed&#8221; proteins. To produce a confident set of functional predictions for <it>S. mansoni</it> proteins, we classified the list of orthologs in different subsets of orthology relationships (one-to-one, one-to-many, many-to-one, and many-to-many) between the <it>S. mansoni</it> proteins and the other proteins included in this phylome data. If no duplication has occurred since the speciation, the two genes form a one-to-one relationship. If subsequent duplications have occurred, other types of orthology relationships (one-to-many or many-to-many) were assigned <abbrgrp>
						<abbr bid="B51">51</abbr>
					</abbrgrp>. One example of this classification is provided (Figure <figr fid="F2">2</figr>).</p><p>To further analyze such large data set, we built the SchistoPhylomeSQL a Structured Query Language relational database using MySQL as a database management system. This local database integrates information from PhylomeDB (<url>http://phylomedb.org</url>) and SchistoDB (<url>http://www.schistodb.net</url>). Access to the database was obtained using DbVisualizer version 7.0.5 (<url>http://www.dbvis.com</url>), a graphical user interface that allows developing and accessing database management system (DBMSs) in different operating systems. The SchistoPhylomeSQL database was the main resource for data mining in this work. Perl scripts and SQL queries were implemented to parse the text files and load them to the database.</p>
			</sec>
			<sec>
				<st>
					<p>Detection of <it>S. mansoni</it> gene expansions</p>
				</st><p>Using ETE <abbrgrp>
						<abbr bid="B99">99</abbr>
					</abbrgrp>, we analyzed the <it>S. mansoni</it> phylome data to identify protein families that were specifically expanded in the <it>S. mansoni</it> lineage since its diversification from the other metazoans (Additional file <supplr sid="S1">1</supplr> Table S3). The duplication events defined by the <it>species-overlap</it> algorithm that only comprised paralogs from <it>S. mansoni</it> were considered lineage-specific duplications. In cases where the information extracted from more than one phylogenetic tree contained the same paralogous proteins, changing only the &#8220;seed&#8221; protein position, the data was filtered to obtain a non-redundant list of in-paralogs.</p>
			</sec>
		</sec>
		<sec>
			<st>
				<p>Competing interests</p>
			</st><p>The authors declare that they have no competing interests.</p>
		</sec>
		<sec>
			<st>
				<p>Author&#8217;s contributions</p>
			</st><p>LLS: carried out the phylogenetic and functional annotation studies, and drafted the manuscript. MM: performed the phylome reconstruction and functional annotation transfer. LAN: participated in the coordination of this study, and co-wrote this manuscript. AZ: wrote the Perl scripts for data manipulation and provided computational support for this study. TG: participated in the coordination of this study, supervised the phylome reconstruction, and co-wrote this manuscript. GO: participated in the design and coordination of this study, and co-wrote this manuscript. All authors read and approved the final manuscript.</p>
		</sec>
	</bdy>
	<bm>
		<ack>
			<sec>
				<st>
					<p>Acknowledgments</p>
				</st><p>The authors acknowledge the use of the computing resources of the Centre de Regulaci&#243; Gen&#242;mica (CRG, Spain) and the Center for Excellence in Bioinformatics (CEBio/Fiocruz, Brazil). The authors are grateful to Jaime Huerta-Cepas (CRG), Eric Aguiar, and Francislon Silva (CEBio) for bioinformatics technical support and to Mariana de Oliveira (CEBio) for help with illustrations in this work. This work was funded by the National Institutes of Health - NIH/Fogarty International Center (TW007012 to GO), the National Council for Research and Development - CNPq (CNPq Research Fellowship 306879/2009-3 to GO, INCT-DT 573839/2008-5 to GO, and CNPq-Universal 476036/2010-0 to LAN), the Spanish Ministry of Science and Innovation (BFU2009-09168 to TG), and the Research Foundation of the State of Minas Gerais - FAPEMIG (CBB-1181/08 and PPM-00439-10 to GO).</p>
			</sec>
		</ack>
		<refgrp><bibl id="B1"><title><p>The global epidemiological situation of schistosomiasis and new approaches to control and research</p></title><aug><au><snm>Engels</snm><fnm>D</fnm></au><au><snm>Chitsulo</snm><fnm>L</fnm></au><au><snm>Montresor</snm><fnm>A</fnm></au><au><snm>Savioli</snm><fnm>L</fnm></au></aug><source>Acta Trop</source><pubdate>2002</pubdate><volume>82</volume><issue>2</issue><fpage>139</fpage><lpage>146</lpage></bibl><bibl id="B2"><title><p>Schistosomiasis and water resources development: systematic review, meta-analysis, and estimates of people at risk</p></title><aug><au><snm>Steinmann</snm><fnm>P</fnm></au><au><snm>Keiser</snm><fnm>J</fnm></au><au><snm>Bos</snm><fnm>R</fnm></au><au><snm>Tanner</snm><fnm>M</fnm></au><au><snm>Utzinger</snm><fnm>J</fnm></au></aug><source>Lancet Infect Dis</source><pubdate>2006</pubdate><volume>6</volume><issue>7</issue><fpage>411</fpage><lpage>425</lpage></bibl><bibl id="B3"><title><p>Schistosomiasis</p></title><aug><au><snm>Gryseels</snm><fnm>B</fnm></au></aug><source>Infect Dis Clin North Am</source><pubdate>2012</pubdate><volume>26</volume><issue>2</issue><fpage>383</fpage><lpage>397</lpage></bibl><bibl id="B4"><title><p>Schistosomiasis: population requiring preventive chemotherapy and number of people treated in 2010</p></title><aug><au><snm>Organization</snm><fnm>WH</fnm></au></aug><source>Wkly Epidemiol Rec</source><pubdate>2012</pubdate><volume>87</volume><issue>4</issue><fpage>37</fpage><lpage>44</lpage></bibl><bibl id="B5"><title><p>The genome of the blood fluke Schistosoma mansoni</p></title><aug><au><snm>Berriman</snm><fnm>M</fnm></au><au><snm>Haas</snm><fnm>BJ</fnm></au><au><snm>LoVerde</snm><fnm>PT</fnm></au><au><snm>Wilson</snm><fnm>RA</fnm></au><au><snm>Dillon</snm><fnm>GP</fnm></au><au><snm>Cerqueira</snm><fnm>GC</fnm></au><au><snm>Mashiyama</snm><fnm>ST</fnm></au><au><snm>Al-Lazikani</snm><fnm>B</fnm></au><au><snm>Andrade</snm><fnm>LF</fnm></au><au><snm>Ashton</snm><fnm>PD</fnm></au><au><snm>Aslett</snm><fnm>MA</fnm></au><au><snm>Bartholomeu</snm><fnm>DC</fnm></au><au><snm>Blandin</snm><fnm>G</fnm></au><au><snm>Caffrey</snm><fnm>CR</fnm></au><au><snm>Coghlan</snm><fnm>A</fnm></au><au><snm>Coulson</snm><fnm>R</fnm></au><au><snm>Day</snm><fnm>TA</fnm></au><au><snm>Delcher</snm><fnm>A</fnm></au><au><snm>DeMarco</snm><fnm>R</fnm></au><au><snm>Djikeng</snm><fnm>A</fnm></au><au><snm>Eyre</snm><fnm>T</fnm></au><au><snm>Gamble</snm><fnm>JA</fnm></au><au><snm>Ghedin</snm><fnm>E</fnm></au><au><snm>Gu</snm><fnm>Y</fnm></au><au><snm>Hertz-Fowler</snm><fnm>C</fnm></au><au><snm>Hirai</snm><fnm>H</fnm></au><au><snm>Hirai</snm><fnm>Y</fnm></au><au><snm>Houston</snm><fnm>R</fnm></au><au><snm>Ivens</snm><fnm>A</fnm></au><au><snm>Johnston</snm><fnm>DA</fnm></au><au><snm>Lacerda</snm><fnm>D</fnm></au><au><snm>Macedo</snm><fnm>CD</fnm></au><au><snm>McVeigh</snm><fnm>P</fnm></au><au><snm>Ning</snm><fnm>Z</fnm></au><au><snm>Oliveira</snm><fnm>G</fnm></au><au><snm>Overington</snm><fnm>JP</fnm></au><au><snm>Parkhill</snm><fnm>J</fnm></au><au><snm>Pertea</snm><fnm>M</fnm></au><au><snm>Pierce</snm><fnm>RJ</fnm></au><au><snm>Protasio</snm><fnm>AV</fnm></au><au><snm>Quail</snm><fnm>MA</fnm></au><au><snm>Rajandream</snm><fnm>MA</fnm></au><au><snm>Rogers</snm><fnm>J</fnm></au><au><snm>Sajid</snm><fnm>M</fnm></au><au><snm>Salzberg</snm><fnm>SL</fnm></au><au><snm>Stanke</snm><fnm>M</fnm></au><au><snm>Tivey</snm><fnm>AR</fnm></au><au><snm>White</snm><fnm>O</fnm></au><au><snm>Williams</snm><fnm>DL</fnm></au><au><snm>Wortman</snm><fnm>J</fnm></au><au><snm>Wu</snm><fnm>W</fnm></au><au><snm>Zamanian</snm><fnm>M</fnm></au><au><snm>Zerlotini</snm><fnm>A</fnm></au><au><snm>Fraser-Liggett</snm><fnm>CM</fnm></au><au><snm>Barrell</snm><fnm>BG</fnm></au><au><snm>El-Sayed</snm><fnm>NM</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>460</volume><issue>7253</issue><fpage>352</fpage><lpage>358</lpage></bibl><bibl id="B6"><title><p>The Schistosoma japonicum genome reveals features of host-parasite interplay</p></title><aug><au><snm>SjGSaFA</snm><fnm>C</fnm></au></aug><source>Nature</source><pubdate>2009</pubdate><volume>460</volume><issue>7253</issue><fpage>345</fpage><lpage>351</lpage></bibl><bibl id="B7"><title><p>Whole-genome sequence of Schistosoma haematobium</p></title><aug><au><snm>Young</snm><fnm>ND</fnm></au><au><snm>Jex</snm><fnm>AR</fnm></au><au><snm>Li</snm><fnm>B</fnm></au><au><snm>Liu</snm><fnm>S</fnm></au><au><snm>Yang</snm><fnm>L</fnm></au><au><snm>Xiong</snm><fnm>Z</fnm></au><au><snm>Li</snm><fnm>Y</fnm></au><au><snm>Cantacessi</snm><fnm>C</fnm></au><au><snm>Hall</snm><fnm>RS</fnm></au><au><snm>Xu</snm><fnm>X</fnm></au><au><snm>Chen</snm><fnm>F</fnm></au><au><snm>Wu</snm><fnm>X</fnm></au><au><snm>Zerlotini</snm><fnm>A</fnm></au><au><snm>Oliveira</snm><fnm>G</fnm></au><au><snm>Hofmann</snm><fnm>A</fnm></au><au><snm>Zhang</snm><fnm>G</fnm></au><au><snm>Fang</snm><fnm>X</fnm></au><au><snm>Kang</snm><fnm>Y</fnm></au><au><snm>Campbell</snm><fnm>BE</fnm></au><au><snm>Loukas</snm><fnm>A</fnm></au><au><snm>Ranganathan</snm><fnm>S</fnm></au><au><snm>Rollinson</snm><fnm>D</fnm></au><au><snm>Rinaldi</snm><fnm>G</fnm></au><au><snm>Brindley</snm><fnm>PJ</fnm></au><au><snm>Yang</snm><fnm>H</fnm></au><au><snm>Wang</snm><fnm>J</fnm></au><au><snm>Gasser</snm><fnm>RB</fnm></au></aug><source>Nat Genet</source><pubdate>2012</pubdate><volume>44</volume><issue>2</issue><fpage>221</fpage><lpage>225</lpage></bibl><bibl id="B8"><aug><au><cnm>TDR</cnm></au></aug><source>Tropical Disease Research: progress 2005&#8211;2006</source><publisher>Geneva: World Health Organization, Tropical Disease Research</publisher><pubdate>2007</pubdate><fpage>112</fpage></bibl><bibl id="B9"><aug><au><snm>Bruun</snm><fnm>B</fnm></au><au><snm>Aagaard-Hansen</snm><fnm>J</fnm></au></aug><source>The social context of schistosomiasis and its control: an introduction and annotated bibliography</source><publisher>Geneva: World Health Organization</publisher><pubdate>2008</pubdate></bibl><bibl id="B10"><title><p>Genetic analysis of praziquantel resistance in Schistosoma mansoni</p></title><aug><au><snm>Liang</snm><fnm>YS</fnm></au><au><snm>Dai</snm><fnm>JR</fnm></au><au><snm>Zhu</snm><fnm>YC</fnm></au><au><snm>Coles</snm><fnm>GC</fnm></au><au><snm>Doenhoff</snm><fnm>MJ</fnm></au></aug><source>Southeast Asian J Trop Med Public Health</source><pubdate>2003</pubdate><volume>34</volume><issue>2</issue><fpage>274</fpage><lpage>280</lpage></bibl><bibl id="B11"><title><p>Sex- and stage-related sensitivity of Schistosoma mansoni to in vivo and in vitro praziquantel treatment</p></title><aug><au><snm>Pica-Mattoccia</snm><fnm>L</fnm></au><au><snm>Cioli</snm><fnm>D</fnm></au></aug><source>Int J Parasitol</source><pubdate>2004</pubdate><volume>34</volume><issue>4</issue><fpage>527</fpage><lpage>533</lpage></bibl><bibl id="B12"><title><p>Praziquantel resistance</p></title><aug><au><snm>Botros</snm><fnm>SS</fnm></au><au><snm>Bennett</snm><fnm>JL</fnm></au></aug><source>Expert Opin Drug Discov</source><pubdate>2007</pubdate><volume>2</volume><fpage>S35</fpage><lpage>S40</lpage></bibl><bibl id="B13"><title><p>Reduced susceptibility to praziquantel among naturally occurring Kenyan isolates of Schistosoma mansoni</p></title><aug><au><snm>Melman</snm><fnm>SD</fnm></au><au><snm>Steinauer</snm><fnm>ML</fnm></au><au><snm>Cunningham</snm><fnm>C</fnm></au><au><snm>Kubatko</snm><fnm>LS</fnm></au><au><snm>Mwangi</snm><fnm>IN</fnm></au><au><snm>Wynn</snm><fnm>NB</fnm></au><au><snm>Mutuku</snm><fnm>MW</fnm></au><au><snm>Karanja</snm><fnm>DM</fnm></au><au><snm>Colley</snm><fnm>DG</fnm></au><au><snm>Black</snm><fnm>CL</fnm></au><au><snm>Secor</snm><fnm>WE</fnm></au><au><snm>Mkoji</snm><fnm>GM</fnm></au><au><snm>Loker</snm><fnm>ES</fnm></au></aug><source>PLoS Negl Trop Dis</source><pubdate>2009</pubdate><volume>3</volume><issue>8</issue><fpage>e504</fpage></bibl><bibl id="B14"><aug><au><snm>Rokni</snm><fnm>MB</fnm></au></aug><source>Schistosomiasis</source><publisher>InTech</publisher><pubdate>2002</pubdate></bibl><bibl id="B15"><title><p>Quantification of clinical morbidity associated with schistosome infection in sub-Saharan Africa</p></title><aug><au><snm>van der Werf</snm><fnm>MJ</fnm></au><au><snm>de Vlas</snm><fnm>SJ</fnm></au><au><snm>Brooker</snm><fnm>S</fnm></au><au><snm>Looman</snm><fnm>CW</fnm></au><au><snm>Nagelkerke</snm><fnm>NJ</fnm></au><au><snm>Habbema</snm><fnm>JD</fnm></au><au><snm>Engels</snm><fnm>D</fnm></au></aug><source>Acta Trop</source><pubdate>2003</pubdate><volume>86</volume><issue>2&#8211;3</issue><fpage>125</fpage><lpage>139</lpage></bibl><bibl id="B16"><title><p>A systematically improved high quality genome and transcriptome of the human blood fluke Schistosoma mansoni</p></title><aug><au><snm>Protasio</snm><fnm>AV</fnm></au><au><snm>Tsai</snm><fnm>IJ</fnm></au><au><snm>Babbage</snm><fnm>A</fnm></au><au><snm>Nichol</snm><fnm>S</fnm></au><au><snm>Hunt</snm><fnm>M</fnm></au><au><snm>Aslett</snm><fnm>MA</fnm></au><au><snm>De Silva</snm><fnm>N</fnm></au><au><snm>Velarde</snm><fnm>GS</fnm></au><au><snm>Anderson</snm><fnm>TJ</fnm></au><au><snm>Clark</snm><fnm>RC</fnm></au><au><snm>Davidson</snm><fnm>C</fnm></au><au><snm>Dillon</snm><fnm>GP</fnm></au><au><snm>Holroyd</snm><fnm>NE</fnm></au><au><snm>LoVerde</snm><fnm>PT</fnm></au><au><snm>Lloyd</snm><fnm>C</fnm></au><au><snm>McQuillan</snm><fnm>J</fnm></au><au><snm>Oliveira</snm><fnm>G</fnm></au><au><snm>Otto</snm><fnm>TD</fnm></au><au><snm>Parker-Manuel</snm><fnm>SJ</fnm></au><au><snm>Quail</snm><fnm>MA</fnm></au><au><snm>Wilson</snm><fnm>RA</fnm></au><au><snm>Zerlotini</snm><fnm>A</fnm></au><au><snm>Dunne</snm><fnm>DW</fnm></au><au><snm>Berriman</snm><fnm>M</fnm></au></aug><source>PLoS Negl Trop Dis</source><pubdate>2012</pubdate><volume>6</volume><issue>1</issue><fpage>e1455</fpage></bibl><bibl id="B17"><title><p>SchistoDB: a Schistosoma mansoni genome resource</p></title><aug><au><snm>Zerlotini</snm><fnm>A</fnm></au><au><snm>Heiges</snm><fnm>M</fnm></au><au><snm>Wang</snm><fnm>H</fnm></au><au><snm>Moraes</snm><fnm>RL</fnm></au><au><snm>Dominitini</snm><fnm>AJ</fnm></au><au><snm>Ruiz</snm><fnm>JC</fnm></au><au><snm>Kissinger</snm><fnm>JC</fnm></au><au><snm>Oliveira</snm><fnm>G</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2009</pubdate><volume>37</volume><issue>Database issue</issue><fpage>579</fpage><lpage>582</lpage></bibl><bibl id="B18"><title><p>Genomic linkage map of the human blood fluke Schistosoma mansoni</p></title><aug><au><snm>Criscione</snm><fnm>CD</fnm></au><au><snm>Valentim</snm><fnm>CL</fnm></au><au><snm>Hirai</snm><fnm>H</fnm></au><au><snm>LoVerde</snm><fnm>PT</fnm></au><au><snm>Anderson</snm><fnm>TJ</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><issue>6</issue><fpage>R71</fpage></bibl><bibl id="B19"><title><p>Schistosoma genomics: new perspectives on schistosome biology and host-parasite interaction</p></title><aug><au><snm>Han</snm><fnm>ZG</fnm></au><au><snm>Brindley</snm><fnm>PJ</fnm></au><au><snm>Wang</snm><fnm>SY</fnm></au><au><snm>Chen</snm><fnm>Z</fnm></au></aug><source>Annu Rev Genomics Hum Genet</source><pubdate>2009</pubdate><volume>10</volume><fpage>211</fpage><lpage>240</lpage></bibl><bibl id="B20"><title><p>Schistosomes&#8211;proteomics studies for potential novel vaccines and drug targets</p></title><aug><au><snm>DeMarco</snm><fnm>R</fnm></au><au><snm>Verjovski-Almeida</snm><fnm>S</fnm></au></aug><source>Drug Discov Today</source><pubdate>2009</pubdate><volume>14</volume><issue>9&#8211;10</issue><fpage>472</fpage><lpage>478</lpage></bibl><bibl id="B21"><title><p>Function prediction of uncharacterized proteins</p></title><aug><au><snm>Hawkins</snm><fnm>T</fnm></au><au><snm>Kihara</snm><fnm>D</fnm></au></aug><source>J Bioinform Comput Biol</source><pubdate>2007</pubdate><volume>5</volume><issue>1</issue><fpage>1</fpage><lpage>30</lpage></bibl><bibl id="B22"><title><p>Phylogenomics, Protein Family Evolution, and the Tree of Life: An Integrated Approach between Molecular Evolution and Computational Intelligence</p></title><aug><au><snm>Nahum</snm><fnm>LA</fnm></au><au><snm>Pereira</snm><fnm>SL</fnm></au></aug><source>Studies in Computational Intelligence (SCI)</source><publisher>Berlin Heidelberg: Springer-Verlag</publisher><editor>Smolinski TG, Milanova MG, Hassanien AE</editor><edition>vol. 122</edition><pubdate>2008</pubdate><fpage>259</fpage><lpage>279</lpage></bibl><bibl id="B23"><title><p>Protein function predictions based on the phylogenetic profile method</p></title><aug><au><snm>Jiang</snm><fnm>Z</fnm></au></aug><source>Crit Rev Biotechnol</source><pubdate>2008</pubdate><volume>28</volume><issue>4</issue><fpage>233</fpage><lpage>238</lpage></bibl><bibl id="B24"><title><p>Basic local alignment search tool</p></title><aug><au><snm>Altschul</snm><fnm>SF</fnm></au><au><snm>Gish</snm><fnm>W</fnm></au><au><snm>Miller</snm><fnm>W</fnm></au><au><snm>Myers</snm><fnm>EW</fnm></au><au><snm>Lipman</snm><fnm>DJ</fnm></au></aug><source>J Mol Biol</source><pubdate>1990</pubdate><volume>215</volume><issue>3</issue><fpage>403</fpage><lpage>410</lpage></bibl><bibl id="B25"><title><p>OrthoMCL: identification of ortholog groups for eukaryotic genomes</p></title><aug><au><snm>Li</snm><fnm>L</fnm></au><au><snm>Stoeckert</snm><fnm>CJ</fnm></au><au><snm>Roos</snm><fnm>DS</fnm></au></aug><source>Genome Res</source><pubdate>2003</pubdate><volume>13</volume><issue>9</issue><fpage>2178</fpage><lpage>2189</lpage></bibl><bibl id="B26"><title><p>InParanoid 7: new algorithms and tools for eukaryotic orthology analysis</p></title><aug><au><snm>Ostlund</snm><fnm>G</fnm></au><au><snm>Schmitt</snm><fnm>T</fnm></au><au><snm>Forslund</snm><fnm>K</fnm></au><au><snm>K&#246;stler</snm><fnm>T</fnm></au><au><snm>Messina</snm><fnm>DN</fnm></au><au><snm>Roopra</snm><fnm>S</fnm></au><au><snm>Frings</snm><fnm>O</fnm></au><au><snm>Sonnhammer</snm><fnm>EL</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>Database issue</issue><fpage>196</fpage><lpage>203</lpage></bibl><bibl id="B27"><title><p>Errors in genome annotation</p></title><aug><au><snm>Brenner</snm><fnm>SE</fnm></au></aug><source>Trends Genet</source><pubdate>1999</pubdate><volume>15</volume><issue>4</issue><fpage>132</fpage><lpage>133</lpage></bibl><bibl id="B28"><title><p>Functional classification using phylogenomic inference</p></title><aug><au><snm>Brown</snm><fnm>D</fnm></au><au><snm>Sj&#246;lander</snm><fnm>K</fnm></au></aug><source>PLoS Comput Biol</source><pubdate>2006</pubdate><volume>2</volume><issue>6</issue><fpage>e77</fpage></bibl><bibl id="B29"><title><p>Comparative genomics-based prediction of protein function</p></title><aug><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>Methods Mol Biol</source><pubdate>2008</pubdate><volume>439</volume><fpage>387</fpage><lpage>401</lpage></bibl><bibl id="B30"><title><p>Phylogenetic analysis and gene functional predictions: phylogenomics in action</p></title><aug><au><snm>Eisen</snm><fnm>JA</fnm></au><au><snm>Wu</snm><fnm>M</fnm></au></aug><source>Theor Popul Biol</source><pubdate>2002</pubdate><volume>61</volume><issue>4</issue><fpage>481</fpage><lpage>487</lpage></bibl><bibl id="B31"><title><p>Sources of systematic error in functional annotation of genomes: domain rearrangement, non-orthologous gene displacement and operon disruption</p></title><aug><au><snm>Galperin</snm><fnm>MY</fnm></au><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>In Silico Biol</source><pubdate>1998</pubdate><volume>1</volume><issue>1</issue><fpage>55</fpage><lpage>67</lpage></bibl><bibl id="B32"><title><p>The closest BLAST hit is often not the nearest neighbor</p></title><aug><au><snm>Koski</snm><fnm>LB</fnm></au><au><snm>Golding</snm><fnm>GB</fnm></au></aug><source>J Mol Evol</source><pubdate>2001</pubdate><volume>52</volume><issue>6</issue><fpage>540</fpage><lpage>542</lpage></bibl><bibl id="B33"><title><p>Eukaryotic Protein Kinases (ePKs) of the Helminth Parasite Schistosoma mansoni</p></title><aug><au><snm>Andrade</snm><fnm>LF</fnm></au><au><snm>Nahum</snm><fnm>LA</fnm></au><au><snm>Avelar</snm><fnm>LG</fnm></au><au><snm>Silva</snm><fnm>LL</fnm></au><au><snm>Zerlotini</snm><fnm>A</fnm></au><au><snm>Ruiz</snm><fnm>JC</fnm></au><au><snm>Oliveira</snm><fnm>G</fnm></au></aug><source>BMC Genomics</source><pubdate>2011</pubdate><volume>12</volume><fpage>215</fpage></bibl><bibl id="B34"><title><p>Protein families reflect the metabolic diversity of organisms and provide support for functional prediction</p></title><aug><au><snm>Nahum</snm><fnm>LA</fnm></au><au><snm>Goswami</snm><fnm>S</fnm></au><au><snm>Serres</snm><fnm>MH</fnm></au></aug><source>Physiol Genomics</source><pubdate>2009</pubdate><volume>38</volume><issue>3</issue><fpage>250</fpage><lpage>260</lpage></bibl><bibl id="B35"><title><p>Phylogenomics: improving functional predictions for uncharacterized genes by evolutionary analysis</p></title><aug><au><snm>Eisen</snm><fnm>JA</fnm></au></aug><source>Genome Res</source><pubdate>1998</pubdate><volume>8</volume><issue>3</issue><fpage>163</fpage><lpage>167</lpage></bibl><bibl id="B36"><title><p>Joining forces in the quest for orthologs</p></title><aug><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au><au><snm>Dessimoz</snm><fnm>C</fnm></au><au><snm>Huxley-Jones</snm><fnm>J</fnm></au><au><snm>Vilella</snm><fnm>AJ</fnm></au><au><snm>Sonnhammer</snm><fnm>EL</fnm></au><au><snm>Lewis</snm><fnm>S</fnm></au></aug><source>Genome Biol</source><pubdate>2009</pubdate><volume>10</volume><issue>9</issue><fpage>403</fpage></bibl><bibl id="B37"><title><p>A phylogenomic approach to microbial evolution</p></title><aug><au><snm>Sicheritz-Pont&#233;n</snm><fnm>T</fnm></au><au><snm>Andersson</snm><fnm>SG</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2001</pubdate><volume>29</volume><issue>2</issue><fpage>545</fpage><lpage>552</lpage></bibl><bibl id="B38"><title><p>The pea aphid phylome: a complete catalogue of evolutionary histories and arthropod orthology and paralogy relationships for Acyrthosiphon pisum genes.</p></title><aug><au><snm>Huerta-Cepas</snm><fnm>J</fnm></au><au><snm>Marcet-Houben</snm><fnm>M</fnm></au><au><snm>Pignatelli</snm><fnm>M</fnm></au><au><snm>Moya</snm><fnm>A</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>Insect Mol Biol</source><pubdate>2010</pubdate><volume>19</volume><issue>Suppl 2</issue><fpage>13</fpage><lpage>21</lpage></bibl><bibl id="B39"><title><p>The human phylome</p></title><aug><au><snm>Huerta-Cepas</snm><fnm>J</fnm></au><au><snm>Dopazo</snm><fnm>H</fnm></au><au><snm>Dopazo</snm><fnm>J</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>Genome Biol</source><pubdate>2007</pubdate><volume>8</volume><issue>6</issue><fpage>R109</fpage></bibl><bibl id="B40"><title><p>MUSCLE: a multiple sequence alignment method with reduced time and space complexity</p></title><aug><au><snm>Edgar</snm><fnm>RC</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2004</pubdate><volume>5</volume><fpage>113</fpage></bibl><bibl id="B41"><title><p>Recent developments in the MAFFT multiple sequence alignment program</p></title><aug><au><snm>Katoh</snm><fnm>K</fnm></au><au><snm>Toh</snm><fnm>H</fnm></au></aug><source>Brief Bioinform</source><pubdate>2008</pubdate><volume>9</volume><issue>4</issue><fpage>286</fpage><lpage>298</lpage></bibl><bibl id="B42"><title><p>DIALIGN-TX: greedy and progressive approaches for segment-based multiple sequence alignment</p></title><aug><au><snm>Subramanian</snm><fnm>AR</fnm></au><au><snm>Kaufmann</snm><fnm>M</fnm></au><au><snm>Morgenstern</snm><fnm>B</fnm></au></aug><source>Algorithms Mol Biol</source><pubdate>2008</pubdate><volume>3</volume><fpage>6</fpage></bibl><bibl id="B43"><title><p>M-Coffee: combining multiple sequence alignment methods with T-Coffee</p></title><aug><au><snm>Wallace</snm><fnm>IM</fnm></au><au><snm>O&apos;Sullivan</snm><fnm>O</fnm></au><au><snm>Higgins</snm><fnm>DG</fnm></au><au><snm>Notredame</snm><fnm>C</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2006</pubdate><volume>34</volume><issue>6</issue><fpage>1692</fpage><lpage>1699</lpage></bibl><bibl id="B44"><title><p>trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses</p></title><aug><au><snm>Capella-Guti&#233;rrez</snm><fnm>S</fnm></au><au><snm>Silla-Mart&#237;nez</snm><fnm>JM</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>Bioinformatics</source><pubdate>2009</pubdate><volume>25</volume><issue>15</issue><fpage>1972</fpage><lpage>1973</lpage></bibl><bibl id="B45"><title><p>BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data</p></title><aug><au><snm>Gascuel</snm><fnm>O</fnm></au></aug><source>Mol Biol Evol</source><pubdate>1997</pubdate><volume>14</volume><issue>7</issue><fpage>685</fpage><lpage>695</lpage></bibl><bibl id="B46"><title><p>A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood</p></title><aug><au><snm>Guindon</snm><fnm>S</fnm></au><au><snm>Gascuel</snm><fnm>O</fnm></au></aug><source>Syst Biol</source><pubdate>2003</pubdate><volume>52</volume><issue>5</issue><fpage>696</fpage><lpage>704</lpage></bibl><bibl id="B47"><title><p>PhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions. </p></title><aug><au><snm>Huerta-Cepas</snm><fnm>J</fnm></au><au><snm>Capella-Gutierrez</snm><fnm>S</fnm></au><au><snm>Pryszcz</snm><fnm>LP</fnm></au><au><snm>Denisov</snm><fnm>I</fnm></au><au><snm>Kormes</snm><fnm>D</fnm></au><au><snm>Marcet-Houben</snm><fnm>M</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><issue>Database issue</issue><fpage>556</fpage><lpage>560</lpage></bibl><bibl id="B48"><title><p>Gene ontology: tool for the unification of biology. The Gene Ontology Consortium</p></title><aug><au><snm>Ashburner</snm><fnm>M</fnm></au><au><snm>Ball</snm><fnm>CA</fnm></au><au><snm>Blake</snm><fnm>JA</fnm></au><au><snm>Botstein</snm><fnm>D</fnm></au><au><snm>Butler</snm><fnm>H</fnm></au><au><snm>Cherry</snm><fnm>JM</fnm></au><au><snm>Davis</snm><fnm>AP</fnm></au><au><snm>Dolinski</snm><fnm>K</fnm></au><au><snm>Dwight</snm><fnm>SS</fnm></au><au><snm>Eppig</snm><fnm>JT</fnm></au><au><snm>Harris</snm><fnm>MA</fnm></au><au><snm>Hill</snm><fnm>DP</fnm></au><au><snm>Issel-Tarver</snm><fnm>L</fnm></au><au><snm>Kasarskis</snm><fnm>A</fnm></au><au><snm>Lewis</snm><fnm>S</fnm></au><au><snm>Matese</snm><fnm>JC</fnm></au><au><snm>Richardson</snm><fnm>JE</fnm></au><au><snm>Ringwald</snm><fnm>M</fnm></au><au><snm>Rubin</snm><fnm>GM</fnm></au><au><snm>Sherlock</snm><fnm>G</fnm></au></aug><source>Nat Genet</source><pubdate>2000</pubdate><volume>25</volume><issue>1</issue><fpage>25</fpage><lpage>29</lpage></bibl><bibl id="B49"><title><p>Ongoing and future developments at the Universal Protein Resource</p></title><aug><au><snm>Apweiler</snm><fnm>R</fnm></au><au><snm>Martin</snm><fnm>MJ</fnm></au><au><snm>O&apos;Donovan</snm><fnm>C</fnm></au><au><snm>Magrane</snm><fnm>M</fnm></au><au><snm>Alam-Faruque</snm><fnm>Y</fnm></au><au><snm>Antunes</snm><fnm>R</fnm></au><au><snm>Barrell</snm><fnm>D</fnm></au><au><snm>Bely</snm><fnm>B</fnm></au><etal/></aug><source>Nucleic Acids Res</source><pubdate>2011</pubdate><volume>39</volume><issue>Database issue</issue><fpage>214</fpage><lpage>219</lpage></bibl><bibl id="B50"><title><p>Domain architecture conservation in orthologs</p></title><aug><au><snm>Forslund</snm><fnm>K</fnm></au><au><snm>Pekkari</snm><fnm>I</fnm></au><au><snm>Sonnhammer</snm><fnm>EL</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2011</pubdate><volume>12</volume><fpage>326</fpage></bibl><bibl id="B51"><title><p>Orthologs, paralogs, and evolutionary genomics</p></title><aug><au><snm>Koonin</snm><fnm>EV</fnm></au></aug><source>Annu Rev Genet</source><pubdate>2005</pubdate><volume>39</volume><fpage>309</fpage><lpage>338</lpage></bibl><bibl id="B52"><title><p>The quest for orthologs: finding the corresponding gene across genomes</p></title><aug><au><snm>Kuzniar</snm><fnm>A</fnm></au><au><snm>van Ham</snm><fnm>RC</fnm></au><au><snm>Pongor</snm><fnm>S</fnm></au><au><snm>Leunissen</snm><fnm>JA</fnm></au></aug><source>Trends Genet</source><pubdate>2008</pubdate><volume>24</volume><issue>11</issue><fpage>539</fpage><lpage>551</lpage></bibl><bibl id="B53"><title><p>Can sequence determine function?</p></title><aug><au><snm>Gerlt</snm><fnm>JA</fnm></au><au><snm>Babbitt</snm><fnm>PC</fnm></au></aug><source>Genome Biol</source><pubdate>2000</pubdate><volume>1</volume><issue>5</issue><fpage>reviews0005.0001</fpage><lpage>reviews0005.0010</lpage></bibl><bibl id="B54"><aug><au><snm>Ohno</snm><fnm>S</fnm></au></aug><source>Evolution by gene duplication</source><publisher>New York Heidelberg Berlin: Springer-Verlag</publisher><edition>1</edition><pubdate>1970</pubdate></bibl><bibl id="B55"><title><p>Assigning duplication events to relative temporal scales in genome-wide studies</p></title><aug><au><snm>Huerta-Cepas</snm><fnm>J</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>Bioinformatics</source><pubdate>2011</pubdate><volume>27</volume><issue>1</issue><fpage>38</fpage><lpage>45</lpage></bibl><bibl id="B56"><title><p>Genomes and geography: genomic insights into the evolution and phylogeography of the genus Schistosoma</p></title><aug><au><snm>Lawton</snm><fnm>SP</fnm></au><au><snm>Hirai</snm><fnm>H</fnm></au><au><snm>Ironside</snm><fnm>JE</fnm></au><au><snm>Johnston</snm><fnm>DA</fnm></au><au><snm>Rollinson</snm><fnm>D</fnm></au></aug><source>Parasit Vectors</source><pubdate>2011</pubdate><volume>4</volume><fpage>131</fpage></bibl><bibl id="B57"><title><p>A revision of the interrelationships of Schistosoma including the recently described Schistosoma guineensis</p></title><aug><au><snm>Webster</snm><fnm>BL</fnm></au><au><snm>Southgate</snm><fnm>VR</fnm></au><au><snm>Littlewood</snm><fnm>DT</fnm></au></aug><source>Int J Parasitol</source><pubdate>2006</pubdate><volume>36</volume><issue>8</issue><fpage>947</fpage><lpage>955</lpage></bibl><bibl id="B58"><title><p>The complete mitochondrial genomes of Schistosoma haematobium and Schistosoma spindale and the evolutionary history of mitochondrial genome changes among parasitic flatworms</p></title><aug><au><snm>Littlewood</snm><fnm>DT</fnm></au><au><snm>Lockyer</snm><fnm>AE</fnm></au><au><snm>Webster</snm><fnm>BL</fnm></au><au><snm>Johnston</snm><fnm>DA</fnm></au><au><snm>Le</snm><fnm>TH</fnm></au></aug><source>Molecular phylogenetics and evolution</source><pubdate>2006</pubdate><volume>39</volume><issue>2</issue><fpage>452</fpage><lpage>467</lpage></bibl><bibl id="B59"><title><p>Schistosoma comparative genomics: integrating genome structure, parasite biology and anthelmintic discovery</p></title><aug><au><snm>Swain</snm><fnm>MT</fnm></au><au><snm>Larkin</snm><fnm>DM</fnm></au><au><snm>Caffrey</snm><fnm>CR</fnm></au><au><snm>Davies</snm><fnm>SJ</fnm></au><au><snm>Loukas</snm><fnm>A</fnm></au><au><snm>Skelly</snm><fnm>PJ</fnm></au><au><snm>Hoffmann</snm><fnm>KF</fnm></au></aug><source>Trends Parasitol</source><pubdate>2011</pubdate><volume>27</volume><issue>12</issue><fpage>555</fpage><lpage>564</lpage></bibl><bibl id="B60"><title><p>The Pfam protein families database</p></title><aug><au><snm>Finn</snm><fnm>RD</fnm></au><au><snm>Mistry</snm><fnm>J</fnm></au><au><snm>Tate</snm><fnm>J</fnm></au><au><snm>Coggill</snm><fnm>P</fnm></au><au><snm>Heger</snm><fnm>A</fnm></au><au><snm>Pollington</snm><fnm>JE</fnm></au><au><snm>Gavin</snm><fnm>OL</fnm></au><au><snm>Gunasekaran</snm><fnm>P</fnm></au><au><snm>Ceric</snm><fnm>G</fnm></au><au><snm>Forslund</snm><fnm>K</fnm></au><au><snm>Holm</snm><fnm>L</fnm></au><au><snm>Sonnhammer</snm><fnm>EL</fnm></au><au><snm>Eddy</snm><fnm>SR</fnm></au><au><snm>Bateman</snm><fnm>A</fnm></au></aug><source>Nucleic Acids Res</source><pubdate>2010</pubdate><volume>38</volume><issue>Database issue</issue><fpage>211</fpage><lpage>222</lpage></bibl><bibl id="B61"><title><p>Tetraspanins on the surface of Schistosoma mansoni are protective antigens against schistosomiasis</p></title><aug><au><snm>Tran</snm><fnm>MH</fnm></au><au><snm>Pearson</snm><fnm>MS</fnm></au><au><snm>Bethony</snm><fnm>JM</fnm></au><au><snm>Smyth</snm><fnm>DJ</fnm></au><au><snm>Jones</snm><fnm>MK</fnm></au><au><snm>Duke</snm><fnm>M</fnm></au><au><snm>Don</snm><fnm>TA</fnm></au><au><snm>McManus</snm><fnm>DP</fnm></au><au><snm>Correa-Oliveira</snm><fnm>R</fnm></au><au><snm>Loukas</snm><fnm>A</fnm></au></aug><source>Nat Med</source><pubdate>2006</pubdate><volume>12</volume><issue>7</issue><fpage>835</fpage><lpage>840</lpage></bibl><bibl id="B62"><title><p>Current status of vaccines for schistosomiasis</p></title><aug><au><snm>McManus</snm><fnm>DP</fnm></au><au><snm>Loukas</snm><fnm>A</fnm></au></aug><source>Clin Microbiol Rev</source><pubdate>2008</pubdate><volume>21</volume><issue>1</issue><fpage>225</fpage><lpage>242</lpage></bibl><bibl id="B63"><title><p>Anti-schistosomal intervention targets identified by lifecycle transcriptomic analyses</p></title><aug><au><snm>Fitzpatrick</snm><fnm>JM</fnm></au><au><snm>Peak</snm><fnm>E</fnm></au><au><snm>Perally</snm><fnm>S</fnm></au><au><snm>Chalmers</snm><fnm>IW</fnm></au><au><snm>Barrett</snm><fnm>J</fnm></au><au><snm>Yoshino</snm><fnm>TP</fnm></au><au><snm>Ivens</snm><fnm>AC</fnm></au><au><snm>Hoffmann</snm><fnm>KF</fnm></au></aug><source>PLoS Negl Trop Dis</source><pubdate>2009</pubdate><volume>3</volume><issue>11</issue><fpage>e543</fpage></bibl><bibl id="B64"><title><p>The Schistosoma mansoni tegumental-allergen-like (TAL) protein family: influence of developmental expression on human IgE responses</p></title><aug><au><snm>Fitzsimmons</snm><fnm>CM</fnm></au><au><snm>Jones</snm><fnm>FM</fnm></au><au><snm>Stearn</snm><fnm>A</fnm></au><au><snm>Chalmers</snm><fnm>IW</fnm></au><au><snm>Hoffmann</snm><fnm>KF</fnm></au><au><snm>Wawrzyniak</snm><fnm>J</fnm></au><au><snm>Wilson</snm><fnm>S</fnm></au><au><snm>Kabatereine</snm><fnm>NB</fnm></au><au><snm>Dunne</snm><fnm>DW</fnm></au></aug><source>PLoS Negl Trop Dis</source><pubdate>2012</pubdate><volume>6</volume><issue>4</issue><fpage>e1593</fpage></bibl><bibl id="B65"><aug><au><snm>Silva</snm><fnm>LL</fnm></au><au><snm>Marcet-Houben</snm><fnm>M</fnm></au><au><snm>Zerlotini</snm><fnm>A</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au><au><snm>Oliveira</snm><fnm>G</fnm></au><au><snm>Nahum</snm><fnm>LA</fnm></au></aug><source>Evolutionary Histories of Expanded Peptidase Families in Schistosoma mansoni</source><publisher>in press. Mem Inst Oswaldo Cruz</publisher><pubdate>2011</pubdate></bibl><bibl id="B66"><title><p>Protein-protein interactions in the tetraspanin web</p></title><aug><au><snm>Levy</snm><fnm>S</fnm></au><au><snm>Shoham</snm><fnm>T</fnm></au></aug><source>Physiology (Bethesda)</source><pubdate>2005</pubdate><volume>20</volume><fpage>218</fpage><lpage>224</lpage></bibl><bibl id="B67"><title><p>Tetraspanin-enriched microdomains: a functional unit in cell plasma membranes</p></title><aug><au><snm>Y&#225;&#241;ez-M&#243;</snm><fnm>M</fnm></au><au><snm>Barreiro</snm><fnm>O</fnm></au><au><snm>Gordon-Alonso</snm><fnm>M</fnm></au><au><snm>Sala-Vald&#233;s</snm><fnm>M</fnm></au><au><snm>S&#225;nchez-Madrid</snm><fnm>F</fnm></au></aug><source>Trends Cell Biol</source><pubdate>2009</pubdate><volume>19</volume><issue>9</issue><fpage>434</fpage><lpage>446</lpage></bibl><bibl id="B68"><title><p>Suppression of mRNAs encoding tegument tetraspanins from Schistosoma mansoni results in impaired tegument turnover</p></title><aug><au><snm>Tran</snm><fnm>MH</fnm></au><au><snm>Freitas</snm><fnm>TC</fnm></au><au><snm>Cooper</snm><fnm>L</fnm></au><au><snm>Gaze</snm><fnm>S</fnm></au><au><snm>Gatton</snm><fnm>ML</fnm></au><au><snm>Jones</snm><fnm>MK</fnm></au><au><snm>Lovas</snm><fnm>E</fnm></au><au><snm>Pearce</snm><fnm>EJ</fnm></au><au><snm>Loukas</snm><fnm>A</fnm></au></aug><source>PLoS Pathog</source><pubdate>2010</pubdate><volume>6</volume><issue>4</issue><fpage>e1000840</fpage></bibl><bibl id="B69"><title><p>Schistosoma mansoni: characterization of the gene encoding Sm23, an integral membrane protein</p></title><aug><au><snm>Lee</snm><fnm>KW</fnm></au><au><snm>Shalaby</snm><fnm>KA</fnm></au><au><snm>Medhat</snm><fnm>AM</fnm></au><au><snm>Shi</snm><fnm>H</fnm></au><au><snm>Yang</snm><fnm>Q</fnm></au><au><snm>Karim</snm><fnm>AM</fnm></au><au><snm>LoVerde</snm><fnm>PT</fnm></au></aug><source>Exp Parasitol</source><pubdate>1995</pubdate><volume>80</volume><issue>1</issue><fpage>155</fpage><lpage>158</lpage></bibl><bibl id="B70"><title><p>Sequence and immunogenicity of the 23-kDa transmembrane antigen of Schistosoma haematobium</p></title><aug><au><snm>Inal</snm><fnm>J</fnm></au><au><snm>Bickle</snm><fnm>Q</fnm></au></aug><source>Mol Biochem Parasitol</source><pubdate>1995</pubdate><volume>74</volume><issue>2</issue><fpage>217</fpage><lpage>221</lpage></bibl><bibl id="B71"><title><p>Developmentally regulated expression, alternative splicing and distinct sub-groupings in members of the Schistosoma mansoni venom allergen-like (SmVAL) gene family</p></title><aug><au><snm>Chalmers</snm><fnm>IW</fnm></au><au><snm>McArdle</snm><fnm>AJ</fnm></au><au><snm>Coulson</snm><fnm>RM</fnm></au><au><snm>Wagner</snm><fnm>MA</fnm></au><au><snm>Schmid</snm><fnm>R</fnm></au><au><snm>Hirai</snm><fnm>H</fnm></au><au><snm>Hoffmann</snm><fnm>KF</fnm></au></aug><source>BMC Genomics</source><pubdate>2008</pubdate><volume>9</volume><fpage>89</fpage></bibl><bibl id="B72"><title><p>Identification of novel proteases and immunomodulators in the secretions of schistosome cercariae that facilitate host entry</p></title><aug><au><snm>Curwen</snm><fnm>RS</fnm></au><au><snm>Ashton</snm><fnm>PD</fnm></au><au><snm>Sundaralingam</snm><fnm>S</fnm></au><au><snm>Wilson</snm><fnm>RA</fnm></au></aug><source>Mol Cell Proteomics</source><pubdate>2006</pubdate><volume>5</volume><issue>5</issue><fpage>835</fpage><lpage>844</lpage></bibl><bibl id="B73"><title><p>Fucosylation in prokaryotes and eukaryotes</p></title><aug><au><snm>Ma</snm><fnm>B</fnm></au><au><snm>Simala-Grant</snm><fnm>JL</fnm></au><au><snm>Taylor</snm><fnm>DE</fnm></au></aug><source>Glycobiology</source><pubdate>2006</pubdate><volume>16</volume><issue>12</issue><fpage>158R</fpage><lpage>184R</lpage></bibl><bibl id="B74"><title><p>Fucosyltransferases in Schistosoma mansoni development</p></title><aug><au><snm>Marques</snm><fnm>ET</fnm><suf>Jr</suf></au><au><snm>Ichikawa</snm><fnm>Y</fnm></au><au><snm>Strand</snm><fnm>M</fnm></au><au><snm>August</snm><fnm>JT</fnm></au><au><snm>Hart</snm><fnm>GW</fnm></au><au><snm>Schnaar</snm><fnm>RL</fnm></au></aug><source>Glycobiology</source><pubdate>2001</pubdate><volume>11</volume><issue>3</issue><fpage>249</fpage><lpage>259</lpage></bibl><bibl id="B75"><title><p>An analysis of the calcium-binding protein 1 of Fasciola gigantica with a comparison to its homologs in the phylum Platyhelminthes</p></title><aug><au><snm>Vichasri-Grams</snm><fnm>S</fnm></au><au><snm>Subpipattana</snm><fnm>P</fnm></au><au><snm>Sobhon</snm><fnm>P</fnm></au><au><snm>Viyanant</snm><fnm>V</fnm></au><au><snm>Grams</snm><fnm>R</fnm></au></aug><source>Mol Biochem Parasitol</source><pubdate>2006</pubdate><volume>146</volume><issue>1</issue><fpage>10</fpage><lpage>23</lpage></bibl><bibl id="B76"><title><p>Immunity after treatment of human schistosomiasis: association between IgE antibodies to adult worm antigens and resistance to reinfection</p></title><aug><au><snm>Dunne</snm><fnm>DW</fnm></au><au><snm>Butterworth</snm><fnm>AE</fnm></au><au><snm>Fulford</snm><fnm>AJ</fnm></au><au><snm>Kariuki</snm><fnm>HC</fnm></au><au><snm>Langley</snm><fnm>JG</fnm></au><au><snm>Ouma</snm><fnm>JH</fnm></au><au><snm>Capron</snm><fnm>A</fnm></au><au><snm>Pierce</snm><fnm>RJ</fnm></au><au><snm>Sturrock</snm><fnm>RF</fnm></au></aug><source>Eur J Immunol</source><pubdate>1992</pubdate><volume>22</volume><issue>6</issue><fpage>1483</fpage><lpage>1494</lpage></bibl><bibl id="B77"><title><p>Cercarial elastase is encoded by a functionally conserved gene family across multiple species of schistosomes</p></title><aug><au><snm>Salter</snm><fnm>JP</fnm></au><au><snm>Choe</snm><fnm>Y</fnm></au><au><snm>Albrecht</snm><fnm>H</fnm></au><au><snm>Franklin</snm><fnm>C</fnm></au><au><snm>Lim</snm><fnm>KC</fnm></au><au><snm>Craik</snm><fnm>CS</fnm></au><au><snm>McKerrow</snm><fnm>JH</fnm></au></aug><source>J Biol Chem</source><pubdate>2002</pubdate><volume>277</volume><issue>27</issue><fpage>24618</fpage><lpage>24624</lpage></bibl><bibl id="B78"><title><p>Proteases from Schistosoma mansoni cercariae cleave IgE at solvent exposed interdomain regions</p></title><aug><au><snm>Aslam</snm><fnm>A</fnm></au><au><snm>Quinn</snm><fnm>P</fnm></au><au><snm>McIntosh</snm><fnm>RS</fnm></au><au><snm>Shi</snm><fnm>J</fnm></au><au><snm>Ghumra</snm><fnm>A</fnm></au><au><snm>McKerrow</snm><fnm>JH</fnm></au><au><snm>Bunting</snm><fnm>KA</fnm></au><au><snm>Dunne</snm><fnm>DW</fnm></au><au><snm>Doenhoff</snm><fnm>MJ</fnm></au><au><snm>Morrison</snm><fnm>SL</fnm></au><au><snm>Zhang</snm><fnm>K</fnm></au><au><snm>Pleass</snm><fnm>RJ</fnm></au></aug><source>Mol Immunol</source><pubdate>2008</pubdate><volume>45</volume><issue>2</issue><fpage>567</fpage><lpage>574</lpage></bibl><bibl id="B79"><title><p>Schistosoma mansoni: changes in the outer membrane of the tegument during development from cercaria to adult worm</p></title><aug><au><snm>Hockley</snm><fnm>DJ</fnm></au><au><snm>McLaren</snm><fnm>DJ</fnm></au></aug><source>Int J Parasitol</source><pubdate>1973</pubdate><volume>3</volume><issue>1</issue><fpage>13</fpage><lpage>25</lpage></bibl><bibl id="B80"><title><p>Mechanisms of immune evasion in schistosomiasis</p></title><aug><au><snm>Pearce</snm><fnm>EJ</fnm></au><au><snm>Sher</snm><fnm>A</fnm></au></aug><source>Contrib Microbiol Immunol</source><pubdate>1987</pubdate><volume>8</volume><fpage>219</fpage><lpage>232</lpage></bibl><bibl id="B81"><title><p>Proteomic analysis of the schistosome tegument and its surface membranes</p></title><aug><au><snm>Braschi</snm><fnm>S</fnm></au><au><snm>Borges</snm><fnm>WC</fnm></au><au><snm>Wilson</snm><fnm>RA</fnm></au></aug><source>Mem Inst Oswaldo Cruz</source><pubdate>2006</pubdate><volume>101</volume><issue>Suppl 1</issue><fpage>205</fpage><lpage>212</lpage></bibl><bibl id="B82"><title><p>Blood flukes have a double outer membrane</p></title><aug><au><snm>Mclaren</snm><fnm>DJ</fnm></au><au><snm>Hockley</snm><fnm>DJ</fnm></au></aug><source>Nature</source><pubdate>1977</pubdate><volume>269</volume><issue>5624</issue><fpage>147</fpage><lpage>149</lpage></bibl><bibl id="B83"><title><p>Parasite regulation by host hormones: an old mechanism of host exploitation?</p></title><aug><au><snm>Escobedo</snm><fnm>G</fnm></au><au><snm>Roberts</snm><fnm>CW</fnm></au><au><snm>Carrero</snm><fnm>JC</fnm></au><au><snm>Morales-Montor</snm><fnm>J</fnm></au></aug><source>Trends Parasitol</source><pubdate>2005</pubdate><volume>21</volume><issue>12</issue><fpage>588</fpage><lpage>593</lpage></bibl><bibl id="B84"><title><p>Annexins&#8211;unique membrane binding proteins with diverse functions</p></title><aug><au><snm>Rescher</snm><fnm>U</fnm></au><au><snm>Gerke</snm><fnm>V</fnm></au></aug><source>J Cell Sci</source><pubdate>2004</pubdate><volume>117</volume><issue>Pt 13</issue><fpage>2631</fpage><lpage>2639</lpage></bibl><bibl id="B85"><title><p>Schistosoma mansoni Annexin 2: molecular characterization and immunolocalization</p></title><aug><au><snm>Tararam</snm><fnm>CA</fnm></au><au><snm>Farias</snm><fnm>LP</fnm></au><au><snm>Wilson</snm><fnm>RA</fnm></au><au><snm>Leite</snm><fnm>LC</fnm></au></aug><source>Exp Parasitol</source><pubdate>2010</pubdate><volume>126</volume><issue>2</issue><fpage>146</fpage><lpage>155</lpage></bibl><bibl id="B86"><title><p>Parasite annexins&#8211;new molecules with potential for drug and vaccine development</p></title><aug><au><snm>Hofmann</snm><fnm>A</fnm></au><au><snm>Osman</snm><fnm>A</fnm></au><au><snm>Leow</snm><fnm>CY</fnm></au><au><snm>Driguez</snm><fnm>P</fnm></au><au><snm>McManus</snm><fnm>DP</fnm></au><au><snm>Jones</snm><fnm>MK</fnm></au></aug><source>BioEssays</source><pubdate>2010</pubdate><volume>32</volume><issue>11</issue><fpage>967</fpage><lpage>976</lpage></bibl><bibl id="B87"><title><p>Cadherins: a molecular family important in selective cell-cell adhesion</p></title><aug><au><snm>Takeichi</snm><fnm>M</fnm></au></aug><source>Annu Rev Biochem</source><pubdate>1990</pubdate><volume>59</volume><fpage>237</fpage><lpage>252</lpage></bibl><bibl id="B88"><title><p>Innexins: a family of invertebrate gap-junction proteins</p></title><aug><au><snm>Phelan</snm><fnm>P</fnm></au><au><snm>Bacon</snm><fnm>JP</fnm></au><au><snm>Davies</snm><fnm>JA</fnm></au><au><snm>Stebbings</snm><fnm>LA</fnm></au><au><snm>Todman</snm><fnm>MG</fnm></au><au><snm>Avery</snm><fnm>L</fnm></au><au><snm>Baines</snm><fnm>RA</fnm></au><au><snm>Barnes</snm><fnm>TM</fnm></au><au><snm>Ford</snm><fnm>C</fnm></au><au><snm>Hekimi</snm><fnm>S</fnm></au><au><snm>Lee</snm><fnm>R</fnm></au><au><snm>Shaw</snm><fnm>JE</fnm></au><au><snm>Starich</snm><fnm>TA</fnm></au><au><snm>Curtin</snm><fnm>KD</fnm></au><au><snm>Sun</snm><fnm>YA</fnm></au><au><snm>Wyman</snm><fnm>RJ</fnm></au></aug><source>Trends Genet</source><pubdate>1998</pubdate><volume>14</volume><issue>9</issue><fpage>348</fpage><lpage>349</lpage></bibl><bibl id="B89"><title><p>Caenorhabditis elegans innexins regulate active zone differentiation</p></title><aug><au><snm>Yeh</snm><fnm>E</fnm></au><au><snm>Kawano</snm><fnm>T</fnm></au><au><snm>Ng</snm><fnm>S</fnm></au><au><snm>Fetter</snm><fnm>R</fnm></au><au><snm>Hung</snm><fnm>W</fnm></au><au><snm>Wang</snm><fnm>Y</fnm></au><au><snm>Zhen</snm><fnm>M</fnm></au></aug><source>J Neurosci</source><pubdate>2009</pubdate><volume>29</volume><issue>16</issue><fpage>5207</fpage><lpage>5217</lpage></bibl><bibl id="B90"><title><p>Evolution by gene duplication: an update</p></title><aug><au><snm>Zhang</snm><fnm>J</fnm></au></aug><source>Trends Ecol Evol</source><pubdate>2003</pubdate><volume>18</volume><issue>6</issue><fpage>292</fpage><lpage>298</lpage></bibl><bibl id="B91"><title><p>Comparative genomics of the eukaryotes</p></title><aug><au><snm>Rubin</snm><fnm>GM</fnm></au><au><snm>Yandell</snm><fnm>MD</fnm></au><au><snm>Wortman</snm><fnm>JR</fnm></au><au><snm>Gabor Miklos</snm><fnm>GL</fnm></au><au><snm>Nelson</snm><fnm>CR</fnm></au><au><snm>Hariharan</snm><fnm>IK</fnm></au><au><snm>Fortini</snm><fnm>ME</fnm></au><au><snm>Li</snm><fnm>PW</fnm></au><au><snm>Apweiler</snm><fnm>R</fnm></au><au><snm>Fleischmann</snm><fnm>W</fnm></au><au><snm>Cherry</snm><fnm>JM</fnm></au><au><snm>Henikoff</snm><fnm>S</fnm></au><au><snm>Skupski</snm><fnm>MP</fnm></au><au><snm>Misra</snm><fnm>S</fnm></au><au><snm>Ashburner</snm><fnm>M</fnm></au><au><snm>Birney</snm><fnm>E</fnm></au><au><snm>Boguski</snm><fnm>MS</fnm></au><au><snm>Brody</snm><fnm>T</fnm></au><au><snm>Brokstein</snm><fnm>P</fnm></au><au><snm>Celniker</snm><fnm>SE</fnm></au><au><snm>Chervitz</snm><fnm>SA</fnm></au><au><snm>Coates</snm><fnm>D</fnm></au><au><snm>Cravchik</snm><fnm>A</fnm></au><au><snm>Gabrielian</snm><fnm>A</fnm></au><au><snm>Galle</snm><fnm>RF</fnm></au><au><snm>Gelbart</snm><fnm>WM</fnm></au><au><snm>George</snm><fnm>RA</fnm></au><au><snm>Goldstein</snm><fnm>LS</fnm></au><au><snm>Gong</snm><fnm>F</fnm></au><au><snm>Guan</snm><fnm>P</fnm></au><au><snm>Harris</snm><fnm>NL</fnm></au><au><snm>Hay</snm><fnm>BA</fnm></au><au><snm>Hoskins</snm><fnm>RA</fnm></au><au><snm>Li</snm><fnm>J</fnm></au><au><snm>Li</snm><fnm>Z</fnm></au><au><snm>Hynes</snm><fnm>RO</fnm></au><au><snm>Jones</snm><fnm>SJ</fnm></au><au><snm>Kuehl</snm><fnm>PM</fnm></au><au><snm>Lemaitre</snm><fnm>B</fnm></au><au><snm>Littleton</snm><fnm>JT</fnm></au><au><snm>Morrison</snm><fnm>DK</fnm></au><au><snm>Mungall</snm><fnm>C</fnm></au><au><snm>O&apos;Farrell</snm><fnm>PH</fnm></au><au><snm>Pickeral</snm><fnm>OK</fnm></au><au><snm>Shue</snm><fnm>C</fnm></au><au><snm>Vosshall</snm><fnm>LB</fnm></au><au><snm>Zhang</snm><fnm>J</fnm></au><au><snm>Zhao</snm><fnm>Q</fnm></au><au><snm>Zheng</snm><fnm>XH</fnm></au><au><snm>Lewis</snm><fnm>S</fnm></au></aug><source>Science</source><pubdate>2000</pubdate><volume>287</volume><issue>5461</issue><fpage>2204</fpage><lpage>2215</lpage></bibl><bibl id="B92"><title><p>Is host-schistosome coevolution going anywhere?</p></title><aug><au><snm>Webster</snm><fnm>JP</fnm></au><au><snm>Shrivastava</snm><fnm>J</fnm></au><au><snm>Johnson</snm><fnm>PJ</fnm></au><au><snm>Blair</snm><fnm>L</fnm></au></aug><source>BMC Evol Biol</source><pubdate>2007</pubdate><volume>7</volume><fpage>91</fpage></bibl><bibl id="B93"><title><p>Duplicated paralogous genes subject to positive selection in the genome of Trypanosoma brucei</p></title><aug><au><snm>Emes</snm><fnm>RD</fnm></au><au><snm>Yang</snm><fnm>Z</fnm></au></aug><source>PLoS One</source><pubdate>2008</pubdate><volume>3</volume><issue>5</issue><fpage>e2295</fpage></bibl><bibl id="B94"><title><p>Protein variation in blood-dwelling schistosome worms generated by differential splicing of micro-exon gene transcripts</p></title><aug><au><snm>DeMarco</snm><fnm>R</fnm></au><au><snm>Mathieson</snm><fnm>W</fnm></au><au><snm>Manuel</snm><fnm>SJ</fnm></au><au><snm>Dillon</snm><fnm>GP</fnm></au><au><snm>Curwen</snm><fnm>RS</fnm></au><au><snm>Ashton</snm><fnm>PD</fnm></au><au><snm>Ivens</snm><fnm>AC</fnm></au><au><snm>Berriman</snm><fnm>M</fnm></au><au><snm>Verjovski-Almeida</snm><fnm>S</fnm></au><au><snm>Wilson</snm><fnm>RA</fnm></au></aug><source>Genome Res</source><pubdate>2010</pubdate><volume>20</volume><issue>8</issue><fpage>1112</fpage><lpage>1121</lpage></bibl><bibl id="B95"><title><p>Non-coding RNAs in schistosomes: an unexplored world</p></title><aug><au><snm>Oliveira</snm><fnm>KC</fnm></au><au><snm>Carvalho</snm><fnm>ML</fnm></au><au><snm>Maracaja-Coutinho</snm><fnm>V</fnm></au><au><snm>Kitajima</snm><fnm>JP</fnm></au><au><snm>Verjovski-Almeida</snm><fnm>S</fnm></au></aug><source>An Acad Bras Cienc</source><pubdate>2011</pubdate><volume>83</volume><issue>2</issue><fpage>673</fpage><lpage>694</lpage></bibl><bibl id="B96"><title><p>Identification of common molecular subsequences</p></title><aug><au><snm>Smith</snm><fnm>TF</fnm></au><au><snm>Waterman</snm><fnm>MS</fnm></au></aug><source>J Mol Biol</source><pubdate>1981</pubdate><volume>147</volume><issue>1</issue><fpage>195</fpage><lpage>197</lpage></bibl><bibl id="B97"><title><p>Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative</p></title><aug><au><snm>Anisimova</snm><fnm>M</fnm></au><au><snm>Gascuel</snm><fnm>O</fnm></au></aug><source>Syst Biol</source><pubdate>2006</pubdate><volume>55</volume><issue>4</issue><fpage>539</fpage><lpage>552</lpage></bibl><bibl id="B98"><title><p>Information theory and an extension of the maximum likelihood principle</p></title><aug><au><snm>Akaike</snm><fnm>H</fnm></au></aug><source>2nd International Symposium on Information Theory 1973</source><publisher>Budapest: Akademiai kiado</publisher><editor>Petrov BN, Cs&#224;ki F</editor><pubdate>1973</pubdate><fpage>267</fpage><lpage>281</lpage></bibl><bibl id="B99"><title><p>ETE: a python Environment for Tree Exploration</p></title><aug><au><snm>Huerta-Cepas</snm><fnm>J</fnm></au><au><snm>Dopazo</snm><fnm>J</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>BMC Bioinforma</source><pubdate>2010</pubdate><volume>11</volume><fpage>24</fpage></bibl><bibl id="B100"><title><p>Distinguishing homologous from analogous proteins</p></title><aug><au><snm>Fitch</snm><fnm>WM</fnm></au></aug><source>Syst Zool</source><pubdate>1970</pubdate><volume>19</volume><issue>2</issue><fpage>99</fpage><lpage>113</lpage></bibl><bibl id="B101"><title><p>The tree versus the forest: the fungal tree of life and the topological diversity within the yeast phylome</p></title><aug><au><snm>Marcet-Houben</snm><fnm>M</fnm></au><au><snm>Gabald&#243;n</snm><fnm>T</fnm></au></aug><source>PLoS One</source><pubdate>2009</pubdate><volume>4</volume><issue>2</issue><fpage>e4357</fpage></bibl></refgrp>
	</bm>
</art>