<?xml version='1.0'?>
<!DOCTYPE art SYSTEM 'http://www.biomedcentral.com/xml/article.dtd'>
<art>
   <ui>1471-2164-11-128</ui>
   <ji>1471-2164</ji>
   <fm>
      <dochead>Research article</dochead>
      <bibl>
         <title>
            <p>Defining species specific genome differences in malaria parasites</p>
         </title>
         <aug>
            <au id="A1">
               <snm>Liew</snm>
               <mi>JL</mi>
               <fnm>Kingsley</fnm>
               <insr iid="I1"/>
               <email>kingsley@pmail.ntu.edu.sg</email>
            </au>
            <au id="A2">
               <snm>Hu</snm>
               <fnm>Guangan</fnm>
               <insr iid="I1"/>
               <email>HU0002AN@ntu.edu.sg</email>
            </au>
            <au id="A3">
               <snm>Bozdech</snm>
               <fnm>Zbynek</fnm>
               <insr iid="I1"/>
               <email>ZBozdech@ntu.edu.sg</email>
            </au>
            <au ca="yes" id="A4">
               <snm>Peter</snm>
               <mi>R</mi>
               <fnm>Preiser</fnm>
               <insr iid="I1"/>
               <email>prpreiser@ntu.edu.sg</email>
            </au>
         </aug>
         <insg>
            <ins id="I1">
               <p>Division of Genomics and Genetics, School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive, Singapore 637551, Singapore</p>
            </ins>
         </insg>
         <source>BMC Genomics</source>
         <issn>1471-2164</issn>
         <pubdate>2010</pubdate>
         <volume>11</volume>
         <issue>1</issue>
         <fpage>128</fpage>
         <url>http://www.biomedcentral.com/1471-2164/11/128</url>
         <xrefbib>
            <pubidlist>
               <pubid idtype="doi">10.1186/1471-2164-11-128</pubid>
               <pubid idtype="pmpid">20175934</pubid>
            </pubidlist>
         </xrefbib>
      </bibl>
      <history>
         <rec>
            <date>
               <day>10</day>
               <month>7</month>
               <year>2009</year>
            </date>
         </rec>
         <acc>
            <date>
               <day>23</day>
               <month>2</month>
               <year>2010</year>
            </date>
         </acc>
         <pub>
            <date>
               <day>23</day>
               <month>2</month>
               <year>2010</year>
            </date>
         </pub>
      </history>
      <cpyrt>
         <year>2010</year>
         <collab>Liew et al; licensee BioMed Central Ltd.</collab>
         <note>This is an Open Access article distributed under the terms of the Creative Commons Attribution License (<url>http://creativecommons.org/licenses/by/2.0</url>), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</note>
      </cpyrt>
      <abs>
         <sec>
            <st>
               <p>Abstract</p>
            </st>
            <sec>
               <st>
                  <p>Background</p>
               </st>
               <p>In recent years a number of genome sequences for different <it>plasmodium </it>species have become available. This has allowed the identification of numerous conserved genes across the different species and has significantly enhanced our understanding of parasite biology. In contrast little is known about species specific differences between the different genomes partly due to the lower sequence coverage and therefore relatively poor annotation of some of the draft genomes particularly the rodent malarias parasite species.</p>
            </sec>
            <sec>
               <st>
                  <p>Results</p>
               </st>
               <p>To improve the current annotation and gene identification status of the draft genomes of <it>P. berghei</it>, <it>P. chabaudi </it>and <it>P. yoelii</it>, we performed genome-wide comparisons between these three species. Through analyses via comparative genome hybridizations using a newly designed pan-rodent array as well as in depth bioinformatics analysis, we were able to improve on the coverage of the draft rodent parasite genomes by detecting orthologous genes between these related rodent parasite species. More than 1,000 orthologs for <it>P. yoelii </it>were now newly associated with a <it>P. falciparum </it>gene. In addition to extending the current core gene set for all plasmodium species this analysis also for the first time identifies a relatively small number of genes that are unique to the primate malaria parasites while a larger gene set is uniquely conserved amongst the rodent malaria parasites.</p>
            </sec>
            <sec>
               <st>
                  <p>Conclusions</p>
               </st>
               <p>These findings allow a more thorough investigation of the genes that are important for host specificity in malaria.</p>
            </sec>
         </sec>
      </abs>
   </fm>
   <meta>
      <classifications>
         <classification id="endnote" subtype="user_supplied_xml" type="bmc"/>
      </classifications>
   </meta>
   <bdy>
      <sec>
         <st>
            <p>Background</p>
         </st>
         <p>Malaria is a disease caused by the parasitic protozoa from the genus <it>Plasmodium</it>. While the disease is restricted to the tropical and sub-tropical regions of the world due mainly to the natural habitat of the mosquito vector, these regions are densely populated with almost 2.2 billion people living in endemic areas and 515 million cases were expected per annum <abbrgrp><abbr bid="B1">1</abbr></abbrgrp>. Although well-established culture and molecular techniques have been established for <it>Plasmodium falciparum</it>, the use of rodent malaria parasites as <it>in vivo </it>models in the study of the host-parasite interactions is still as relevant today because the rodent parasites are very similar to the human and primate parasites in terms of life cycle, physiology and structure <abbrgrp><abbr bid="B2">2</abbr></abbrgrp>.</p>
         <p>Since the release of the genome sequences of <it>P. falciparum </it>and the rodent malaria species, common features of these haploid genomes include a genome size of 22-26 Mb that are arranged in 14 chromosomes ranging from 0.5-3.0 Mb. In addition the current genomic data show a high degree of conservation between different <it>Plasmodium </it>species with the exception of genes located in the telomeric and subtelomeric regions that are extremely variable due to their role in antigenic variation and immune evasion <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>. This suggests that genes located close to the centromeres of the chromosomes would be highly conserved. As a proof of concept, comparisons of these centrally located genes between <it>P. falciparum </it>and the most completely annotated rodent parasite species <it>P. yoelii </it>was shown to have a high degree of synteny <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. Thus, conservation of these common 'core' genes even amongst divergent species was demonstrated and differences were mainly due to chromosomal re-arrangements <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B4">4</abbr></abbrgrp>.</p>
         <p>In contrast, the chromosomal regions responsible for antigenic variation and host immune evasion show the most divergence. The genome of <it>P. falciparum </it>contains species-specific subtelomeric genes involved in host cell invasion, adhesion and antigenic variation that are not found in the <it>P. yoelii </it>genome. For example, the <it>P. falciparum </it>genes that are located in the sub-telomeric regions include the <it>var, stevors </it>and <it>rifins </it>that are responsible for antigenic variation and hence immune evasion <abbrgrp><abbr bid="B6">6</abbr><abbr bid="B7">7</abbr></abbrgrp>. In contrast, the <it>P. yoelii yir </it>gene family seems to be the largest multigene family found in the sub-telomeric regions and is absent in <it>P. falciparum</it>. Interestingly, this multigene family is common to all the rodent species and to <it>P. vivax </it><abbrgrp><abbr bid="B8">8</abbr></abbrgrp>. A more recent study employing probabilistic modelling in conjunction with genomic organization and protein structure analysis bridges this gap and places the <it>P. falciparum rjf/stevor </it>multigene family together into a conserved multigene superfamily of malaria parasites known as the Plasmodium interspersed repeats (PIRs) <abbrgrp><abbr bid="B9">9</abbr></abbrgrp>. This discovery further suggests that although genes are more conserved than previously thought; there is evidence of species-specific divergence that is dependent on each species' interaction with the host or host immune system. A recent phylogenetic survey of rodent malaria parasites based on DNA sequences from multiple loci in the nuclear, mitochondrial and plastid genomes placed <it>P. berghei </it>and <it>P. yoelii </it>as sister species forming a distinct clade while <it>P. chabaudi </it>and another rodent parasite <it>P. vinckei </it>forming another group <abbrgrp><abbr bid="B10">10</abbr></abbrgrp> thus suggesting that <it>P. berghei </it>and <it>P. yoelii </it>seemed to be more evolutionarily related to each other while <it>P. chabaudi </it>appeared to be a more distinct line. While these studies address gene conservation among different species as well as confirm species-specific sequence polymorphisms a global quantitative comparisons with regards to gain or loss of gene function have not been attempted due to the incomplete genome sequences of the malaria parasite species, barring the high resolution map of <it>P. falciparum </it><abbrgrp><abbr bid="B11">11</abbr></abbrgrp>. As a consequence, differences between the rodent parasites especially at the genomic level have not been fully elucidated as yet and thus linking genomic differences to phenotypic traits of these parasites have been difficult. In order to address this issue, genome-wide comparisons including comparative genomic hybridization (CGH) and detailed bioinformatics analysis can be employed as a valuable tool to identify similarities and differences between related species. This technique has been utilized to compare genomic differences between similar species in both prokaryotes and eukaryotes. For example, CGH has been used to genotype related yeast species to determine the presence and absence/polymorphic sequences <abbrgrp><abbr bid="B12">12</abbr></abbrgrp>. The usefulness of CGH in gene discovery was shown with the discovery of three thousand novel genes in <it>Klebsiella pneuomoniae </it>342 using microarray hybridization with <it>Escherichia coli </it>K-12 open reading frames with confirmation of the presence of genes coding for conserved metabolic functions and demonstrating specificity where genes obtained via lateral transfer in K-12 were absent in 342 <abbrgrp><abbr bid="B13">13</abbr></abbrgrp>. In addition, CGH is also able to differentiate genome-wide variation in various virulent and avirulent <it>Burkholderia </it>species <abbrgrp><abbr bid="B14">14</abbr></abbrgrp>. Besides analyzing sequence variation, the gain/loss of DNA can also be used as a tool for elucidating evolutionary divergence between species in their natural environment.</p>
         <p>The means for such analysis lies in the use of DNA microarray technology based on the 'spotting' of long oligonucleotides onto glass slides <abbrgrp><abbr bid="B15">15</abbr><abbr bid="B16">16</abbr><abbr bid="B17">17</abbr></abbrgrp>. Although the techniques employed for performing DNA microarray experiments have been well established, improvements in the algorithms used for designing the array probes have been critical for advancing the reproducibility and accuracy of detecting the respective gene targets. Recently, a novel robust program called 'OligoRankPick' was able to optimize oligonucleotide design by optimizing target specificity and GC% variation. This algorithm is based on a weighted rank-sum strategy to optimize oligonucleotide selection even along genomes of many diverse organisms <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. The resultant <it>P. falciparum </it>dataset generated from this strategy was shown to be highly reproducible and also led to increased coverage of the <it>P. falciparum </it>genome as compared to earlier <it>P. falciparum </it>microarray designs. Although the incompleteness of these rodent parasite genomes could present difficulties in their comparative analysis, there is some evidence that they are highly conserved <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>. We have thus emulated the 'OligoRankPick' strategy to design a pan-rodent cross-genome oligonucleotide microarray for the rodent malaria species <it>P. berghei</it>, <it>P. chabaudi </it>and <it>P. yoelii</it>. Due to the incompleteness of the genome sequence available for all three rodent genomes the microarray design approach resulted in oligonucleotide sequences predicted to be complementary to all three rodent species, two rodent malaria species or to be unique to a single species. Comparative genomic hybridization using genomic DNA from each of the three species not only validated the oligonucleotides but at the same time provided new information that allowed the closure of sequence gaps in the genomes of one or more of the rodent species based on complementary hybridization data, thereby significantly improving our overall understanding of gene content for each species. While the CGH approach proofed exceptionally powerful it was still likely that due to sequence polymorphisms found in the different species at the oligonucleotide probe region genes that are actually present would be missed using this strategy alone. For this reason the current available genome sequences were reanalyzed using bioinformatics tools to detect any additional genes missed by the microarray approach alone. Finally, a random selection of genes predicted from the microarray and bioinformatics data to now be present in all rodent genome species was validated using PCR and sequencing.</p>
         <p>Overall this multipronged approach allowed us to significantly improve the gene predictions for all three rodent malaria species. In addition the new information obtained in this study allows the assembly of an improved core set of genes present in all plasmodium species while at the same time also identifying a rodent malaria specific gene set.</p>
      </sec>
      <sec>
         <st>
            <p>Results</p>
         </st>
         <sec>
            <st>
               <p>Key design features of the 'Pan-rodent' array chip</p>
            </st>
            <p>In order to detect genes from all three rodent malaria parasite (RMP) species, the DNA microarray design was based on the 'OligoRankPick' program <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> as this algorithm was shown to have significant improvements over other strategies even when dealing with large fluctuations in GC content and abundant gene duplications. Since the microarray designed here was to include all available information for three RMP species, there are key challenges and modifications to our original program so as to accommodate multiple genomes on a single microarray. Since <it>P. yoelii </it>is the only RMP with a published draft genome <abbrgrp><abbr bid="B3">3</abbr></abbrgrp>, we first designed oligonucleotides against the predicted <it>P. yoelii </it>gene models. Next, all possible oligonucleotides of <it>P. yoelii </it>were used to query the homologous regions of the other two RMP species (Figure <figr fid="F1">1A</figr>). Next, four parameters were used to select for the best oligonucleotide, namely: BLAST scores (first and second hit), GC content, sequence complexity and self-annealing scores (Figure <figr fid="F1">1B</figr>). Each score is then transformed into a rank and a weighted rank-sum is calculated for each oligonucleotide with the final oligonucleotide being selected based on the smallest rank-sum value. These oligonucleotides were then selected to be optimal for all three species, followed by oligonucleotides for <it>P. yoelii </it>and <it>P. berghei </it>and then for <it>P. yoelii </it>and <it>P. chabaudi </it>(Figure <figr fid="F1">1A</figr>). For the remaining predicted open reading frames oligonucleotides specific for each species were then designed. A total of 14,736 oligonucleotides were thus obtained and the breakdown is shown in Figure <figr fid="F2">2</figr>.</p>
            <fig id="F1">
               <title>
                  <p>Figure 1</p>
               </title>
               <caption>
                  <p>The overall design schematics of the pan-rodent chip</p>
               </caption>
               <text>
                  <p><b>The overall design schematics of the pan-rodent chip</b>. (A) Methodology of the chip design. Firstly, all possible oligonucleotides of <it>P. yoelii </it>were used to search in the homologous region of the other two species using NCBI blastn and were scored and ranked accordingly. The oligonucleotides were then filtered using three rules such they must have: (i) at least 90% homology to target sequences, (ii) less than 37.5% to non-target sequences and (iii) GC% tolerance of &#177; 5%. Oligonucleotides for all three species were selected followed by oligonucleotides for <it>P. yoelii </it>and <it>P. berghei </it>and then for <it>P. yoelii </it>and <it>P. chabaudi</it>. Next, the remaining oligonucleotides were selected to be specific for both <it>P. berghei </it>and <it>P. chabaudi</it>. The remaining sequences unaccounted for were then used to design oligonucleotides specific either to <it>P. yoelii, P. berghei </it>or <it>P. chabaudi</it>. (B) Rank-sum strategy. Oligonucleotides were scored accordingly to (i) first and second BLAST hits, (ii) GC content (Tm) and (iii) Smith-Waterman score (self-binding). The oligonucleotides are then ranked based on each parameter and ordinal rank number is given to all oligonucleotides in each parameter rank independently. The final weighted rank-sum (RS) is calculated for all oligonucleotides using multiple weight sets (not indicated) and the lowest value is considered. Finally, the optimal candidate is selected based on the lowest RS(k) amongst all oligonucleotides in the locus of interest <abbrgrp><abbr bid="B18">18</abbr></abbrgrp></p>
               </text>
               <graphic file="1471-2164-11-128-1" hint_layout="double"/>
            </fig>
            <fig id="F2">
               <title>
                  <p>Figure 2</p>
               </title>
               <caption>
                  <p>Venn diagram showing distribution of target rodent parasite gene hits of all oligonucleotides</p>
               </caption>
               <text>
                  <p><b>Venn diagram showing distribution of target rodent parasite gene hits of all oligonucleotides</b>. All oligonucleotides are 60 bases long and the GC content is targeted at 30% and the allowable deviation is 5% for overlapping oligonucleotides. Complementary oligonucleotides to each rodent malaria parasite species was calculated from the sum of all possible combinations, i.e. oligonucleotides specific to itself and those that can hybridize to itself and to other rodent malaria parasite species. (Legend: <it>Pb </it>= <it>P. berghei </it>specific oligonucleotides only; <it>Pc </it>= <it>P. chabaudi </it>specific olinucleotides only; <it>Py </it>= <it>P. yoelii </it>specific oligonucleotides only; <it>Pbc </it>= <it>P. berghei </it>&amp;<it>P. chabaudi </it>specific oligonucleotides; <it>Pby </it>= <it>P. berghei </it>&amp;<it>P. yoelii </it>specific oligonucleotides; <it>Pcy </it>= <it>P. chabaudi </it>&amp;<it>P. yoelii </it>specific oligonucleotides; <it>Pbcy </it>= oligonucleotides specific to all 3 rodent parasite species)</p>
               </text>
               <graphic file="1471-2164-11-128-2" hint_layout="single"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Comparative Genomic Hybridization using the Pan-rodent microarray</p>
            </st>
            <p>Since some of the oligonucleotides are designed to be capable of cross hybridizing with more than one species, we have re-sorted all the genes from the three species and matched them with their respective oligonucleotides so that an accurate normalized value can be obtained for analysis. Using this approach, we performed comparative genomic hybridization experiments and first looked at the performance of the species-specific oligonucleotides that were designed to hybridize with their target genome. After normalizing and filtering the data the percentage of successful oligonucleotides for <it>P. yoelii-</it>specific, <it>P. berghei</it>-specific and <it>P. chabaudi</it>-specific oligonucleotides were 91% (7,347), 90% (6,314) and 84% (6,356) respectively. It is not surprising that a low proportion of oligonucleotides were unable to hybridize with their intended targets especially if they were designed in regions with low confidence in sequence quality. A likely reason is the relatively poor genome coverage of the rodent parasite genomes as compared to the more complete <it>P. falciparum </it>genome. Also, there are errors in the current sequence drafts of the rodent malaria parasites due to the propensity of random sequence rearrangements of an AT-rich genome in a sequencing vector.</p>
            <p>Using the validated set of oligonucleotides, we identified all the oligonucleotides that hybridized to the DNA of parasites they had not originally been designed against (Figure <figr fid="F2">2</figr>). A positive hybridization signal provides strong evidence that the sequences and therefore the genes represented by these oligonucleotides are also present in the genomes of the other rodent parasite species. This strongly implies that the respective genes that are represented by these oligonucleotides are actually present but are not found in the current database.</p>
         </sec>
         <sec>
            <st>
               <p>Filling the gaps in the genomes of the rodent malaria parasite species</p>
            </st>
            <p>Since it has been established that the malaria parasite genomes are well conserved <abbrgrp><abbr bid="B5">5</abbr></abbrgrp>, it is conceivable that genes that are missing in the current draft of either of the RMP genome have a well conserved ortholog in one or two of the other species. Using the pan-rodent microarray we wish to investigate this possibility by inspecting comparative genome hybridization (CGH) signals on "cross-species" oligonucleotide microarray elements. For example high signal from <it>P. yoelii </it>gDNA for an oligonucleotide that is not designed to hybridize to <it>P. yoelii </it>implies that this gene (sequence) is present in <it>P. yoelii</it>. In summary, a species where the draft genome currently does not possess a particular gene but now gives a signal on the array suggests the presence of this orthologous gene. Hence, this gene can thus be detected based on homology.</p>
            <p>Missing genes in the draft genome could arise due to two scenarios: either the sequence information is missing, or that the sequence is present in the genome but missed by current gene prediction algorithms. Since the oligonucleotides were designed based on predicted open reading frames, a CGH signal constitutes direct experimental evidence for the presence of an orthologous genomic sequence and thus potentially the gene in the RMP genome in which this gene is missing. Based on this approach, we detect 179, 306 and 215 genes missing in the current draft sequences of the <it>P. yoelii</it>, the <it>P. berghei </it>and the <it>P. chabaudi </it>genome, respectively. The majority of these genes code for hypothetical proteins that lack any functional assignment based on their amino acid sequence. For those whose function has been described, genes involved in biosynthesis, protein modifications, kinases and also invasion-related proteins in the case of <it>P. chabaudi </it>and <it>P. berghei </it>have been discovered (Additional file <supplr sid="S1">1</supplr>: Supplemental Table S1).</p>
            <suppl id="S1">
               <title>
                  <p>Additional file 1</p>
               </title>
               <text>
                  <p><b>Supplemental Table S1</b>. Identification of orthologs by microarray.</p>
               </text>
               <file name="1471-2164-11-128-S1.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>A selection of genes that were detected via the microarray data were randomly selected for polymerase chain reaction (PCR) screening (Figure <figr fid="F3">3</figr>) and direct sequencing in order to establish the confidence of this group of newly discovered genes. PCR primer pairs were designed flanking the oligonucleotide sequence to the species where there is known sequence information and these were used to amplify a newly predicted gene in another species where it is not annotated/predicted or where the sequence is absent. All screens were performed on regions where sequence information for the newly discovered orthologous genes is not present (i.e. missing sequence) except for PY00632 and PY03414 where a corresponding <it>P. berghei </it>contig is present but the gene is not predicted. In summary, 7 of the 8 PCR reactions gave a product around the predicted size and the one sample that was PCR negative could be due to sequence polymorphisms at the primer sites. Sequence analysis of the PCR product confirmed that the PCR product did indeed represent the predicted gene (data not shown). Some PCR products also exhibit a change in the predicted size, for example different PCR product sizes of PY00632 and its <it>P. berghei </it>ortholog are expected based on the currently available sequence information. Differences in PCR product size are due to variations in sequence length in the region bounded by the primers. The differences in PCR product size in the PY06972 screen of <it>P. yoelii </it>and <it>P. chabaudi </it>genomes are also due to the same reason. The high congruence of PCR-positive screens showed the power of the array in detecting homologous sequences currently absent from the available genome sequences of the other species. Additional microarray investigations pertaining to the detection and validation of polymorphic genes and differential transcription profiles between related parasite clones have further validated the performance of the oligonucleotide probes (unpublished).</p>
            <fig id="F3">
               <title>
                  <p>Figure 3</p>
               </title>
               <caption>
                  <p>PCR screening of a random sample of newly discovered genes</p>
               </caption>
               <text>
                  <p><b>PCR screening of a random sample of newly discovered genes</b>. Screenings were performed pair-wise with the PCR products of the species containing the known gene of interest loaded in odd-numbered wells while the corresponding PCR screen of the other species whereby sequence is absent or the gene is not predicted are in the even-numbered wells. The Genbank accession numbers of these novel orthologs are indicated in parentheses in the following description. (1&amp;2): <ext-link ext-link-type="gen" ext-link-id="PY00632">PY00632</ext-link> screen with Py and Pb gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390534">GU390534</ext-link>); (3&amp;4): <ext-link ext-link-type="gen" ext-link-id="PY03414">PY03414</ext-link> screen with Py and Pb gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390535">GU390535</ext-link>); (5&amp;6): <ext-link ext-link-type="gen" ext-link-id="PY04600">PY04600</ext-link> screen with Py and Pb gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390540">GU390540</ext-link>); (7&amp;8): <ext-link ext-link-type="gen" ext-link-id="PY04485">PY04485</ext-link> screen with Py and Pb gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390538">GU390538</ext-link>); (9&amp;10): <ext-link ext-link-type="gen" ext-link-id="PY02086">PY02086</ext-link> screen with Py and Pc gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390539">GU390539</ext-link>); (11&amp;12): <ext-link ext-link-type="gen" ext-link-id="PY04869">PY04869</ext-link> with Py and Pc gDNA; (13&amp;14): <ext-link ext-link-type="gen" ext-link-id="PY06972">PY06972</ext-link> screen with Py and Pc gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390536">GU390536</ext-link>); (15&amp;16): <ext-link ext-link-type="gen" ext-link-id="PY05482">PY05482</ext-link> screen with Py and Pc gDNA (<ext-link ext-link-type="gen" ext-link-id="GU390537">GU390537</ext-link>). (Legend: M = 100 bp DNA ladder)</p>
               </text>
               <graphic file="1471-2164-11-128-3" hint_layout="single"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Cross-species gene identification using bioinformatics</p>
            </st>
            <p>Although a total of 700 novel orthologous rodent parasite genes have been discovered via the array approach, these revisions still leave significant gaps in the genomes of the three rodent parasite species. Therefore in addition to CGH, we also utilize bioinformatics to look at the genome in its entirety so as to probe for open reading frames that have been missed by the automated gene predictions provided by the RMP sequencing projects. The use of bioinformatics tools to query non-coding regions of the rodent malaria parasite genomes can be useful in detecting genes that have been missed by automated gene prediction algorithms. The more complete <it>P. falciparum </it>amino acid sequences were used to query the rodent malaria parasite genomes as a tBLASTn search using an expect value of at least 10<sup>-15 </sup>as a cut-off threshold. While this approach does not account for synteny, evidence that regulation of transcription of individual genes occurred independently without any constraints on chromosomal location further supports this approach <abbrgrp><abbr bid="B19">19</abbr><abbr bid="B20">20</abbr></abbrgrp>. In addition, multigene families would be collapsed, as conserved domains would link such members together. Using this approach, 103 orthologous genes were discovered in <it>P. berghei</it>, 93 in <it>P. chabaudi </it>and 286 in <it>P. yoelii</it>. This data also suggests that while the <it>P. yoelii </it>genome is more completely assembled, it is lacking a number of predicted open reading frames. Similarly, PCR screens of these predicted genes were performed in order to obtain additional confidence for the bioinformatics derived dataset. For this confirmation, two groups were selected based on whether a homologous contig for a particular species is present in a region where a gene is not predicted, thereby checking if this gene is truly deleted from this locus or is still present, being either translocated to another locus or missed by the automated contig assembly (Figure <figr fid="F4">4</figr>). On another hand, we have observed another scenario where a newly discovered orthologous gene is not present in the species of interest. While sequence information is absent for these candidate genes, the adjacent genes are present in separate contigs. We thus wanted to screen these gaps or assembly errors that have excluded this gene (Figure <figr fid="F5">5</figr>). The re-annotated <it>P. chabaudi </it>gene orthologous to PFF1480w, PB000730.00.0 and PY03519 was screened twice as the best-fit <it>P. chabaudi </it>contig only contains the 5'-end sequence of this gene while the 3'-end is missing. Based on the random sampling approach (Figure <figr fid="F4">4</figr>), the PFI0535w-PB104921.00.0 pair seems to be exclusively found in <it>P. berghei </it>and not in the other 2 RMP species. For the PF14 0473-PC0001359.02.0 pair, it seems that this gene is also present in <it>P. berghei </it>while the MAL13P1.345-PY04218 pair looks to be also present in <it>P. berghei </it>as well. As for the other three pairs of genes (PFL0595c-PB301230.00.0&#172;PC000699.01.0; PFF1480w(3'end)-PB000730.00.0-PY03519; PF13_0131&#172;PC000708.04.0-PY04599) that seem to be missing in either one of the three RMP species, PCR screening suggests otherwise and that they are indeed common to all three RMP species. These results suggest that rodent parasite genes that are seemingly absent on a contig are likely to be present with high confidence in all three rodent malaria parasite species. Almost all of the other panel's screenings (Figure <figr fid="F5">5</figr>) showed that all of those genes were present in the three RMP species barring the <it>P. chabaudi </it>exclusive gene (PFL2450c-PC000344.03.0) that seemed to be present also in <it>P. berghei </it>but seemingly not in <it>P. yoelii</it>. This suggests with high confidence that the missing genes due to incomplete sequence information are present. Overall, it seems that it is highly likely that missing genes due to poor sequencing coverage are present and genes common to two rodent malaria parasite species are very likely to be present in all three species.</p>
            <fig id="F4">
               <title>
                  <p>Figure 4</p>
               </title>
               <caption>
                  <p>PCR screening of bioinformatics-filtered dataset with contig information</p>
               </caption>
               <text>
                  <p><b>PCR screening of bioinformatics-filtered dataset with contig information</b>. (A) Schematic depicting the scenario whereby a contig from 1 species containing the gene of interest (shown as a hatched block) is aligned with the best corresponding hit contig from another species. In this case, the gene of interest is lost in 1 species while the flanking sequences are still present. (B) PFI0535w-PB104921.00.0 orthologs screened with (1)Pb, (2)Pc and (3)Py gDNA. PF14 0473-PC0001359.02.0 orthologs screened with (4)Pb, (5)Pc and (6)Py gDNA. MAL13P1.345-PY04218 orthologs genes screened with (7)Pb, (8)Pc and (9)Py gDNA. PFL0595c-PB301230.00.0-PC000699.01.0 orthologs screened with (10)Pb, (11)Pc and (12)Py gDNA. PFF1480w(3'end)-PB000730.00.0PY03519 orthologs screened with (13)Pb, (14)Pc and (15)Py gDNA. PF13_0131PC000708.04.0-PY04599 orthologs screened with (16)Pb, (17)Pc and (18)Py gDNA. (Legend: M = 100 bp DNA ladder)</p>
               </text>
               <graphic file="1471-2164-11-128-4" hint_layout="single"/>
            </fig>
            <fig id="F5">
               <title>
                  <p>Figure 5</p>
               </title>
               <caption>
                  <p>PCR screening of bioinformatics-filtered dataset without contig information</p>
               </caption>
               <text>
                  <p><b>PCR screening of bioinformatics-filtered dataset without contig information</b>. (A) Schematic depicting the scenario whereby a contig from 1 species containing the gene of interest (shown as a hatched block) is aligned with the best corresponding hit contig from another species. In this case, the gene of interest is lost in 1 species due to missing sequence information while the adjacent flanking contig sequences are still present. (B) PFC0095c-PB000276.02.0 orthologs screened with (1)Pb, (2)Pc and (3)Py gDNA. PFL2450c-PC000344.03.0 orthologs screened with (4)Pb, (5)Pc and (6)Py gDNA. MAL8P1.310-PY06565 orthologs screened with (7)Pb, (8)Pc and (9)Py gDNA. PFB0645c-PB000193.00.0-PC000452.02.0 orthologs screened with (10)Pb, (11)Pc and (12)Py gDNA. PFF1480w(5'-end)-PB000730.00.0-PY03519 orthologs screened with (13)Pb, (14)Pc and (15)Py gDNA. PF13_0278-PC000860.02.0-PY04659 orthologs screened with (16)Pb, (17)Pc and (18)Py gDNA. (Legend: M = 100 bp DNA ladder)</p>
               </text>
               <graphic file="1471-2164-11-128-5" hint_layout="single"/>
            </fig>
         </sec>
         <sec>
            <st>
               <p>Defining the core <it>Plasmodium </it>genome</p>
            </st>
            <p>With the more complete coverage of the rodent malaria parasite genomes obtained via the analysis here, we now thought to define a common 'core' <it>Plasmodium </it>genome where orthologous genes present in all sequenced malaria parasites was defined. <it>Plasmodium falciparum</it>, being the most completely annotated and researched malaria parasite genome, is used as an index reference species in an attempt to consolidate genes common to as many species as possible. For each <it>P. falciparum </it>gene the orthologs present in the rodent parasite species were matched, and the resulting dataset was further appended with the respective orthologs from another human malaria parasite <it>P. vivax </it>and the simian parasite <it>P. knowlesi </it>as these genomes had been recently sequenced to 10&#215; and 8&#215; <abbrgrp><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr></abbrgrp> coverage respectively. This approach segregated <it>P. falciparum </it>genes into two groups: firstly, the 'core' genes containing orthologs to the rodent parasite as well as the <it>P. vivax </it>and <it>P. knowlesi </it>sequences (Additional file <supplr sid="S2">2</supplr>: Supplemental Table S2) and secondly, genes that have no significant alignment with rodent sequences (Additional file <supplr sid="S3">3</supplr>: Supplemental Table S3). While orthologous genes (with respect to <it>P. falciparum</it>) are highly conserved in regions where housekeeping genes predominate, the telomeric and sub-telomeric regions (i.e. non-syntenic) contain genes involved in antigenic variation that are more divergent <abbrgrp><abbr bid="B3">3</abbr></abbrgrp> and this is consistent with the observation that the group representing the non-orthologous genes is dominated by <it>var, stevor </it>and <it>rifin </it>gene families that are involved in antigenic variation. What is immediately apparent is the high similarity of all six parasite species in the regions that contain housekeeping genes. In total, 4,188 genes were common between <it>P. falciparum </it>and the rodent parasites. Of these, 73 genes, mainly hypothetical genes, were absent in both <it>P. vivax </it>and <it>P. knowlesi</it>. In addition, 50 genes were present in <it>P. vivax </it>but not in <it>P. knowlesi </it>and 79 genes were present in <it>P. knowlesi </it>but not in <it>P. vivax</it>. Without these species-specific genes, we found 3,986 genes that are common to all six species.</p>
            <suppl id="S2">
               <title>
                  <p>Additional file 2</p>
               </title>
               <text>
                  <p><b>Supplemental Table S2</b>. Orthologous genes in <it>P. falciparum </it>and RMP.</p>
               </text>
               <file name="1471-2164-11-128-S2.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S3">
               <title>
                  <p>Additional file 3</p>
               </title>
               <text>
                  <p><b>Supplemental Table S3</b>. <it>P. falciparum </it>genes with no RMP ortholog.</p>
               </text>
               <file name="1471-2164-11-128-S3.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <p>An additional 387 <it>P. falciparum </it>genes (conserved in the two primate malaria parasites) appear to be missing in one or two of the rodent malaria parasite species (Table <tblr tid="T1">1</tblr>) with the vast majority of these representing hypothetical genes. The main exception represents genes conserved between <it>P. falciparum</it>, <it>P. berghei </it>and <it>P. yoelii </it>but missing in <it>P. chabaudi </it>from which approximately 30% have a functional annotation. Of the 387 genes, only a relatively small number (27-40) are exclusively found in only <it>P. falciparum </it>and one of the rodent malaria species with a significantly large number (92-96) being present in two species. While the genes missing in one or two rodent species could potentially represent species specific gene deletions, the fact that a significant number of these (72%-96%) orthologs are also present in the more divergent <it>P. knowlesi </it>and <it>P. vivax </it>genomes are highly suggestive that the respective rodent orthologs that have not yet been identified by either the sequencing project or the analysis reported here could likely be present.</p>
            <tbl id="T1">
               <title>
                  <p>Table 1</p>
               </title>
               <caption>
                  <p>Syntenic Pf genes with corresponding RMP chromosome.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="left">
                        <p>
                           <b>Gene groups</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>No. of genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Hypothetical genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Corresponding Pv genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Corresponding Pk genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf</p>
                     </c>
                     <c ca="center">
                        <p>46</p>
                     </c>
                     <c ca="center">
                        <p>45 (98%)</p>
                     </c>
                     <c ca="center">
                        <p>28 (61%)</p>
                     </c>
                     <c ca="center">
                        <p>30 (65%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pb</p>
                     </c>
                     <c ca="center">
                        <p>40</p>
                     </c>
                     <c ca="center">
                        <p>38 (95%)</p>
                     </c>
                     <c ca="center">
                        <p>31 (78%)</p>
                     </c>
                     <c ca="center">
                        <p>32 (80%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pc</p>
                     </c>
                     <c ca="center">
                        <p>27</p>
                     </c>
                     <c ca="center">
                        <p>25 (93%)</p>
                     </c>
                     <c ca="center">
                        <p>22 (81%)</p>
                     </c>
                     <c ca="center">
                        <p>24 (89%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Py</p>
                     </c>
                     <c ca="center">
                        <p>36</p>
                     </c>
                     <c ca="center">
                        <p>34 (94%)</p>
                     </c>
                     <c ca="center">
                        <p>26 (72%)</p>
                     </c>
                     <c ca="center">
                        <p>28 (78%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pb-Py</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                     <c ca="center">
                        <p>67 (70%)</p>
                     </c>
                     <c ca="center">
                        <p>87 (91%)</p>
                     </c>
                     <c ca="center">
                        <p>90 (94%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pb-Pc</p>
                     </c>
                     <c ca="center">
                        <p>96</p>
                     </c>
                     <c ca="center">
                        <p>77 (80%)</p>
                     </c>
                     <c ca="center">
                        <p>91 (95%)</p>
                     </c>
                     <c ca="center">
                        <p>93 (97%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pc-Py</p>
                     </c>
                     <c ca="center">
                        <p>92</p>
                     </c>
                     <c ca="center">
                        <p>78 (85%)</p>
                     </c>
                     <c ca="center">
                        <p>88 (96%)</p>
                     </c>
                     <c ca="center">
                        <p>88 (96%)</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Gene clusters of orthologous genes across six malaria parasite species, namely <it>P. falciparum </it>(Pf), <it>P. berghei </it>(Pb), <it>P. chabaudi </it>(Pc), <it>P. yoelii </it>(Py), <it>P. knowlesi </it>(Pk) and <it>P. vivax </it>(Pv). More genes are common to all six species where evidence of rodent malaria parasite (RMP) chromosome matching to <it>P. falciparum </it>syntenic regions is present.</p>
               </tblfn>
            </tbl>
            <p>On the other hand we identify 927 <it>P. falciparum </it>genes without a match to any rodent parasite species, 469 were composed of hypothetical genes (Table <tblr tid="T2">2</tblr>), with the majority of known genes being made up of the <it>var, rif </it>and <it>stevor </it>multigene families that are present at the subtelomeric regions of essentially all <it>P. falciparum </it>chromosomes (Additional file <supplr sid="S3">3</supplr>: supplemental table S3). Only about 14% of the 927 genes were conserved in <it>P. vivax </it>and/or <it>P. knowlesi</it>, with 117 being present in both species. While all genes exclusive to <it>P. falciparum </it>and <it>P. knowlesi </it>represent hypothetical genes, 6 genes specific to <it>P. falciparum </it>and <it>P. vivax </it>encode proteins involved in parasite invasion; 2 cysteine proteases and 4 MSP7-like genes. Taken together we define 4,188 genes that form a core gene set for a Plasmodium genus. From this set only a small fraction of genes (27 to 96 ~ 0.6%-2.2%) is lost in one or two of the six <it>Plasmodium </it>species. In contrast there are a large number of genes that appear specific to individual <it>Plasmodium </it>species and conserved only in one or two others. These mainly involve gene associated with host parasite interaction and are localized in the subtelomeric regions. However, limited diversification was also observed in genes present in the intrachromosomal region mainly involved in other Plasmodium specific functions such as invasion.</p>
            <tbl id="T2">
               <title>
                  <p>Table 2</p>
               </title>
               <caption>
                  <p>Non-syntenic Pf genes without corresponding RMP chromosome.</p>
               </caption>
               <tblbdy cols="5">
                  <r>
                     <c ca="center">
                        <p>
                           <b>Gene groups</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>No. of genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Hypothetical genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Corresponding Pv genes</b>
                        </p>
                     </c>
                     <c ca="center">
                        <p>
                           <b>Corresponding Pk genes</b>
                        </p>
                     </c>
                  </r>
                  <r>
                     <c cspan="5">
                        <hr/>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf</p>
                     </c>
                     <c ca="center">
                        <p>927</p>
                     </c>
                     <c ca="center">
                        <p>469 (51%)</p>
                     </c>
                     <c ca="center">
                        <p>129 (14%)</p>
                     </c>
                     <c ca="center">
                        <p>120 (13%)</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pb</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pc</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Py</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pb-Py</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1 (100%)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pb-Pc</p>
                     </c>
                     <c ca="center">
                        <p>1</p>
                     </c>
                     <c ca="center">
                        <p>1 (100%)</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
                  <r>
                     <c ca="left">
                        <p>Pf-Pc-Py</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                     <c ca="center">
                        <p>0</p>
                     </c>
                  </r>
               </tblbdy>
               <tblfn>
                  <p>Gene clusters of orthologous genes across six malaria parasite species, namely <it>P. falciparum </it>(Pf), <it>P. berghei </it>(Pb), <it>P. chabaudi </it>(Pc), <it>P. yoelii </it>(Py), <it>P. knowlesi </it>(Pk) and <it>P. vivax </it>(Pv). In contrast to genes found in syntenic regions (Table <tblr tid="T1">1</tblr>) significantly less genes are common in <it>P. falciparum </it>species specific coding regions.</p>
               </tblfn>
            </tbl>
         </sec>
         <sec>
            <st>
               <p>Defining the core rodent <it>Plasmodium </it>genome</p>
            </st>
            <p>Using a similar approach as in the previous section, we attempt to construct a rodent parasite specific associative table since we can also take advantage of the dataset from the genomic hybridizations. For this purpose, instead of using genes from <it>P. falciparum </it>as an index, the oligonucleotides were chosen as the index and then replaced with the best-hit <it>P. yoelii </it>gene. Next, <it>P. falciparum </it>- <it>P. yoelii</it>, <it>P. berghei </it>- <it>P. yoelii </it>and <it>P. chabaudi </it>- <it>P. yoelii </it>orthologs were appended to the list. This strategy creates a database of genes orthologous to <it>P. yoelii </it>and the reason for choosing this species as the index is due to it being more completely annotated than the other two rodent parasite species. This list was refined by removing matching orthologs from <it>P. falciparum </it>that were already present in the common 'core' set; thereby creating a filtered list that contains only genes that are specific to the rodent malaria parasites. Microarray data and bioinformatics (tBLASTn) to query the <it>P. yoelii </it>amino acid sequences were also employed. In order to reduce the complexity of the dataset, the PIRs, which consist of the largest multigene family and dominate the rodent specific genes, will not be discussed due to their variability of expression and gene expansion amongst the rodent malaria parasites due to its main role in antigenic variation <abbrgrp><abbr bid="B23">23</abbr><abbr bid="B24">24</abbr></abbrgrp> as well as its copy number being the most abundant in <it>P. yoelii</it>. Consequently, the huge reduction in genes common in all three rodent malaria parasite species is due to the removal of the PIR genes that could also confound relationships based on bioinformatics simply due to multiple matching of common conserved domains (Additional file <supplr sid="S4">4</supplr>: Supplemental Table S4; Figure <figr fid="F6">6</figr>).</p>
            <suppl id="S4">
               <title>
                  <p>Additional file 4</p>
               </title>
               <text>
                  <p><b>Supplemental Table S4</b>. Genes common to <it>P. yoelii, P. berghei </it>and <it>P. chabaudi</it>.</p>
               </text>
               <file name="1471-2164-11-128-S4.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <fig id="F6">
               <title>
                  <p>Figure 6</p>
               </title>
               <caption>
                  <p>Graphical plot showing the re-distribution of the rodent malaria parasite genes after removing the PIR genes</p>
               </caption>
               <text>
                  <p><b>Graphical plot showing the re-distribution of the rodent malaria parasite genes after removing the PIR genes</b>. Most of the reduction occurs in the group Py-Pb-Pc in which all the three rodent malaria parasite species share a common gene. (Legend: Py = genes specific to <it>P. yoelii</it>; Py-Pb = genes common to both <it>P. yoelii </it>and <it>P. berghei</it>; Py-Pc = genes common to both <it>P. yoelii </it>and <it>P. chabaudi</it>; Py-Pb-Pc = genes common to all three species)</p>
               </text>
               <graphic file="1471-2164-11-128-6" hint_layout="single"/>
            </fig>
            <p>Overall this approach now identified 1,238 genes that are common to all rodent malaria species most of which (1,013) represent hypothetical genes. Importantly only about 8% of these predicted genes have ORF that are shorter than 100 nucleotides making it less likely that these are annotation or prediction errors but indeed represent functional genes. Taking together these gene numbers along with the 4,188 genes found in the 'core' set, the total number of conserved rodent malaria genes is 5,426. In addition it appears that there are 210 genes specific to <it>P. yoelii </it>and <it>P. chabaudi </it>(Additional file <supplr sid="S5">5</supplr>: Supplemental Table S5), 247 genes specific to <it>P. yoelii </it>and <it>P. berghei </it>(Additional file <supplr sid="S6">6</supplr>: Supplemental Table S6) and 453 unique to <it>P. yoelii </it>(Additional file <supplr sid="S7">7</supplr>: Supplemental Table S7) with most of them (> 90%) again representing hypothetical genes. Of the 453 unique <it>P. yoelii </it>genes 45% are less then 100 nucleotide in length, compared to 24% in the <it>P. yoelii </it>and <it>P. berghei </it>group and 17% in the <it>P. yoelii </it>and <it>P. chabaudi </it>group indicating that a number of these genes represent false gene models.</p>
            <suppl id="S5">
               <title>
                  <p>Additional file 5</p>
               </title>
               <text>
                  <p><b>Supplemental Table S5</b>. Genes common to both <it>P. yoelii </it>and <it>P. chabaudi</it>.</p>
               </text>
               <file name="1471-2164-11-128-S5.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S6">
               <title>
                  <p>Additional file 6</p>
               </title>
               <text>
                  <p><b>Supplemental Table S6</b>. Genes common to both <it>P. yoelii </it>and <it>P. berghei</it>.</p>
               </text>
               <file name="1471-2164-11-128-S6.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S7">
               <title>
                  <p>Additional file 7</p>
               </title>
               <text>
                  <p><b>Supplemental Table S7</b>. <it>P. yoelii </it>specific genes.</p>
               </text>
               <file name="1471-2164-11-128-S7.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Discussion</p>
         </st>
         <p>Our microarray specific to the rodent malaria species <it>P. berghei, P. chabaudi </it>and <it>P. yoelii </it>has provided a golden opportunity to verify the genomes of these rodent malaria parasite (RMP) species. The RMP draft genomes were annotated similarly where GlimmerMExon was first trained on <it>P. falciparum </it>data and automated assignments were based on hidden Markov model association <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B25">25</abbr></abbrgrp>. Since these genomes were annotated in a similar manner, we do not foresee any annotation bias in the three RMP draft genomes. Our oligonucleotide selection process aims to include every predicted gene model and the robustness of the design algorithm has been shown in the ability to differentiate between members of large multigene families <abbrgrp><abbr bid="B18">18</abbr></abbrgrp>. Evidence of significant homology via hybridization and bioinformatics show that a high proportion of these gene models are conserved amongst the three RMP species, therefore suggesting that a high proportion represent true gene models that are refractory to random mutations as compared to intronic sequences.</p>
         <p>The definition of the core gene set of <it>Plasmodia </it>genes using the most completely annotated genome of <it>P. falciparum </it>as a reference index has enabled the use of direct hybridization and bioinformatics tools to expand the repertoire of common genes. The validity of utilizing CGH and bioinformatics to improve genome annotation is well established <abbrgrp><abbr bid="B12">12</abbr><abbr bid="B13">13</abbr><abbr bid="B14">14</abbr><abbr bid="B26">26</abbr></abbrgrp>. Novel gene models discovered via hybridization have been validated with PCR (Figure <figr fid="F3">3</figr>) while genes discovered via both hybridization and bioinformatics were also validated (Figures <figr fid="F4">4</figr> and <figr fid="F5">5</figr>). In fact, PCR screens of a subset of genes without at least one RMP ortholog suggest that there is high confidence that any <it>P. falciparum </it>gene that contains at least an ortholog from one of the RMP species can potentially contain orthologs from all three RMP species. This suggests that the high stringency thresholds in both hybridization and bioinformatics are in fact conservative and there are potentially more genes that have yet to be discovered. Hence, the results indicate that the genetic repertoires of the RMP species are indeed more common and that these genes should be present in the poorly assembled and annotated rodent parasite genomes.</p>
         <p>The construction of a rodent specific orthology map using <it>P. yoelii </it>as the reference index was undertaken to study and survey rodent parasite specific genes that are distinct from the common 'core' set. Although this process would filter off genes specific to the other two RMP species, the <it>P. yoelii </it>genome was chosen as it is the most completely annotated rodent parasite genome while the gene set in <it>P. chabaudi </it>is over-predicted thus making comparisons across species difficult. This associative table clearly demonstrates certain species-specific metabolic differences and reveals gene duplications and expansions from both the core set of <it>Plasmodia </it>genes and the rodent parasite specific genes. Similarly, hybridization data reveals more orthologous genes than the current known set.</p>
         <p>Due to the high AT-bias in the genome of the <it>Plasmodia </it>species, sequencing the entire genome, joining overlapping sequences to form contigs and gene predictions have been difficult. While the majority of the rodent malaria parasite genes have been sequenced and annotated, many gaps still remain and the data from the pan-rodent cross-genome oligonucleotide microarray provides direct experimental evidence for this case. While the genomes from <it>P. berghei </it>and <it>P. chabaudi </it>continues to be refined at the Sanger Institute, improvements on the sequence coverage of the <it>P. yoelii </it>genome has not been taken up by any group. Therefore, any improvements in genome coverage for <it>P. yoelii </it>would be extremely valuable.</p>
         <p>More importantly, the total number of orthologous gene pairs obtained in this survey (Table <tblr tid="T3">3</tblr>) is now significantly higher than the published dataset <abbrgrp><abbr bid="B25">25</abbr></abbrgrp> except for the <it>P. berghei </it>and <it>P. chabaudi </it>orthologous gene pair where they have been over-predicted. The total number of ortholgous pairs is calculated by the sum of the core genes together with the remaining <it>P. falciparum </it>- RMP gene permutations as well as any RMP specific genes. For example, for the <it>P. yoelii </it>vs. <it>P. falciparum </it>orthologous gene pair, the sum is 4,188 (core Pf-RMP) + 37 (Pf-Py) + 97 (Pf-Pb-Py) + 92 (Pf-Pc-Py) = 4,414. PCR screening also strongly suggests that a <it>P. falciparum </it>gene with at least a single RMP ortholog is also likely to contain orthologs from all three RMP species. Since these genes total up to 4,578, the potential maximum of orthologous genes will be computed using this assumption where the 4,188 core genes are replaced by the theoretical maximum of 4,578 genes. This work augments and complements the recent manual re-annotation of <it>P. yoelii </it>genes <abbrgrp><abbr bid="B26">26</abbr></abbrgrp> and the release of the dataset presented here is intended to provide a reference resource for researchers working on the in vivo rodent malaria species model, especially where predicted orthologs are currently unavailable. Data from both the core and rodent species specific gene datasets can now form the basis for investigating the differences in host cell selectivity and other pathophysiological traits such as sequestration observed between the different RMP. It is not surprising that the majority of the rodent species specific genes have no known function but by now having identified a relatively small number of genes, a more thorough investigation to qualitatively associate specific genes with species specific traits is feasible. On the other hand this study does not exclude that differences in the expression of conserved genes also contributes to species specific differences.</p>
         <tbl id="T3">
            <title>
               <p>Table 3</p>
            </title>
            <caption>
               <p>Orthologs between <it>P. falciparum </it>and three rodent malaria parasite species.</p>
            </caption>
            <tblbdy cols="7">
               <r>
                  <c>
                     <p/>
                  </c>
                  <c ca="center">
                     <p>
                        <it>
                           <b>Pb vs. Py</b>
                        </it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>
                           <b>Pb vs. Pc</b>
                        </it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>
                           <b>Pc vs. Py</b>
                        </it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>
                           <b>Py vs. Pf</b>
                        </it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>
                           <b>Pb vs. Pf</b>
                        </it>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <it>
                           <b>Pc vs. Pf</b>
                        </it>
                     </p>
                  </c>
               </r>
               <r>
                  <c cspan="7">
                     <hr/>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>'Core' set</p>
                  </c>
                  <c ca="center">
                     <p>4188</p>
                  </c>
                  <c ca="center">
                     <p>4188</p>
                  </c>
                  <c ca="center">
                     <p>4188</p>
                  </c>
                  <c ca="center">
                     <p>4188</p>
                  </c>
                  <c ca="center">
                     <p>4188</p>
                  </c>
                  <c ca="center">
                     <p>4188</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>&lt;3 RMP orthologs</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>226</p>
                  </c>
                  <c ca="center">
                     <p>234</p>
                  </c>
                  <c ca="center">
                     <p>215</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>RMP specific</p>
                  </c>
                  <c ca="center">
                     <p>1485</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>1448</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
                  <c ca="center">
                     <p>-</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>Total</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>5673</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>4188</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>5636</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>4414</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>4422</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>4403</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>
                        <b>Current set</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>3153</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>4641</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>3318</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>3375</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>3890</b>
                     </p>
                  </c>
                  <c ca="center">
                     <p>
                        <b>3842</b>
                     </p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Potential 'Core' set</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
               </r>
               <r>
                  <c ca="center">
                     <p>Potential total</p>
                  </c>
                  <c ca="center">
                     <p>6063</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>6026</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
                  <c ca="center">
                     <p>4578</p>
                  </c>
               </r>
            </tblbdy>
            <tblfn>
               <p>Summary table of orthologs between <it>P. falciparum </it>and the three rodent parasite species. The 'core' set represent orthologs common in all four species defined via hybridization and bioinformatics information. Other gene set combinations were taken from the remaining permutations of genes generated from those with at least 2 RMP orthologs as well as the rodent parasite specific genes. The total number of orthologous pairs is the sum of the core genes with the other respective gene permutations while the current set denotes the status quo <abbrgrp><abbr bid="B25">25</abbr></abbrgrp>. The potential total indicates the theoretical maximum of common genes as there is strong suggestion that a <it>P. falciparum </it>gene with at least one RMP ortholog can potentially be orthologous to all three RMP species.</p>
            </tblfn>
         </tbl>
         <p>Our approach here provides a significant improvement of gene coverage using existing information and well-established experimental and analysis techniques. The additional information provided here will be of particular use for future efforts using next-generation DNA sequencing technologies.</p>
      </sec>
      <sec>
         <st>
            <p>Conclusions</p>
         </st>
         <p>Unlike most previous studies that have focused on finding similarities between the genomes of different plasmodium species <abbrgrp><abbr bid="B3">3</abbr><abbr bid="B5">5</abbr><abbr bid="B21">21</abbr><abbr bid="B22">22</abbr><abbr bid="B25">25</abbr></abbrgrp> the work here for the first time attempts to define the difference in gene content between the different parasites. Currently, our understanding on the genetic basis for species specific restriction of malaria parasites is limited. This is mainly due to the incomplete status of the RMP genomes that do not allow reasonable assessment of their nuclear encoded proteomes. Conducting the cross species CGH and the reciprocal homology searches allowed us to generate substantially improved dataset that enables us for the first time to investigate the subsets of genes that may be involved in <it>Plasmodium </it>speciation. First, we identify 117 genes that are found in only the human and primate malaria species should be considered key candidates for separating rodent from primate malarias. Similarly the 1,238 RMP specific genes are likely to provide the biological framework that separates the rodent from primate malaria. How these mainly hypothetical genes function in the parasite biology now needs to be further investigated.</p>
         <p>In contrast to the relatively large numbers of genes that are unique to individual species (e.g. 927 gene unique to <it>P. falciparum</it>) or species subgoups (e.g. 1,238 specific to RMP), there is only a small number of genes that are lost from the generic core gene set 27 to 96 genes lost in one or two Plasmodium sp.). With the rodent parasites forming a distinct phylogenetic clade and genetic evidence that <it>P. vivax </it>and <it>P. knowlesi </it>are more similar to each other <abbrgrp><abbr bid="B27">27</abbr><abbr bid="B28">28</abbr></abbrgrp> this suggests that species specific gene expansion and diversification is the main driving force of speciation amongst <it>Plasmodium </it>species. This is likely to apply to genes responsible for invasion and antigenic variation, but also genes involved in other species specific phenotypes such as the asynchronous character of the intra-erythrocytic development of <it>P. yoelii</it>, the formation of hypnozoites in <it>P. vivax </it>and the propensity of sequestration.</p>
      </sec>
      <sec>
         <st>
            <p>Methods</p>
         </st>
         <sec>
            <st>
               <p>Microarray fabrication</p>
            </st>
            <p>A modification of the 'OligoRankPick' program <abbrgrp><abbr bid="B18">18</abbr></abbrgrp> was employed to design 60-mer probes to the three rodent parasite species <it>P. berghei, P. chabaudi </it>and <it>P. yoelii </it>(Additional file <supplr sid="S8">8</supplr>: Supplemental Table S8; Additional file <supplr sid="S9">9</supplr> - Supplementary Table S9) using sequences deposited in PlasmoDB <url>http://www.plasmodb.org</url>. Cross-species oligonucleotides were selected based on the criteria as described in the results section, i.e. at least 90% homology to the target sequence, less that 37.5% to non-target sequences and capped with a 5% difference tolerance in GC content. All possible oligonucleotides for a particular gene was then ranked and sorted based on target (>90%) and non-target hit (&lt;37.5%), GC content (&#177;5%) and the best Smith-Waterman score (Figure <figr fid="F1">1</figr>). Hence, all possible oligonucleotides were evaluated based on every available gene model and the best sequence was then selected. Oligonucleotides were spotted onto poly-L-lysine-coated microscopic glass slides <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>.</p>
            <suppl id="S8">
               <title>
                  <p>Additional file 8</p>
               </title>
               <text>
                  <p><b>Supplemental Table S8</b>. Primer sequences.</p>
               </text>
               <file name="1471-2164-11-128-S8.XLSX">
                  <p>Click here for file</p>
               </file>
            </suppl>
            <suppl id="S9">
               <title>
                  <p>Additional file 9</p>
               </title>
               <text>
                  <p><b>Supplemental Table S9</b>. Oligonucleotide probe sequences.</p>
               </text>
               <file name="1471-2164-11-128-S9.XLS">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Rodent parasite DNA preparation for genomic DNA microarray</p>
            </st>
            <p>Balb/c mice were used as vertebrate hosts for the propagation of <it>P. berghei </it>ANKA, <it>P. chabaudi </it>AS and <it>P. yoelii </it>17 &#215; 1.1. All procedures were in accordance with the guidelines for the use of experimental animals established by the Institutional Animal Care and Use Committee (IACUC) of the Nanyang Technological University of Singapore. Leukocytes were filtered from whole blood <abbrgrp><abbr bid="B29">29</abbr></abbrgrp> and gDNA extracted using the Easy-DNA&#8482; kit (Invitrogen) according to the manufacturer's protocol. For each parasite line, 3 &#956;g of DNA was mixed with 2.5 &#956;g of random nanomer primer to a volume of 15.25 &#956;l. DNA was denatured at 100&#176;C for 5 min and snap cooled on ice for 5 min. Labeled DNA was generated by adding dNTPs to a final concentration of 1 mM dATP and 500 fM each: dCTP, dGTP, dTTP and 5-(3-aminoallyl)&#172;2'-deoxyuridine-5'-triophosphate, (aa-dUTP) (Biotium), with 20 units of exo-Klenow Fragment (New England Biolabs) in a total volume of 50 &#956;l and incubated at room temperature for 10 min and then overnight at 37&#176;C. Cy-dye coupling, cleanup, hybridization, washing and slide scanning were performed as described by Bozdech and co-workers <abbrgrp><abbr bid="B15">15</abbr></abbrgrp>. Experiments were performed with at least duplicates for each species Subsequently, the data were normalized using the NOMAD microarray database <url>http://ucsf-nomad.sourceforge.net/</url>. For the comparative genome analysis, low quality features and features with a signal level less than two-fold of the background plus two-fold of the background standard deviation was filtered from the initial raw data set.</p>
         </sec>
         <sec>
            <st>
               <p>Polymerase Chain Reaction (PCR)</p>
            </st>
            <p>Verification of genes was performed via PCR thermocycling (Eppendorf) using the following conditions: 1 cycle of 95&#176;C for 2 min; followed by 35 cycles of 95&#176;C for 1 min, 50&#176;C for 1 min and 68&#176;C for 30 s. A final extension was set for 10 min at 68&#176;C and then the tubes were kept at 4&#176;C. For each 20 &#956;l reaction, 2 ng of DNA was used as a template with 10 pmol of the forward and reverse primers (Additional file <supplr sid="S10">10</supplr>: Supplemental Table S10), 200 &#956;M of dNTPs with 1.5 units of Taq DNA polymerase (Kapa Biosystems).</p>
            <suppl id="S10">
               <title>
                  <p>Additional file 10</p>
               </title>
               <text>
                  <p><b>Supplemental Table S10</b>. Associative table where predicted rodent parasite gene models are matched with their respective oligonucleotide probes.</p>
               </text>
               <file name="1471-2164-11-128-S10.XLSX">
                  <p>Click here for file</p>
               </file>
            </suppl>
         </sec>
         <sec>
            <st>
               <p>Bioinformatics tools</p>
            </st>
            <p>The sequences of <it>P. falciparum, P. knowlesi </it>and <it>P. vivax </it>were obtained from PlasmoDB <url>http://www.plasmodb.org</url>. For any gene missing an orthologous partner in another species even after complementation with the array data, known orthologs were appended using the resource available at PlasmoDB (version 5.5). Unannotated genes were identified using the tBLASTn search where amino acid sequences from a well-annotated species (PlasmoDB version 5.5) are used to query the genome of another species. A threshold of 10<sup>-15 </sup>was used to ensure the stringency of locating matching genes.</p>
         </sec>
         <sec>
            <st>
               <p>Availability of microarray data</p>
            </st>
            <p>Microarray data are publically available at the Centre for Information Biology Gene Expression Database (CIBEX; <url>http://cibex.nig.ac.jp</url>). The Accession number for these data is CBX114.</p>
         </sec>
      </sec>
      <sec>
         <st>
            <p>Authors' contributions</p>
         </st>
         <p>KL designed and carried out the experiments and analyzed the data and helped with the writing of the manuscript. GH designed the microarray and helped with the analysis of the array data. ZB helped with the design of this study and helped in the writing of the manuscript. PRP designed and supervised the study and wrote the manuscript. All authors read and approved the final manuscript.</p>
      </sec>
   </bdy>
   <bm>
      <ack>
         <sec>
            <st>
               <p>Acknowledgements</p>
            </st>
            <p>This work was funded by a grant of the National Medical Research Council of Singapore.</p>
         </sec>
      </ack>
      <refgrp>
         <bibl id="B1">
            <title>
               <p>The global distribution of clinical episodes of Plasmodium falciparum malaria</p>
            </title>
            <aug>
               <au>
                  <snm>Snow</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Guerra</snm>
                  <fnm>CA</fnm>
               </au>
               <au>
                  <snm>Noor</snm>
                  <fnm>AM</fnm>
               </au>
               <au>
                  <snm>Myint</snm>
                  <fnm>HY</fnm>
               </au>
               <au>
                  <snm>Hay</snm>
                  <fnm>SI</fnm>
               </au>
            </aug>
            <source>Nature</source>
            <pubdate>2005</pubdate>
            <volume>434</volume>
            <issue>7030</issue>
            <fpage>214</fpage>
            <lpage>217</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature03342</pubid>
                  <pubid idtype="pmpid" link="fulltext">15759000</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B2">
            <title>
               <p>Plasmodia of rodents</p>
            </title>
            <aug>
               <au>
                  <snm>Carter</snm>
                  <fnm>R</fnm>
               </au>
               <au>
                  <snm>Diggs</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Parasitic Protozoa</source>
            <publisher>New York: Academic Press</publisher>
            <editor>Kreier J</editor>
            <pubdate>1977</pubdate>
            <volume>3</volume>
            <fpage>359</fpage>
            <lpage>465</lpage>
         </bibl>
         <bibl id="B3">
            <title>
               <p>Genome sequence and comparative analysis of the model rodent malaria parasite Plasmodium yoelii yoelii</p>
            </title>
            <aug>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Angiuoli</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Suh</snm>
                  <fnm>BB</fnm>
               </au>
               <au>
                  <snm>Kooij</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Pertea</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Ermolaeva</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Allen</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Selengut</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Koo</snm>
                  <fnm>HL</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <issue>6906</issue>
            <fpage>512</fpage>
            <lpage>519</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01099</pubid>
                  <pubid idtype="pmpid" link="fulltext">12368865</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B4">
            <title>
               <p>Orthology between the genomes of Plasmodium falciparum and rodent malaria parasites: possible practical applications</p>
            </title>
            <aug>
               <au>
                  <snm>Waters</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>Philos Trans R Soc Lond B Biol Sci</source>
            <pubdate>2002</pubdate>
            <volume>357</volume>
            <issue>1417</issue>
            <fpage>55</fpage>
            <lpage>63</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1098/rstb.2001.1011</pubid>
                  <pubid idtype="pmcid">1692921</pubid>
                  <pubid idtype="pmpid" link="fulltext">11839182</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B5">
            <title>
               <p>A Plasmodium whole-genome synteny map: indels and synteny breakpoints as foci for species-specific genes</p>
            </title>
            <aug>
               <au>
                  <snm>Kooij</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Bidwell</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Ramesar</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Janse</snm>
                  <fnm>CJ</fnm>
               </au>
               <au>
                  <snm>Waters</snm>
                  <fnm>AP</fnm>
               </au>
            </aug>
            <source>PLoS Pathog</source>
            <pubdate>2005</pubdate>
            <volume>1</volume>
            <issue>4</issue>
            <fpage>e44</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.ppat.0010044</pubid>
                  <pubid idtype="pmcid">1317653,1317653</pubid>
                  <pubid idtype="pmpid" link="fulltext">16389297</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B6">
            <title>
               <p>STEVOR--a multifunctional protein?</p>
            </title>
            <aug>
               <au>
                  <snm>Blythe</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Surentheran</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Preiser</snm>
                  <fnm>PR</fnm>
               </au>
            </aug>
            <source>Molecular and biochemical parasitology</source>
            <pubdate>2004</pubdate>
            <volume>134</volume>
            <issue>1</issue>
            <fpage>11</fpage>
            <lpage>15</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molbiopara.2003.09.011</pubid>
                  <pubid idtype="pmpid" link="fulltext">14747138</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B7">
            <title>
               <p>Antigenic variation at the infected red cell surface in malaria</p>
            </title>
            <aug>
               <au>
                  <snm>Kyes</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Horrocks</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Newbold</snm>
                  <fnm>C</fnm>
               </au>
            </aug>
            <source>Annu Rev Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>55</volume>
            <fpage>673</fpage>
            <lpage>707</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1146/annurev.micro.55.1.673</pubid>
                  <pubid idtype="pmpid" link="fulltext">11544371</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B8">
            <title>
               <p>A large gene family for putative variant antigens shared by human and rodent malaria parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Janssen</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>RS</fnm>
               </au>
            </aug>
            <source>Proc R Soc Lond B Biol Sci</source>
            <pubdate>2002</pubdate>
            <volume>269</volume>
            <issue>1489</issue>
            <fpage>431</fpage>
            <lpage>436</lpage>
            <xrefbib>
               <pubid idtype="doi">10.1098/rspb.2001.1903</pubid>
            </xrefbib>
         </bibl>
         <bibl id="B9">
            <title>
               <p>Plasmodium interspersed repeats: the major multigene superfamily of malaria parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Janssen</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Phillips</snm>
                  <fnm>RS</fnm>
               </au>
               <au>
                  <snm>Turner</snm>
                  <fnm>CM</fnm>
               </au>
               <au>
                  <snm>Barrett</snm>
                  <fnm>MP</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2004</pubdate>
            <volume>32</volume>
            <issue>19</issue>
            <fpage>5712</fpage>
            <lpage>5720</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/gkh907</pubid>
                  <pubid idtype="pmcid">528792</pubid>
                  <pubid idtype="pmpid" link="fulltext">15507685</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B10">
            <title>
               <p>The phylogeny of rodent malaria parasites: simultaneous analysis across three genomes</p>
            </title>
            <aug>
               <au>
                  <snm>Perkins</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Sarkar</snm>
                  <fnm>IN</fnm>
               </au>
               <au>
                  <snm>Carter</snm>
                  <fnm>R</fnm>
               </au>
            </aug>
            <source>Infect Genet Evol</source>
            <pubdate>2007</pubdate>
            <volume>7</volume>
            <issue>1</issue>
            <fpage>74</fpage>
            <lpage>83</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.meegid.2006.04.005</pubid>
                  <pubid idtype="pmpid" link="fulltext">16765106</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B11">
            <title>
               <p>Genome sequence of the human malaria parasite Plasmodium falciparum</p>
            </title>
            <aug>
               <au>
                  <snm>Gardner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Fung</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>O</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Hyman</snm>
                  <fnm>RW</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Nelson</snm>
                  <fnm>KE</fnm>
               </au>
               <au>
                  <snm>Bowman</snm>
                  <fnm>S</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2002</pubdate>
            <volume>419</volume>
            <issue>6906</issue>
            <fpage>498</fpage>
            <lpage>511</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature01097</pubid>
                  <pubid idtype="pmpid" link="fulltext">12368864</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B12">
            <title>
               <p>Comparative genotyping of the Saccharomyces cerevisiae laboratory strains S288C and CEN.PK113-7D using oligonucleotide microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Daran-Lapujade</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Daran</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Kotter</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Petit</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Piper</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Pronk</snm>
                  <fnm>JT</fnm>
               </au>
            </aug>
            <source>FEMS Yeast Res</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>3</issue>
            <fpage>259</fpage>
            <lpage>269</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/S1567-1356(03)00156-9</pubid>
                  <pubid idtype="pmpid" link="fulltext">14654430</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B13">
            <title>
               <p>Genomic interspecies microarray hybridization: rapid discovery of three thousand genes in the maize endophyte, Klebsiella pneumoniae 342, by microarray hybridization with Escherichia coli K-12 open reading frames</p>
            </title>
            <aug>
               <au>
                  <snm>Dong</snm>
                  <fnm>Y</fnm>
               </au>
               <au>
                  <snm>Glasner</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Blattner</snm>
                  <fnm>FR</fnm>
               </au>
               <au>
                  <snm>Triplett</snm>
                  <fnm>EW</fnm>
               </au>
            </aug>
            <source>Appl Environ Microbiol</source>
            <pubdate>2001</pubdate>
            <volume>67</volume>
            <issue>4</issue>
            <fpage>1911</fpage>
            <lpage>1921</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1128/AEM.67.4.1911-1921.2001</pubid>
                  <pubid idtype="pmcid">92813</pubid>
                  <pubid idtype="pmpid" link="fulltext">11282649</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B14">
            <title>
               <p>Patterns of large-scale genomic variation in virulent and avirulent Burkholderia species</p>
            </title>
            <aug>
               <au>
                  <snm>Ong</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Ooi</snm>
                  <fnm>CH</fnm>
               </au>
               <au>
                  <snm>Wang</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Chong</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Ng</snm>
                  <fnm>KC</fnm>
               </au>
               <au>
                  <snm>Rodrigues</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Lee</snm>
                  <fnm>MA</fnm>
               </au>
               <au>
                  <snm>Tan</snm>
                  <fnm>P</fnm>
               </au>
            </aug>
            <source>Genome Res</source>
            <pubdate>2004</pubdate>
            <volume>14</volume>
            <issue>11</issue>
            <fpage>2295</fpage>
            <lpage>2307</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1101/gr.1608904</pubid>
                  <pubid idtype="pmcid">525689</pubid>
                  <pubid idtype="pmpid" link="fulltext">15520292</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B15">
            <title>
               <p>Expression profiling of the schizont and trophozoite stages of Plasmodium falciparum with a long-oligonucleotide microarray</p>
            </title>
            <aug>
               <au>
                  <snm>Bozdech</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Joachimiak</snm>
                  <fnm>MP</fnm>
               </au>
               <au>
                  <snm>Cohen</snm>
                  <fnm>FE</fnm>
               </au>
               <au>
                  <snm>Pulliam</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>DeRisi</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>Genome Biol</source>
            <pubdate>2003</pubdate>
            <volume>4</volume>
            <issue>2</issue>
            <fpage>R9</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/gb-2003-4-2-r9</pubid>
                  <pubid idtype="pmcid">151308</pubid>
                  <pubid idtype="pmpid" link="fulltext">12620119</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B16">
            <title>
               <p>Expression profiling using microarrays fabricated by an ink-jet oligonucleotide synthesizer</p>
            </title>
            <aug>
               <au>
                  <snm>Hughes</snm>
                  <fnm>TR</fnm>
               </au>
               <au>
                  <snm>Mao</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jones</snm>
                  <fnm>AR</fnm>
               </au>
               <au>
                  <snm>Burchard</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Marton</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Shannon</snm>
                  <fnm>KW</fnm>
               </au>
               <au>
                  <snm>Lefkowitz</snm>
                  <fnm>SM</fnm>
               </au>
               <au>
                  <snm>Ziman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Schelter</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Meyer</snm>
                  <fnm>MR</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nat Biotechnol</source>
            <pubdate>2001</pubdate>
            <volume>19</volume>
            <issue>4</issue>
            <fpage>342</fpage>
            <lpage>347</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/86730</pubid>
                  <pubid idtype="pmpid" link="fulltext">11283592</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B17">
            <title>
               <p>Assessment of the sensitivity and specificity of oligonucleotide (50 mer) microarrays</p>
            </title>
            <aug>
               <au>
                  <snm>Kane</snm>
                  <fnm>MD</fnm>
               </au>
               <au>
                  <snm>Jatkoe</snm>
                  <fnm>TA</fnm>
               </au>
               <au>
                  <snm>Stumpf</snm>
                  <fnm>CR</fnm>
               </au>
               <au>
                  <snm>Lu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Thomas</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Madore</snm>
                  <fnm>SJ</fnm>
               </au>
            </aug>
            <source>Nucleic Acids Res</source>
            <pubdate>2000</pubdate>
            <volume>28</volume>
            <issue>22</issue>
            <fpage>4552</fpage>
            <lpage>4557</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/nar/28.22.4552</pubid>
                  <pubid idtype="pmcid">113865</pubid>
                  <pubid idtype="pmpid" link="fulltext">11071945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B18">
            <title>
               <p>Selection of long oligonucleotides for gene expression microarrays using weighted rank-sum strategy</p>
            </title>
            <aug>
               <au>
                  <snm>Hu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Llinas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Preiser</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Bozdech</snm>
                  <fnm>Z</fnm>
               </au>
            </aug>
            <source>BMC bioinformatics</source>
            <pubdate>2007</pubdate>
            <volume>8</volume>
            <fpage>350</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1186/1471-2105-8-350</pubid>
                  <pubid idtype="pmcid">2099447</pubid>
                  <pubid idtype="pmpid" link="fulltext">17880708</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B19">
            <title>
               <p>The transcriptome of the intraerythrocytic developmental cycle of Plasmodium falciparum</p>
            </title>
            <aug>
               <au>
                  <snm>Bozdech</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Llinas</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Pulliam</snm>
                  <fnm>BL</fnm>
               </au>
               <au>
                  <snm>Wong</snm>
                  <fnm>ED</fnm>
               </au>
               <au>
                  <snm>Zhu</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>DeRisi</snm>
                  <fnm>JL</fnm>
               </au>
            </aug>
            <source>PLoS Biol</source>
            <pubdate>2003</pubdate>
            <volume>1</volume>
            <issue>1</issue>
            <fpage>E5</fpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1371/journal.pbio.0000005</pubid>
                  <pubid idtype="pmcid">176545</pubid>
                  <pubid idtype="pmpid" link="fulltext">12929205</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B20">
            <title>
               <p>The transcriptome of Plasmodium vivax reveals divergence and diversity of transcriptional regulation in malaria parasites</p>
            </title>
            <aug>
               <au>
                  <snm>Bozdech</snm>
                  <fnm>Z</fnm>
               </au>
               <au>
                  <snm>Mok</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Hu</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Imwong</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Jaidee</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Russell</snm>
                  <fnm>B</fnm>
               </au>
               <au>
                  <snm>Ginsburg</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Nosten</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Day</snm>
                  <fnm>NP</fnm>
               </au>
               <au>
                  <snm>White</snm>
                  <fnm>NJ</fnm>
               </au>
               <etal/>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>2008</pubdate>
            <volume>105</volume>
            <issue>42</issue>
            <fpage>16290</fpage>
            <lpage>16295</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.0807404105</pubid>
                  <pubid idtype="pmcid">2571024</pubid>
                  <pubid idtype="pmpid" link="fulltext">18852452</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B21">
            <title>
               <p>Comparative genomics of the neglected human malaria parasite Plasmodium vivax</p>
            </title>
            <aug>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Adams</snm>
                  <fnm>JH</fnm>
               </au>
               <au>
                  <snm>Silva</snm>
                  <fnm>JC</fnm>
               </au>
               <au>
                  <snm>Bidwell</snm>
                  <fnm>SL</fnm>
               </au>
               <au>
                  <snm>Lorenzi</snm>
                  <fnm>H</fnm>
               </au>
               <au>
                  <snm>Caler</snm>
                  <fnm>E</fnm>
               </au>
               <au>
                  <snm>Crabtree</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Angiuoli</snm>
                  <fnm>SV</fnm>
               </au>
               <au>
                  <snm>Merino</snm>
                  <fnm>EF</fnm>
               </au>
               <au>
                  <snm>Amedeo</snm>
                  <fnm>P</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>455</volume>
            <issue>7214</issue>
            <fpage>757</fpage>
            <lpage>763</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07327</pubid>
                  <pubid idtype="pmcid">2651158</pubid>
                  <pubid idtype="pmpid" link="fulltext">18843361</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B22">
            <title>
               <p>The genome of the simian and human malaria parasite Plasmodium knowlesi</p>
            </title>
            <aug>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bohme</snm>
                  <fnm>U</fnm>
               </au>
               <au>
                  <snm>Berry</snm>
                  <fnm>AE</fnm>
               </au>
               <au>
                  <snm>Mungall</snm>
                  <fnm>K</fnm>
               </au>
               <au>
                  <snm>Finn</snm>
                  <fnm>RD</fnm>
               </au>
               <au>
                  <snm>Jackson</snm>
                  <fnm>AP</fnm>
               </au>
               <au>
                  <snm>Mourier</snm>
                  <fnm>T</fnm>
               </au>
               <au>
                  <snm>Mistry</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Pasini</snm>
                  <fnm>EM</fnm>
               </au>
               <au>
                  <snm>Aslett</snm>
                  <fnm>MA</fnm>
               </au>
               <etal/>
            </aug>
            <source>Nature</source>
            <pubdate>2008</pubdate>
            <volume>455</volume>
            <issue>7214</issue>
            <fpage>799</fpage>
            <lpage>803</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1038/nature07306</pubid>
                  <pubid idtype="pmcid">2656934</pubid>
                  <pubid idtype="pmpid" link="fulltext">18843368</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B23">
            <title>
               <p>The pir multigene family of Plasmodium: antigenic variation and beyond</p>
            </title>
            <aug>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Lawton</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Jarra</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Preiser</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Langhorne</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Molecular and biochemical parasitology</source>
            <pubdate>2010</pubdate>
            <volume>170</volume>
            <issue>2</issue>
            <fpage>65</fpage>
            <lpage>73</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1016/j.molbiopara.2009.12.010</pubid>
                  <pubid idtype="pmpid" link="fulltext">20045030</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B24">
            <title>
               <p>Host immunity modulates transcriptional changes in a multigene family (yir) of rodent malaria</p>
            </title>
            <aug>
               <au>
                  <snm>Cunningham</snm>
                  <fnm>DA</fnm>
               </au>
               <au>
                  <snm>Jarra</snm>
                  <fnm>W</fnm>
               </au>
               <au>
                  <snm>Koernig</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Fonager</snm>
                  <fnm>J</fnm>
               </au>
               <au>
                  <snm>Fernandez-Reyes</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Blythe</snm>
                  <fnm>JE</fnm>
               </au>
               <au>
                  <snm>Waller</snm>
                  <fnm>C</fnm>
               </au>
               <au>
                  <snm>Preiser</snm>
                  <fnm>PR</fnm>
               </au>
               <au>
                  <snm>Langhorne</snm>
                  <fnm>J</fnm>
               </au>
            </aug>
            <source>Mol Microbiol</source>
            <pubdate>2005</pubdate>
            <volume>58</volume>
            <issue>3</issue>
            <fpage>636</fpage>
            <lpage>647</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1111/j.1365-2958.2005.04840.x</pubid>
                  <pubid idtype="pmpid" link="fulltext">16238615</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B25">
            <title>
               <p>A comprehensive survey of the Plasmodium life cycle by genomic, transcriptomic, and proteomic analyses</p>
            </title>
            <aug>
               <au>
                  <snm>Hall</snm>
                  <fnm>N</fnm>
               </au>
               <au>
                  <snm>Karras</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Raine</snm>
                  <fnm>JD</fnm>
               </au>
               <au>
                  <snm>Carlton</snm>
                  <fnm>JM</fnm>
               </au>
               <au>
                  <snm>Kooij</snm>
                  <fnm>TW</fnm>
               </au>
               <au>
                  <snm>Berriman</snm>
                  <fnm>M</fnm>
               </au>
               <au>
                  <snm>Florens</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Janssen</snm>
                  <fnm>CS</fnm>
               </au>
               <au>
                  <snm>Pain</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Christophides</snm>
                  <fnm>GK</fnm>
               </au>
               <etal/>
            </aug>
            <source>Science</source>
            <pubdate>2005</pubdate>
            <volume>307</volume>
            <issue>5706</issue>
            <fpage>82</fpage>
            <lpage>86</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1126/science.1103717</pubid>
                  <pubid idtype="pmpid" link="fulltext">15637271</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B26">
            <title>
               <p>Assessment and improvement of the Plasmodium yoelii yoelii genome annotation through comparative analysis</p>
            </title>
            <aug>
               <au>
                  <snm>Vaughan</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Chiu</snm>
                  <fnm>SY</fnm>
               </au>
               <au>
                  <snm>Ramasamy</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Li</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Gardner</snm>
                  <fnm>MJ</fnm>
               </au>
               <au>
                  <snm>Tarun</snm>
                  <fnm>AS</fnm>
               </au>
               <au>
                  <snm>Kappe</snm>
                  <fnm>SH</fnm>
               </au>
               <au>
                  <snm>Peng</snm>
                  <fnm>X</fnm>
               </au>
            </aug>
            <source>Bioinformatics</source>
            <pubdate>2008</pubdate>
            <volume>24</volume>
            <issue>13</issue>
            <fpage>i383</fpage>
            <lpage>389</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/bioinformatics/btn140</pubid>
                  <pubid idtype="pmcid">2718618</pubid>
                  <pubid idtype="pmpid" link="fulltext">18586738</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B27">
            <title>
               <p>Phylogeny of the malarial genus Plasmodium, derived from rRNA gene sequences</p>
            </title>
            <aug>
               <au>
                  <snm>Escalante</snm>
                  <fnm>AA</fnm>
               </au>
               <au>
                  <snm>Ayala</snm>
                  <fnm>FJ</fnm>
               </au>
            </aug>
            <source>Proceedings of the National Academy of Sciences of the United States of America</source>
            <pubdate>1994</pubdate>
            <volume>91</volume>
            <issue>24</issue>
            <fpage>11373</fpage>
            <lpage>11377</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1073/pnas.91.24.11373</pubid>
                  <pubid idtype="pmcid">45233</pubid>
                  <pubid idtype="pmpid">7972067</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B28">
            <title>
               <p>Origins of human malaria: rare genomic changes and full mitochondrial genomes confirm the relationship of Plasmodium falciparum to other mammalian parasites but complicate the origins of Plasmodium vivax</p>
            </title>
            <aug>
               <au>
                  <snm>Roy</snm>
                  <fnm>SW</fnm>
               </au>
               <au>
                  <snm>Irimia</snm>
                  <fnm>M</fnm>
               </au>
            </aug>
            <source>Mol Biol Evol</source>
            <pubdate>2008</pubdate>
            <volume>25</volume>
            <issue>6</issue>
            <fpage>1192</fpage>
            <lpage>1198</lpage>
            <xrefbib>
               <pubidlist>
                  <pubid idtype="doi">10.1093/molbev/msn069</pubid>
                  <pubid idtype="pmcid">2386083</pubid>
                  <pubid idtype="pmpid" link="fulltext">18359945</pubid>
               </pubidlist>
            </xrefbib>
         </bibl>
         <bibl id="B29">
            <title>
               <p>Quantitative real-time polymerase chain reaction for malaria diagnosis and its use in malaria vaccine clinical trials</p>
            </title>
            <aug>
               <au>
                  <snm>Andrews</snm>
                  <fnm>L</fnm>
               </au>
               <au>
                  <snm>Andersen</snm>
                  <fnm>RF</fnm>
               </au>
               <au>
                  <snm>Webster</snm>
                  <fnm>D</fnm>
               </au>
               <au>
                  <snm>Dunachie</snm>
                  <fnm>S</fnm>
               </au>
               <au>
                  <snm>Walther</snm>
                  <fnm>RM</fnm>
               </au>
               <au>
                  <snm>Bejon</snm>
                  <fnm>P</fnm>
               </au>
               <au>
                  <snm>Hunt-Cooke</snm>
                  <fnm>A</fnm>
               </au>
               <au>
                  <snm>Bergson</snm>
                  <fnm>G</fnm>
               </au>
               <au>
                  <snm>Sanderson</snm>
                  <fnm>F</fnm>
               </au>
               <au>
                  <snm>Hill</snm>
                  <fnm>AV</fnm>
               </au>
               <etal/>
            </aug>
            <source>Am J Trop Med Hyg</source>
            <pubdate>2005</pubdate>
            <volume>73</volume>
            <issue>1</issue>
            <fpage>191</fpage>
            <lpage>198</lpage>
            <xrefbib>
               <pubid idtype="pmpid" link="fulltext">16014857</pubid>
            </xrefbib>
         </bibl>
      </refgrp>
   </bm>
</art>

